Commit Graph

6341 Commits

Author SHA1 Message Date
Martin Kroeker
d9894f45d3 Define sbgemm_r to fix DYNAMIC_ARCH builds 2022-02-25 10:04:00 +01:00
Martin Kroeker
522f809825 Merge pull request #3542 from martin-frbg/issue3540
Fix compilation for CooperLake on Windows/clang
2022-02-24 00:00:00 +01:00
Martin Kroeker
d50287fa5b Merge pull request #3544 from giordano/mg/gcc6
Fix compilation of Skylake AVX512 kernels with GCC 6
2022-02-23 23:57:57 +01:00
Mosè Giordano
abbc947edb Fix compilation of Skylake AVX512 kernels with GCC 6 2022-02-23 22:51:59 +00:00
Martin Kroeker
f2f0e1287b Merge pull request #3541 from martin-frbg/issue3530
Fix compilation for SkylakeX with gcc 6.x
2022-02-23 23:13:53 +01:00
Martin Kroeker
c62f8e2c01 Prevent compiler attempts to use k0 as mask register 2022-02-23 20:12:20 +01:00
Martin Kroeker
80eb581c83 Fix non-portable u_int64_t 2022-02-23 20:10:59 +01:00
Martin Kroeker
73ffabe6ba Guard uses of _mm512_reduce_add_p? 2022-02-23 20:06:14 +01:00
Martin Kroeker
5ad66f0e96 Merge pull request #3537 from xianyi/release-0.3.0
Merge back from 0.3.20 release to copy tag
2022-02-21 06:57:27 +01:00
Martin Kroeker
0b678b19dc Update version to 0.3.20 v0.3.20 2022-02-20 22:35:05 +01:00
Martin Kroeker
15ff556862 Merge pull request #3536 from xianyi/develop
Update from develop for release 0.3.20
2022-02-20 22:33:59 +01:00
Martin Kroeker
1564b632ad Merge branch 'release-0.3.0' into develop 2022-02-20 22:33:45 +01:00
Martin Kroeker
dec53e0ca2 Update version to 0.3.20 2022-02-20 22:30:50 +01:00
Martin Kroeker
c3f8de7923 Merge pull request #3535 from martin-frbg/0320changes
Update with 0.3.20 changes
2022-02-20 22:21:02 +01:00
Martin Kroeker
c352ac0ae3 Update with 0.3.20 changes 2022-02-20 22:16:04 +01:00
Martin Kroeker
77433af83e Merge pull request #3532 from martin-frbg/issue3528-2
Fix building a shared library on Mac with flang-classic
2022-02-11 11:44:32 +01:00
Martin Kroeker
db7a03dd4c keep flang-classic on MacOS from trying to create an executable instead of a library 2022-02-10 23:04:45 +01:00
Martin Kroeker
0e04710099 filter out libflangmain as well 2022-02-10 23:03:05 +01:00
Martin Kroeker
dc80925c92 Merge pull request #3531 from martin-frbg/issue2973
Add .NOTPARALLEL: to MATGEN Makefile as a workaround for builds on DFS
2022-02-10 14:16:08 +01:00
Martin Kroeker
e2bf3f31a6 Add .NOTPARALLEL: as a workaround for builds on DFS 2022-02-09 22:09:25 +01:00
Martin Kroeker
92d243fee3 Merge pull request #3527 from martin-frbg/issue3490
Treat AVX512-enabled Alder Lake like Cooper Lake/Sapphire Rapids
2022-02-07 08:14:11 +01:00
Martin Kroeker
fa3e9f25e6 Support AVX512-enabled Alder Lake 2022-02-07 00:00:56 +01:00
Martin Kroeker
f7e8f9ec57 Support AVX512-enabled AlderLake 2022-02-07 00:00:15 +01:00
Martin Kroeker
7656aba00e Merge pull request #3493 from martin-frbg/casts+cleanup
WIP casts and cleanups
2022-02-06 23:55:06 +01:00
Martin Kroeker
aec32e5bd4 Update azure-pipelines.yml 2022-02-05 22:39:03 +01:00
Martin Kroeker
3007ca6371 Merge pull request #3524 from martin-frbg/lapack646
Fix input argument check in ?GEQRT2 (from Reference-LAPACK PR 646)
2022-02-03 22:31:23 +01:00
Martin Kroeker
a3eea3e127 Fix input argument check (LAPACK PR 646) 2022-02-03 11:43:17 +01:00
Martin Kroeker
b212577e50 Merge pull request #3521 from martin-frbg/issue3520
Add proper defaults for Sparc IMIN/IMAX
2022-01-28 13:39:36 +01:00
Martin Kroeker
63483ba0ff Merge pull request #3522 from martin-frbg/issue3517
Disable building C/Z SPMV,SPR,SYMV,SYR when NO_LAPACK=1
2022-01-28 10:36:57 +01:00
Martin Kroeker
d2b5fbf80f Exclude some complex (LAPACK) functions when NO_LAPACK is set 2022-01-27 22:02:08 +01:00
Martin Kroeker
7f0b11fbc1 Exclude some complex drivers when NO_LAPACK is set 2022-01-27 22:00:39 +01:00
Martin Kroeker
addc2a7aaa Add proper defaults for IMIN/IMAX 2022-01-27 19:56:32 +01:00
Martin Kroeker
204e021515 Merge pull request #3518 from martin-frbg/elbrus
Add basic support for the (mostly x86_64 compatible) Elbrus E2000 architecture
2022-01-25 20:57:59 +01:00
Martin Kroeker
b0d39349f9 Merge pull request #3516 from mmuetzel/no-fortran
cmake: Check if Fortran compiler is usable before enabling it.
2022-01-25 20:57:38 +01:00
Martin Kroeker
5d24f3d210 Update CONTRIBUTORS.md 2022-01-22 19:09:00 +01:00
Martin Kroeker
66a15e15a8 Update CONTRIBUTORS.md 2022-01-22 19:02:57 +01:00
Martin Kroeker
299d4d70a3 Add default KERNEL file for Elbrus E2K arch 2022-01-22 18:59:36 +01:00
Martin Kroeker
3492bea602 Create Makefile 2022-01-22 18:57:28 +01:00
Martin Kroeker
898cf5faf3 Add Elbrus e2k architecture support 2022-01-22 18:55:10 +01:00
Martin Kroeker
bc93f468ef Add Elbrus E2000 architecture as generic x86_64 compatible 2022-01-22 18:53:38 +01:00
Martin Kroeker
1937b4e435 Add Elbrus e2k architecture detection 2022-01-22 18:27:38 +01:00
Markus Mützel
00f44bfff7 cmake: Check if Fortran compiler is usable before enabling it. 2022-01-21 13:27:17 +01:00
Martin Kroeker
c1c0d5ce1d Merge pull request #3492 from binebrank/arm_sve_zgemm
SVE zgemm&cgemm (and other BLAS 3 complex)
2022-01-18 21:36:33 +01:00
Bine Brank
19d435b1b3 update armv8sve + contributors 2022-01-18 08:28:31 +01:00
Bine Brank
f158d59087 adapt CMake 2022-01-17 22:36:48 +01:00
Martin Kroeker
8ac2c1daf0 Merge pull request #3514 from martin-frbg/issue3513
Fix ?LASWP pivot index calculation for negative increments other than -1
2022-01-17 19:22:18 +01:00
Martin Kroeker
40003f8edb Fix pivot offset calculation for negative incx 2022-01-17 00:11:18 +01:00
Martin Kroeker
57e2a72f40 Fix pivot offset calculation for negative incx 2022-01-17 00:10:21 +01:00
Martin Kroeker
3b6293f5a0 Fix offset calculation for negative incx 2022-01-17 00:09:14 +01:00
Martin Kroeker
afa0cece5c Fix pivot offset calculation for negative incx 2022-01-17 00:08:20 +01:00