Commit Graph

7265 Commits

Author SHA1 Message Date
Martin Kroeker ada9e442eb
Add Apple M1 build using gcc,gmake and OpenMP 2023-07-17 23:13:56 +02:00
Martin Kroeker 81228fc586
Merge pull request #4147 from martin-frbg/aldern
Support Alder Lake N (family 6 exmodel 11 model 14) as Haswell
2023-07-17 09:11:23 +02:00
Martin Kroeker 8da6aca2ec
Support Alder Lake N (fam 6 exmodel 11 model 14) as Haswell 2023-07-16 22:15:15 +02:00
Martin Kroeker b61e64da6f
Merge pull request #4142 from exyntech/armv8-as-arm64
Fix armv8 detection in system_check.cmake
2023-07-15 23:15:49 +02:00
Martin Kroeker f82a197143
Merge pull request #4137 from felixonmars/patch-1
Fix riscv64 detection in system_check.cmake
2023-07-15 19:41:06 +02:00
Martin Kroeker 0a637cc403
Fix workspace query corner cases to always return at least 1 (Reference-LAPACK PR 883) (#4146)
* Fix workspace query corner cases to always return at least 1
2023-07-15 16:37:42 +02:00
Martin Kroeker 4c43d1eeba
Fix C prototypes and LAPACKE headers for ?GEDMD/?GEDMDQ (#4134)
* Fix prototypes for ?GEDMD/?GEDMDQ and their LAPACKE interfaces
2023-07-15 07:47:19 +02:00
Martin Kroeker 49077e7bde
Merge pull request #4145 from martin-frbg/issue4144
Restore zero-initialization of variables in generic ztrsm_utcopy
2023-07-14 12:44:05 +02:00
Martin Kroeker 3d31191b0f
Work around Clang failing to disambiguate SVE intrinsics and add AppleClang crossbuild to MacOS/arm64 DYNAMIC_ARCH in AzureCI (#4140)
* Add AppleClang crossbuild to MacOS/arm64 DYNAMIC_ARCH

* add casts to disambiguate svwhilelt for clang
2023-07-14 11:06:48 +02:00
Martin Kroeker cfa0a80664
Restore initialization of data variables 2023-07-13 23:23:12 +02:00
Martin Kroeker 9567305e4c
Restore initialization of data01,data02 2023-07-13 23:21:18 +02:00
Martin Kroeker affeef0b9c
Fix gmake build not always picking the right ARM64 arch options for clang (#4136)
* Fix gcc version checks erroneously excluding clang

* Avoid some mtune names not supported by (Apple)Clang
2023-07-13 08:38:03 +02:00
Andy Mroczkowski 45b2cd2fb2 treat armv8 CMAKE_SYSTEM_PROCESSOR as arm64
The cmake scripts incorrectly treated armv8 as 32-bit arm, causing
compilation issues. This just adds 'armv8' to the arm64 condition check.
2023-07-12 09:37:45 -04:00
Martin Kroeker 494313e75e
Merge pull request #4138 from martin-frbg/fix4126
Add converted C versions of C/ZRSCL to fix build errors introduced by PR4126
2023-07-11 20:41:02 +02:00
Martin Kroeker afef854863
Add C versions of C/ZRSCL 2023-07-11 17:08:27 +02:00
Martin Kroeker 35dedb68ce
Add C versions of C/ZRSCL 2023-07-11 17:07:30 +02:00
Felix Yan a721fccfdc
Fix riscv64 detection in system_check.cmake 2023-07-11 16:34:20 +03:00
Martin Kroeker 2edebc5fb9
Merge pull request #4133 from martin-frbg/issue4132
Fix info code returned for invalid ldb by IMATCOPY
2023-07-10 01:50:38 +02:00
Martin Kroeker bcebe9b4c9
Merge pull request #4131 from martin-frbg/lapack878
Fix computation of UPLO in LAPACKE_?larfb (Reference-LAPACK PR 878)
2023-07-10 01:50:16 +02:00
Martin Kroeker 26fd4b9c8c
Merge pull request #4129 from martin-frbg/lapack876
Fix segfault in ?GELSS when NRHS is zero (Reference-LAPACK PR 876)
2023-07-10 01:49:55 +02:00
Martin Kroeker 22ad23abb1
Merge pull request #4126 from martin-frbg/lapack839
Add C/ZRSCL for reciprocal scaling of a complex vector (Reference-LAPACK PR 839)
2023-07-10 01:49:33 +02:00
Martin Kroeker 351645b8af
Merge pull request #4123 from martin-frbg/lapack867
Correct order of eigenvals/vecs for 2x2 matrices in ?STEMR (Reference-LAPACK PR 867)
2023-07-10 01:48:18 +02:00
Martin Kroeker f5413447aa
Merge pull request #4122 from martin-frbg/issue4121
Fix CMAKE builds of SVE-capable targets in arm64 DYNAMIC_ARCH
2023-07-09 22:57:44 +02:00
Martin Kroeker 5dd1d9cacd
Merge pull request #4120 from martin-frbg/jenkinsbadge
Add status badges for OSUOSL's POWERCI and IBMZ-CI services to README.MD
2023-07-09 22:57:11 +02:00
Martin Kroeker 15dfb2f2cf
Merge pull request #4118 from XiWeiGu/develop
LoongArch64: Add WhereAmI()
2023-07-09 22:56:47 +02:00
Martin Kroeker 7e93ab1b9e
Fix info code returned for invalid ldb 2023-07-09 17:00:25 +02:00
Martin Kroeker 63ec2ad395
Fix computation of UPLO in LAPACKE_?larfb (Reference-LAPACK PR 878) 2023-07-08 10:00:05 +02:00
Martin Kroeker bed3a6a304
Fix segfault when NRHS is zero (Reference-LAPACK PR 876) 2023-07-07 10:13:41 +02:00
Martin Kroeker e3277e134c
Add CRSCL/ZRSCL (Reference-LAPACK PR 839) 2023-07-05 14:22:59 +02:00
Martin Kroeker 1d399ad0d6
Add CRSCL/ZRSCL (Reference-LAPACK PR839) 2023-07-05 14:17:26 +02:00
Martin Kroeker 069353bd44
Add reciprocal scaling of a complex vector and use it in C/ZGETF2 (Reference-LAPACK PR839) 2023-07-05 14:13:58 +02:00
Martin Kroeker 1363a7c4f1
Correct the order of eigenvalues/vector for 2x2 matrices (Reference-LAPACK PR 867) 2023-07-05 10:10:13 +02:00
Martin Kroeker 2f9f70584e
Fix target list syntax for ARM64 DYNAMIC_ARCH 2023-07-05 09:48:54 +02:00
Martin Kroeker ac698cedad
Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds 2023-07-05 09:47:49 +02:00
Martin Kroeker a32af56761
Add status badges for OSUOSL's POWERCI and IBMZ-CI services 2023-07-02 21:50:15 +02:00
Martin Kroeker 4d0b7fbec0
Adjust M1 crossbuilds after image autoupgrade and add an M1-to-Android crossbuild on Cirrus CI (#4117)
* Fix Apple crossbuilds after Cirrus updated the image; add a crossbuild to Android/ARMV7
2023-07-02 12:38:57 +02:00
gxw 6deb52812d LoongArch64: Add WhereAmI() 2023-06-30 14:11:01 +08:00
Martin Kroeker 2183dbcfe2
Merge pull request #4113 from xianyi/revert-4103-windows_perf
Revert "Improve Windows threading performance scaling"
2023-06-30 00:00:25 +02:00
Martin Kroeker bb862b82d5
Fix integer overflow in multithreading threshold calculation for SYMM/SYRK (#4116)
* Fix potential integer overflow
2023-06-29 23:59:25 +02:00
Martin Kroeker cbb5deb2cc
Merge pull request #4114 from xianyi/jenkins
Add Jenkins configuration files for OSUOSL powerci and ibmz-ci
2023-06-29 08:31:06 +02:00
Martin Kroeker f7b916076b
Add Jenkins configuration files for OSUOSL powerci and ibmz-ci 2023-06-28 21:30:59 +02:00
Xianyi Zhang e14a025bb1 Temporily walk around zaxpy vector kernel bug. 2023-06-28 11:17:38 +00:00
Martin Kroeker 616fdea82a
Revert "Improve Windows threading performance scaling" 2023-06-28 09:45:17 +02:00
Martin Kroeker b4f233e881
Merge pull request #4108 from martin-frbg/c910-iamax
Fix RISCV-C910V IDAMAX
2023-06-27 18:15:18 +02:00
Martin Kroeker ceaee7dc64
remove the limitation to -O1 again 2023-06-27 16:13:23 +02:00
Martin Kroeker 772b0cc715
Fix early bailout 2023-06-27 16:12:27 +02:00
Martin Kroeker d6be5036d7
Fix IDAMAX 2023-06-26 21:19:33 +02:00
Martin Kroeker dc2439108d
Merge pull request #4107 from martin-frbg/lapack860
Remove OMP-private variable outside OpenMP context (Reference-LAPACK PR 860)
2023-06-26 21:18:19 +02:00
Martin Kroeker 3688c42628
Merge pull request #4106 from martin-frbg/lapack852
Remove warnings and rename variable (Reference-LAPACK PR 852)
2023-06-26 18:09:28 +02:00
Martin Kroeker 49202228d7
Remove OMP-private variable outside OMP context (LAPACK PR860) 2023-06-26 16:29:04 +02:00