7252 Commits

Author SHA1 Message Date
Martin Kroeker
affeef0b9c Fix gmake build not always picking the right ARM64 arch options for clang (#4136)
* Fix gcc version checks erroneously excluding clang

* Avoid some mtune names not supported by (Apple)Clang
2023-07-13 08:38:03 +02:00
Martin Kroeker
494313e75e Merge pull request #4138 from martin-frbg/fix4126
Add converted C versions of C/ZRSCL to fix build errors introduced by PR4126
2023-07-11 20:41:02 +02:00
Martin Kroeker
afef854863 Add C versions of C/ZRSCL 2023-07-11 17:08:27 +02:00
Martin Kroeker
35dedb68ce Add C versions of C/ZRSCL 2023-07-11 17:07:30 +02:00
Martin Kroeker
2edebc5fb9 Merge pull request #4133 from martin-frbg/issue4132
Fix info code returned for invalid ldb by IMATCOPY
2023-07-10 01:50:38 +02:00
Martin Kroeker
bcebe9b4c9 Merge pull request #4131 from martin-frbg/lapack878
Fix computation of UPLO in LAPACKE_?larfb (Reference-LAPACK PR 878)
2023-07-10 01:50:16 +02:00
Martin Kroeker
26fd4b9c8c Merge pull request #4129 from martin-frbg/lapack876
Fix segfault in ?GELSS when NRHS is zero (Reference-LAPACK PR 876)
2023-07-10 01:49:55 +02:00
Martin Kroeker
22ad23abb1 Merge pull request #4126 from martin-frbg/lapack839
Add C/ZRSCL for reciprocal scaling of a complex vector (Reference-LAPACK PR 839)
2023-07-10 01:49:33 +02:00
Martin Kroeker
351645b8af Merge pull request #4123 from martin-frbg/lapack867
Correct order of eigenvals/vecs for 2x2 matrices in ?STEMR (Reference-LAPACK PR 867)
2023-07-10 01:48:18 +02:00
Martin Kroeker
f5413447aa Merge pull request #4122 from martin-frbg/issue4121
Fix CMAKE builds of SVE-capable targets in arm64 DYNAMIC_ARCH
2023-07-09 22:57:44 +02:00
Martin Kroeker
5dd1d9cacd Merge pull request #4120 from martin-frbg/jenkinsbadge
Add status badges for OSUOSL's POWERCI and IBMZ-CI services to README.MD
2023-07-09 22:57:11 +02:00
Martin Kroeker
15dfb2f2cf Merge pull request #4118 from XiWeiGu/develop
LoongArch64: Add WhereAmI()
2023-07-09 22:56:47 +02:00
Martin Kroeker
7e93ab1b9e Fix info code returned for invalid ldb 2023-07-09 17:00:25 +02:00
Martin Kroeker
63ec2ad395 Fix computation of UPLO in LAPACKE_?larfb (Reference-LAPACK PR 878) 2023-07-08 10:00:05 +02:00
Martin Kroeker
bed3a6a304 Fix segfault when NRHS is zero (Reference-LAPACK PR 876) 2023-07-07 10:13:41 +02:00
Martin Kroeker
e3277e134c Add CRSCL/ZRSCL (Reference-LAPACK PR 839) 2023-07-05 14:22:59 +02:00
Martin Kroeker
1d399ad0d6 Add CRSCL/ZRSCL (Reference-LAPACK PR839) 2023-07-05 14:17:26 +02:00
Martin Kroeker
069353bd44 Add reciprocal scaling of a complex vector and use it in C/ZGETF2 (Reference-LAPACK PR839) 2023-07-05 14:13:58 +02:00
Martin Kroeker
1363a7c4f1 Correct the order of eigenvalues/vector for 2x2 matrices (Reference-LAPACK PR 867) 2023-07-05 10:10:13 +02:00
Martin Kroeker
2f9f70584e Fix target list syntax for ARM64 DYNAMIC_ARCH 2023-07-05 09:48:54 +02:00
Martin Kroeker
ac698cedad Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds 2023-07-05 09:47:49 +02:00
Martin Kroeker
a32af56761 Add status badges for OSUOSL's POWERCI and IBMZ-CI services 2023-07-02 21:50:15 +02:00
Martin Kroeker
4d0b7fbec0 Adjust M1 crossbuilds after image autoupgrade and add an M1-to-Android crossbuild on Cirrus CI (#4117)
* Fix Apple crossbuilds after Cirrus updated the image; add a crossbuild to Android/ARMV7
2023-07-02 12:38:57 +02:00
gxw
6deb52812d LoongArch64: Add WhereAmI() 2023-06-30 14:11:01 +08:00
Martin Kroeker
2183dbcfe2 Merge pull request #4113 from xianyi/revert-4103-windows_perf
Revert "Improve Windows threading performance scaling"
2023-06-30 00:00:25 +02:00
Martin Kroeker
bb862b82d5 Fix integer overflow in multithreading threshold calculation for SYMM/SYRK (#4116)
* Fix potential integer overflow
2023-06-29 23:59:25 +02:00
Martin Kroeker
cbb5deb2cc Merge pull request #4114 from xianyi/jenkins
Add Jenkins configuration files for OSUOSL powerci and ibmz-ci
2023-06-29 08:31:06 +02:00
Martin Kroeker
f7b916076b Add Jenkins configuration files for OSUOSL powerci and ibmz-ci 2023-06-28 21:30:59 +02:00
Xianyi Zhang
e14a025bb1 Temporily walk around zaxpy vector kernel bug. 2023-06-28 11:17:38 +00:00
Martin Kroeker
616fdea82a Revert "Improve Windows threading performance scaling" 2023-06-28 09:45:17 +02:00
Martin Kroeker
b4f233e881 Merge pull request #4108 from martin-frbg/c910-iamax
Fix RISCV-C910V IDAMAX
2023-06-27 18:15:18 +02:00
Martin Kroeker
ceaee7dc64 remove the limitation to -O1 again 2023-06-27 16:13:23 +02:00
Martin Kroeker
772b0cc715 Fix early bailout 2023-06-27 16:12:27 +02:00
Martin Kroeker
d6be5036d7 Fix IDAMAX 2023-06-26 21:19:33 +02:00
Martin Kroeker
dc2439108d Merge pull request #4107 from martin-frbg/lapack860
Remove OMP-private variable outside OpenMP context (Reference-LAPACK PR 860)
2023-06-26 21:18:19 +02:00
Martin Kroeker
3688c42628 Merge pull request #4106 from martin-frbg/lapack852
Remove warnings and rename variable (Reference-LAPACK PR 852)
2023-06-26 18:09:28 +02:00
Martin Kroeker
49202228d7 Remove OMP-private variable outside OMP context (LAPACK PR860) 2023-06-26 16:29:04 +02:00
Martin Kroeker
a0e413ef05 Update ssytrd_sb2st.F
Remove OMP-private variable outside OMP context (LAPACK PR860)
2023-06-26 16:28:15 +02:00
Martin Kroeker
5b69217808 Update zhetrd_hb2st.F
Remove OMP-private variable outside OMP context (LAPACK PR860)
2023-06-26 16:26:21 +02:00
Martin Kroeker
b9fb63c05e Remove OMP-private variable outside OMP context (LAPACK PR860) 2023-06-26 16:25:08 +02:00
Martin Kroeker
329bd3410b Remove legacy warning comments and rename variable LAMDA to LAMBDA (Reference-LAPACK PR 852) 2023-06-26 15:44:10 +02:00
Martin Kroeker
4ecb68554a Merge pull request #4086 from sl1pkn07/fix4083
Install .cmake files in CMAKE_INSTALL_LIBDIR
2023-06-26 14:53:34 +02:00
Martin Kroeker
8488dc9876 Merge pull request #4105 from martin-frbg/c910v-fix
Fix RISCV C910V build (and related CI job hanging) on develop branch
2023-06-26 01:19:39 +02:00
Martin Kroeker
b13787971e Enforce -O1 to work around miscompilation by the vendor gcc 2023-06-25 22:37:56 +02:00
Martin Kroeker
1fe96f8da7 Fix failures to handle increments of zero 2023-06-25 22:36:57 +02:00
Martin Kroeker
f863895ebf Merge pull request #4104 from martin-frbg/lapack855
Fix inline documentation of S/DLASD0 (Reference-LAPACK PR 855)
2023-06-25 20:58:49 +02:00
Martin Kroeker
7acff1be1c Merge pull request #4103 from mseminatore/windows_perf
Improve Windows threading performance scaling
2023-06-25 16:01:12 +02:00
Martin Kroeker
572e482b38 Fix inline documentation of S/DLASD0 (Reference-LAPACK PR 855) 2023-06-25 12:42:05 +02:00
Mark Seminatore
d6991dd230 fix missing #endif 2023-06-24 15:43:32 -07:00
Mark Seminatore
7783a9af02 attempt to fix old mingw gcc issue 2023-06-24 14:35:11 -07:00