Commit Graph

7452 Commits

Author SHA1 Message Date
Martin Kroeker 36fcb52094
Fix logic - we want real OR imaginary part of X to be nonzero here 2023-04-01 00:02:54 +02:00
Guillaume Horel 397108fba2 serialize shared prerequisites 2023-03-31 09:25:51 -04:00
Guillaume Horel 281e834566 do not pass -j flag to the MAKE variable 2023-03-31 09:25:51 -04:00
Martin Kroeker d708951375
Merge pull request #3980 from martin-frbg/fix3941-2
Split and improve test criteria in LU computation (?GETF2)
2023-03-30 06:56:05 +02:00
Martin Kroeker 6c431239da
Split test condition in LU computation - non-denormal for computation, exact zero for reporting singularity 2023-03-29 22:14:21 +02:00
Martin Kroeker 23f2c4ca5b
Merge pull request #3978 from martin-frbg/fix3941
fix division-by-zero guard in zgetf2
2023-03-29 16:22:27 +02:00
Martin Kroeker 12aabb9f9b
fix conditional 2023-03-29 09:44:33 +02:00
Martin Kroeker fd0614cbc0
Merge pull request #3975 from martin-frbg/issue3974
Fix build failures with NO_LAPACK
2023-03-28 22:57:27 +02:00
Martin Kroeker 912d713b52
redo lost edit 2023-03-28 18:31:04 +02:00
Martin Kroeker dc15c18efc
Fix build failures seen with the NO_LAPACK option - cspr/csymv/csyr belong on the LAPACK list 2023-03-28 16:33:09 +02:00
Martin Kroeker 5d9d382e36
Merge pull request #3970 from linouxis9/develop
Improve Intel Raptor Lake detection
2023-03-28 16:22:27 +02:00
Martin Kroeker 867eee60af
Merge pull request #3972 from h-vetinari/ifndef
remove unqualified ifndef's for NO_LAPACK(E)
2023-03-28 16:17:49 +02:00
H. Vetinari f2659516ef remove unqualified ifdef's for NO_LAPACK(E) 2023-03-28 19:01:31 +11:00
linouxis9 280b6d57d1 Add more Intel Raptor Lake CPUIDs 2023-03-28 00:55:23 +02:00
Martin Kroeker 13add99540
Update Changelog for 0.3.22 2023-03-27 00:13:07 +02:00
Martin Kroeker c05da5960d
Update Changelog for 0.3.22 (#3964) 2023-03-27 00:11:05 +02:00
Martin Kroeker 1c2a60e6cf
Update version to 0.3.22.dev 2023-03-27 00:00:04 +02:00
Martin Kroeker 3c9654219f
Update version to 0.3.22.dev 2023-03-26 23:37:13 +02:00
Martin Kroeker cf97c01244
Merge pull request #3963 from xianyi/release-0.3.0
mergeback from release branch
2023-03-26 23:34:58 +02:00
Martin Kroeker e46971b9d5
Merge branch 'develop' into release-0.3.0 2023-03-26 23:34:17 +02:00
Martin Kroeker 5736dbae16
Merge pull request #3961 from martin-frbg/lapack807
Replace the conditionally defined index INDIBL with a constant (Reference-LAPACK PR807)
2023-03-26 23:17:36 +02:00
Martin Kroeker 09ace3cf23
Merge pull request #3846 from lilh9598/sbgemm_opt
Improve the performance of sbgemm_tcopy on neoversen2
2023-03-26 19:04:57 +02:00
Martin Kroeker c48bbe93d4
Replace the conditionally defined index variable INDIBL with a constant 1 2023-03-26 17:33:21 +02:00
Martin Kroeker 2d39e715e2
Merge pull request #3960 from martin-frbg/symmsyrk_sp
Add multithreading threshold for SYMM and rework the one for SYRK
2023-03-26 17:02:39 +02:00
Martin Kroeker f2d6b1c70e
Add multithreading threshold 2023-03-26 00:25:28 +01:00
Martin Kroeker a495ffc554
Rework multithreading threshold 2023-03-26 00:23:57 +01:00
Martin Kroeker 1659750861
Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959)
* Add ARMV5
2023-03-26 00:22:06 +01:00
Martin Kroeker 7f56c31e21
Merge pull request #3958 from tttapa/develop
Add ARMv6 support to cmake/prebuild.cmake
2023-03-25 19:34:57 +01:00
Martin Kroeker cf80350b89
Merge pull request #3957 from martin-frbg/issue3954
Fix detection of Cray CCE after recent detection changes for gfortran on Cray
2023-03-25 18:33:58 +01:00
Pieter Pas a79ae116e2 Add ARMV6 to cmake/prebuild.cmake,
based on cpuid_arm.c and param.h
2023-03-25 16:30:42 +01:00
Martin Kroeker 280c8469d4
Fix detection of Cray CCE after detection changes for gfortran on Cray
CCE15 no longer produces a .ident line, so look for HPE name in a note section as well
2023-03-25 14:18:21 +01:00
Martin Kroeker c68afeb567
Merge pull request #3956 from martin-frbg/issue3955
Update Reference-LAPACK README to 3.11.0
2023-03-24 23:06:25 +01:00
Martin Kroeker c2b01c6f81
Update to 3.11.0 2023-03-24 22:54:43 +01:00
Martin Kroeker 02920be53a
Merge pull request #3953 from martin-frbg/issue3801
Do not use multithreading for small SYMV/SYR2K workloads
2023-03-24 13:51:37 +01:00
Martin Kroeker 244147495a
Do not use multithreading for small workloads 2023-03-23 23:13:02 +01:00
Martin Kroeker 2885e4d898
Merge pull request #3951 from martin-frbg/issue3940
Do not change the global OMP_NUM_THREADS
2023-03-22 15:14:00 +01:00
Martin Kroeker e09e494f25
Merge pull request #3950 from martin-frbg/issue3856-syr
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
2023-03-22 10:42:57 +01:00
Martin Kroeker dac2d7aef0
Merge pull request #3952 from martin-frbg/cleanup3936
Avoid unused variable warning in non-OpenMP builds
2023-03-22 09:00:14 +01:00
Martin Kroeker 743ae5ab18
move declaration of openmp_nthreads under the related ifdef 2023-03-21 21:07:37 +01:00
Martin Kroeker 579bc86671
remove call to omp_set_num_threads 2023-03-21 20:58:56 +01:00
Martin Kroeker ab32f832a8
fix stray blank on continuation line 2023-03-21 08:29:05 +01:00
Martin Kroeker e359787e28
restore C/Z SPMV, SPR, SYR,SYMV 2023-03-21 07:43:03 +01:00
Martin Kroeker 6ca6eb7613
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations 2023-03-20 23:04:12 +01:00
Martin Kroeker 2b77f28810
Merge pull request #3949 from martin-frbg/lapack750
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750)
2023-03-20 22:48:06 +01:00
Martin Kroeker 164eafd61d
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750) 2023-03-20 19:51:07 +01:00
Martin Kroeker 871b730dc5
Create SECURITY.md 2023-03-20 15:12:35 +01:00
Martin Kroeker 7719dbecde
Merge pull request #3946 from martin-frbg/lapack682
Rewrite ?LAQR5 and S/DHGEQZ, add tests for TRECV3 (Reference-LAPACK PR 682)
2023-03-20 13:48:57 +01:00
Martin Kroeker 147e2fbf87
Add numerical tests for TRECV3 (Reference-LAPACK 682) 2023-03-20 10:04:05 +01:00
Martin Kroeker 2a83ec1f79
Rewrite to use FMA with Householder reflectors 2023-03-20 10:00:42 +01:00
Martin Kroeker e1c3c34178
Standardize coding style (Reference-LAPACK 682) 2023-03-20 09:59:03 +01:00