Commit Graph

7010 Commits

Author SHA1 Message Date
Martin Kroeker
516f22b8ca Update version to 0.3.23.dev 2023-04-01 22:25:55 +02:00
Martin Kroeker
3e8f51e7cf Update version to 0.3.23.dev 2023-04-01 22:25:07 +02:00
Martin Kroeker
f9a701b6dd Merge pull request #3988 from xianyi/release-0.3.0
Merge back from release branch into develop to copy tag
2023-04-01 22:24:26 +02:00
Martin Kroeker
394a9fbafe Increment version to 0.3.23 v0.3.23 2023-04-01 22:18:01 +02:00
Martin Kroeker
8f32384633 Increment version to 0.3.23 2023-04-01 22:17:27 +02:00
Martin Kroeker
af3606d9fb Merge pull request #3987 from xianyi/develop
Merge from develop branch for 0.3.23
2023-04-01 22:16:24 +02:00
Martin Kroeker
cd2e80ca2e Merge branch 'release-0.3.0' into develop 2023-04-01 22:15:52 +02:00
Martin Kroeker
e2614eb6ce Merge pull request #3986 from martin-frbg/changelog0323
Update with 0.3.23 changes
2023-04-01 22:08:43 +02:00
Martin Kroeker
1f70481384 Update with 0.3.23 changes 2023-04-01 20:33:31 +02:00
Martin Kroeker
eb0793bfd0 Merge pull request #3984 from martin-frbg/develop
Fix logic bug in single-threaded C/Z SPR
2023-04-01 11:35:52 +02:00
Martin Kroeker
36fcb52094 Fix logic - we want real OR imaginary part of X to be nonzero here 2023-04-01 00:02:54 +02:00
Martin Kroeker
d708951375 Merge pull request #3980 from martin-frbg/fix3941-2
Split and improve test criteria in LU computation (?GETF2)
2023-03-30 06:56:05 +02:00
Martin Kroeker
6c431239da Split test condition in LU computation - non-denormal for computation, exact zero for reporting singularity 2023-03-29 22:14:21 +02:00
Martin Kroeker
23f2c4ca5b Merge pull request #3978 from martin-frbg/fix3941
fix division-by-zero guard in zgetf2
2023-03-29 16:22:27 +02:00
Martin Kroeker
12aabb9f9b fix conditional 2023-03-29 09:44:33 +02:00
Martin Kroeker
fd0614cbc0 Merge pull request #3975 from martin-frbg/issue3974
Fix build failures with NO_LAPACK
2023-03-28 22:57:27 +02:00
Martin Kroeker
912d713b52 redo lost edit 2023-03-28 18:31:04 +02:00
Martin Kroeker
dc15c18efc Fix build failures seen with the NO_LAPACK option - cspr/csymv/csyr belong on the LAPACK list 2023-03-28 16:33:09 +02:00
Martin Kroeker
5d9d382e36 Merge pull request #3970 from linouxis9/develop
Improve Intel Raptor Lake detection
2023-03-28 16:22:27 +02:00
Martin Kroeker
867eee60af Merge pull request #3972 from h-vetinari/ifndef
remove unqualified ifndef's for NO_LAPACK(E)
2023-03-28 16:17:49 +02:00
H. Vetinari
f2659516ef remove unqualified ifdef's for NO_LAPACK(E) 2023-03-28 19:01:31 +11:00
linouxis9
280b6d57d1 Add more Intel Raptor Lake CPUIDs 2023-03-28 00:55:23 +02:00
Martin Kroeker
13add99540 Update Changelog for 0.3.22 2023-03-27 00:13:07 +02:00
Martin Kroeker
c05da5960d Update Changelog for 0.3.22 (#3964) 2023-03-27 00:11:05 +02:00
Martin Kroeker
1c2a60e6cf Update version to 0.3.22.dev 2023-03-27 00:00:04 +02:00
Martin Kroeker
3c9654219f Update version to 0.3.22.dev 2023-03-26 23:37:13 +02:00
Martin Kroeker
cf97c01244 Merge pull request #3963 from xianyi/release-0.3.0
mergeback from release branch
2023-03-26 23:34:58 +02:00
Martin Kroeker
e46971b9d5 Merge branch 'develop' into release-0.3.0 v0.3.22 2023-03-26 23:34:17 +02:00
Martin Kroeker
5736dbae16 Merge pull request #3961 from martin-frbg/lapack807
Replace the conditionally defined index INDIBL with a constant (Reference-LAPACK PR807)
2023-03-26 23:17:36 +02:00
Martin Kroeker
09ace3cf23 Merge pull request #3846 from lilh9598/sbgemm_opt
Improve the performance of sbgemm_tcopy on neoversen2
2023-03-26 19:04:57 +02:00
Martin Kroeker
c48bbe93d4 Replace the conditionally defined index variable INDIBL with a constant 1 2023-03-26 17:33:21 +02:00
Martin Kroeker
2d39e715e2 Merge pull request #3960 from martin-frbg/symmsyrk_sp
Add multithreading threshold for SYMM and rework the one for SYRK
2023-03-26 17:02:39 +02:00
Martin Kroeker
f2d6b1c70e Add multithreading threshold 2023-03-26 00:25:28 +01:00
Martin Kroeker
a495ffc554 Rework multithreading threshold 2023-03-26 00:23:57 +01:00
Martin Kroeker
1659750861 Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959)
* Add ARMV5
2023-03-26 00:22:06 +01:00
Martin Kroeker
7f56c31e21 Merge pull request #3958 from tttapa/develop
Add ARMv6 support to cmake/prebuild.cmake
2023-03-25 19:34:57 +01:00
Martin Kroeker
cf80350b89 Merge pull request #3957 from martin-frbg/issue3954
Fix detection of Cray CCE after recent detection changes for gfortran on Cray
2023-03-25 18:33:58 +01:00
Pieter Pas
a79ae116e2 Add ARMV6 to cmake/prebuild.cmake,
based on cpuid_arm.c and param.h
2023-03-25 16:30:42 +01:00
Martin Kroeker
280c8469d4 Fix detection of Cray CCE after detection changes for gfortran on Cray
CCE15 no longer produces a .ident line, so  look for HPE name in a note section as well
2023-03-25 14:18:21 +01:00
Martin Kroeker
c68afeb567 Merge pull request #3956 from martin-frbg/issue3955
Update Reference-LAPACK README to 3.11.0
2023-03-24 23:06:25 +01:00
Martin Kroeker
c2b01c6f81 Update to 3.11.0 2023-03-24 22:54:43 +01:00
Martin Kroeker
02920be53a Merge pull request #3953 from martin-frbg/issue3801
Do not use multithreading for small SYMV/SYR2K workloads
2023-03-24 13:51:37 +01:00
Martin Kroeker
244147495a Do not use multithreading for small workloads 2023-03-23 23:13:02 +01:00
Martin Kroeker
2885e4d898 Merge pull request #3951 from martin-frbg/issue3940
Do not change the global OMP_NUM_THREADS
2023-03-22 15:14:00 +01:00
Martin Kroeker
e09e494f25 Merge pull request #3950 from martin-frbg/issue3856-syr
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
2023-03-22 10:42:57 +01:00
Martin Kroeker
dac2d7aef0 Merge pull request #3952 from martin-frbg/cleanup3936
Avoid unused variable warning in non-OpenMP builds
2023-03-22 09:00:14 +01:00
Martin Kroeker
743ae5ab18 move declaration of openmp_nthreads under the related ifdef 2023-03-21 21:07:37 +01:00
Martin Kroeker
579bc86671 remove call to omp_set_num_threads 2023-03-21 20:58:56 +01:00
Martin Kroeker
ab32f832a8 fix stray blank on continuation line 2023-03-21 08:29:05 +01:00
Martin Kroeker
e359787e28 restore C/Z SPMV, SPR, SYR,SYMV 2023-03-21 07:43:03 +01:00