Commit Graph

6748 Commits

Author SHA1 Message Date
Martin Kroeker
258a83a642 change line endings from CRLF to LF 2022-11-16 21:57:42 +01:00
Martin Kroeker
ad95ad52f2 Merge pull request #3819 from martin-frbg/lapack703
Fix implicit conversions and unused variables in LAPACKE and TESTING (Reference-LAPACK PR 703)
2022-11-16 20:49:54 +01:00
Martin Kroeker
f16aa1ce7a Merge pull request #3821 from martin-frbg/lapack651
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
2022-11-16 13:59:02 +01:00
Martin Kroeker
2a97ca615f MSVC compatibility fixes 2022-11-16 07:36:40 +01:00
Martin Kroeker
5dec93e93b Complete the C conversion of the xTRSYL3 files 2022-11-15 20:36:58 +01:00
Martin Kroeker
b6d74b7fff Add f2c-converted files for the BLAS3-based Sylvester solver 2022-11-15 16:26:44 +01:00
Martin Kroeker
b2cc310470 Add f2c-converted versions of the new BLAS3-based Sylvester solver 2022-11-15 14:23:46 +01:00
Martin Kroeker
379efbe5af Fix typos 2022-11-15 11:03:12 +01:00
Martin Kroeker
95da5141f0 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-14 22:21:29 +01:00
Martin Kroeker
2592853fc7 Restore OpenBLAS-specific changes 2022-11-14 21:47:37 +01:00
Martin Kroeker
52c2a0397b Restore OpenBLAS modifications to link line 2022-11-14 17:13:08 +01:00
Martin Kroeker
bb652f65a3 Typo fix 2022-11-14 16:35:13 +01:00
Martin Kroeker
fb42a0cf8b Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-14 14:06:50 +01:00
Martin Kroeker
c5c4888ed6 Merge pull request #3818 from martin-frbg/lapack665
Fix leading dimension check of eigen-/Schur vectors in LAPACKE (Reference-LAPACK PR 665)
2022-11-14 07:27:58 +01:00
Martin Kroeker
92411dfecb Merge pull request #3817 from martin-frbg/lapack738742
Add NaN check functions for trapezoidal matrices to LAPACKE (Reference-LAPACK PR 738+742)
2022-11-13 23:49:15 +01:00
Martin Kroeker
13f3bbece1 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:18:09 +01:00
Martin Kroeker
92174725d9 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:16:12 +01:00
Martin Kroeker
6eb707d941 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:10:13 +01:00
Martin Kroeker
7eb2653268 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:07:10 +01:00
Martin Kroeker
4bc918a791 Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:03:31 +01:00
Martin Kroeker
35dac5677a Merge pull request #3816 from martin-frbg/lapack638
Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638)
2022-11-13 20:38:42 +01:00
Martin Kroeker
fdb012ceed Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:37:18 +01:00
Martin Kroeker
c99d27ae45 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:33:20 +01:00
Martin Kroeker
a485e4f515 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:30:06 +01:00
Martin Kroeker
0c2aa0bed7 Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:29:08 +01:00
Martin Kroeker
2226a82f2e Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) 2022-11-13 17:50:49 +01:00
Martin Kroeker
645633e321 Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) 2022-11-13 17:48:02 +01:00
Martin Kroeker
ee6643bc6b Merge pull request #3815 from martin-frbg/lapack690
Fix workspace calculation in the left-looking variant of GEQRF (Reference-LAPACK PR690)
2022-11-13 16:26:31 +01:00
Martin Kroeker
90d7451df5 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 15:10:00 +01:00
Martin Kroeker
eba1112e38 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 15:03:39 +01:00
Martin Kroeker
23cfe58ee3 Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 14:55:45 +01:00
Martin Kroeker
6dcf737c5d Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 14:51:39 +01:00
Martin Kroeker
3e2d52c502 Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638) 2022-11-13 13:00:52 +01:00
Martin Kroeker
cb48c29b6f Fix workspace calculation (Reference-LAPACK PR690) 2022-11-13 12:49:59 +01:00
Martin Kroeker
8c99d5d1b6 Merge pull request #3796 from martin-frbg/gemmt
Add a trivial GEMMT implementation based on a looped GEMV
2022-11-12 19:06:05 +01:00
Martin Kroeker
b53b0f6bb6 Merge pull request #3802 from martin-frbg/relafix
Fix cmake compilation of ReLAPACK and expose its INCLUDE_ALL option
2022-11-12 15:11:31 +01:00
Martin Kroeker
9a31faf420 Merge pull request #3811 from martin-frbg/issue3805
Improve gcc arch option selecting for Neoverse cpus
2022-11-10 10:57:33 +01:00
Martin Kroeker
e326ef9f0f Merge pull request #3812 from bartoldeman/cscal-zscal-skylakex
Add [cz]scal microkernels for SKYLAKEX
2022-11-10 08:00:27 +01:00
Martin Kroeker
827a9c6079 Merge pull request #3814 from martin-frbg/traviswait-3
Travis Ci: Increase the wait time for ppc jobs again
2022-11-10 08:00:02 +01:00
Martin Kroeker
d141cf341f Increase the wait time for ppc jobs again 2022-11-09 20:31:30 +01:00
Martin Kroeker
aad79ab516 Merge pull request #3813 from martin-frbg/azuredynosx
AzureCi: Limit cpu models in OSX_dynarch_cmake to keep it from running out of time
2022-11-09 20:29:17 +01:00
Martin Kroeker
09dd90ca09 Limit cpu models in OSX_dynarch_cmake 2022-11-09 15:35:57 +01:00
Martin Kroeker
f14435cb4b Merge pull request #3810 from martin-frbg/fix3800
Add fallbacks to RaptorLake entry from PR3800
2022-11-09 15:28:12 +01:00
Bart Oldeman
6c1043eb41 Add [cz]scal microkernels for SKYLAKEX
These are as similar to dscal_microk_skylakex-2.c as possible
for consistency.

Note that before this change SKYLAKEX+ uses generic C functions for
cscal/zscal via commit 2271c350 from #2610 (which is masked by
commit 086d87a30). However now #3799 disables FMAs (in turn enabled
by `-march=skylake-avx512`) in the plain C code which fixes excessive
LAPACK test failures more nicely.
2022-11-09 08:57:03 -05:00
Martin Kroeker
be546ec1ad Add gcc options for Neoverse cpus 2022-11-09 11:00:41 +01:00
Martin Kroeker
c957ad684e Bump gcc requirement for NeoverseN2 and V1 to 10.4 2022-11-09 10:46:43 +01:00
Martin Kroeker
1865b15240 Add fallbacks to RaptorLake entry 2022-11-09 10:31:30 +01:00
Martin Kroeker
e6204d254f Update CMakeLists.txt 2022-11-08 16:21:11 +01:00
Martin Kroeker
2e64722681 Update Makefile.rule 2022-11-08 16:20:17 +01:00
Martin Kroeker
aa2a2d9c01 Conditionally compile files that may get replaced by ReLAPACK 2022-11-08 12:04:46 +01:00