Commit Graph

7563 Commits

Author SHA1 Message Date
Martin Kroeker 3d38da2bc4
Make vector orthogonalization more reliable (Reference-LAPACK PR 930) 2023-11-12 16:50:52 +01:00
Martin Kroeker d58c88cf42
Merge pull request #4310 from martin-frbg/lapack904
Apply rounding up to workspace calculations done with reals (Reference-LAPACK PR 904)
2023-11-12 16:45:10 +01:00
Martin Kroeker feeb10435b
Merge pull request #4309 from martin-frbg/lapack926
Change ?GECON to return INFO=1 if RCOND is NaN (Reference-LAPACK PR 926)
2023-11-12 15:28:16 +01:00
Martin Kroeker 2ce67e2ada
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:42:52 +01:00
Martin Kroeker f5664740cd
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:29:04 +01:00
Martin Kroeker 71fbdd908d
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 14:10:16 +01:00
Martin Kroeker c9378badd9
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 13:56:06 +01:00
Martin Kroeker 225036fd92
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904) 2023-11-12 13:43:22 +01:00
Martin Kroeker eef4d15369
Merge pull request #4308 from martin-frbg/issue4277-2
Add workaround for omp_get_max_threads hanging on FreeBSD/LLVM14
2023-11-12 13:08:43 +01:00
Martin Kroeker 58427ff74d
Deprecate ?GELQS and ?GEQRS from TESTING/LIN (Reference-LAPACK PR 900) (#4307)
* Move ?GELQS and ?GEQRS from TESTING/LIN to DEPRECATED (Reference-LAPACK PR 900)

* Add f2c-converted versions of ?GELQS and ?GEQRS
2023-11-12 10:54:39 +01:00
Martin Kroeker b6144f70ff
Change ?GECON to return INFO=1 if RCOND is NaN (Reference-LAPACK PR 926) 2023-11-11 23:41:18 +01:00
Martin Kroeker 00ef1bb58a
Merge pull request #4306 from angsch/develop
Improve matcopy interface
2023-11-11 23:19:10 +01:00
Martin Kroeker 9324520d0e
typo fix 2023-11-11 23:14:58 +01:00
Martin Kroeker ff6437f2d7
Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14 2023-11-11 21:30:32 +01:00
Martin Kroeker 9c3c1cfbd6
Merge pull request #4304 from martin-frbg/issue4277
Move clang/gfortran OpenMP dependency rewriting out of f_check
2023-11-11 20:58:21 +01:00
Martin Kroeker cad10a3caa
Merge pull request #4303 from martin-frbg/ryzen-avx512
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker 95ed8125fa
Merge pull request #4290 from martin-frbg/issue4275-2
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Angelika Schwarz 5ffbe646e1 Improve matcopy interface
* rows = 0 or cols = 0 is now a legal input and
  takes quick return path
* Follow BLAS/LAPACK convention that the leading
  dimensions must be at least 1.
2023-11-11 11:16:10 +01:00
Martin Kroeker 1a308a0066
Move OpenMP dependency handling for clang/gfortran combo 2023-11-10 15:27:46 +01:00
Martin Kroeker 3ad27007fc
rebase 2023-11-10 15:19:11 +01:00
Martin Kroeker 778e3b746a
Enable autodetection of current AMD cpus as their AVX512 Intel counterparts 2023-11-10 11:55:29 +01:00
Martin Kroeker 273f4e8794
Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
Fix FCOMMON_OPT for power.  Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner 3ada8fbf8a Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles 2023-11-08 11:17:45 -06:00
Martin Kroeker 0a609cc689
Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner 4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner 5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Chip Kerchner 206e76187e Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX. 2023-11-07 18:08:57 -06:00
Martin Kroeker 45da7e6c91
Merge pull request #4288 from martin-frbg/fix3915
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker c14b1ebd30
Merge pull request #4295 from martin-frbg/lapack928
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker fea1d4f66c
Merge pull request #4294 from martin-frbg/lapack909
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker 6f11992dbb
Merge pull request #4293 from martin-frbg/lapack927
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker 176cc6348e
Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928) 2023-11-06 21:55:19 +01:00
Martin Kroeker f6ec777701
Fix accumulation (Reference-LAPACK PR 909) 2023-11-06 21:40:50 +01:00
Martin Kroeker cf8295da5c
Fix potential overflow in the calculation of MAXIT 2023-11-06 21:22:26 +01:00
Martin Kroeker ac7efc61fd
Put more build information into Makefile.conf_last 2023-11-06 00:03:33 +01:00
Martin Kroeker cd8eb83bae
Fix allocations and compiler warnings in ZROTG (#4289)
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker 3a86fde5a7
Delete misplaced file from PR 3915 after applying in the correct place 2023-11-04 23:52:46 +01:00
Martin Kroeker 04bc801999
(Re)apply fixes for supporting only a subset of precision types from PR 3915 2023-11-04 23:48:59 +01:00
Martin Kroeker 25096e5afe
Merge pull request #4287 from martin-frbg/issue4282
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker 9019bc4945
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well 2023-11-04 22:10:06 +01:00
Martin Kroeker 4c6ee3f18e
Merge pull request #4285 from martin-frbg/cortexfix
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker 3bfa4d4dcc
Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE 2023-11-03 14:55:31 +01:00
Martin Kroeker 59444415ac
Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner 7dcb2d67f2 Have POWER7 return arch=POWER6. 2023-11-01 15:23:28 -05:00
Chip-Kerchner c8882bd9d8 Remove POWER7 from cpu list. 2023-11-01 14:53:55 -05:00
Chip-Kerchner d8e6e2b04d Merge branch 'develop' into dynamicDispatchAIXandClang 2023-11-01 14:22:06 -05:00
Martin Kroeker 0de786cfa6
Merge pull request #4278 from martin-frbg/issue4277
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker 9f7c35a4a8
Merge pull request #4279 from martin-frbg/issue4269
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker 728788f667
typo fix 2023-10-31 11:08:22 +01:00
Martin Kroeker d003ad630b
Increase the default GEMM buffer size on modern ARM server cpus 2023-10-31 10:26:38 +01:00