Martin Kroeker
c9378badd9
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904)
2023-11-12 13:56:06 +01:00
Martin Kroeker
225036fd92
Apply ROUNDUP_LWORK (Reference-LAPACK PR 904)
2023-11-12 13:43:22 +01:00
Martin Kroeker
eef4d15369
Merge pull request #4308 from martin-frbg/issue4277-2
...
Add workaround for omp_get_max_threads hanging on FreeBSD/LLVM14
2023-11-12 13:08:43 +01:00
Martin Kroeker
58427ff74d
Deprecate ?GELQS and ?GEQRS from TESTING/LIN (Reference-LAPACK PR 900) ( #4307 )
...
* Move ?GELQS and ?GEQRS from TESTING/LIN to DEPRECATED (Reference-LAPACK PR 900)
* Add f2c-converted versions of ?GELQS and ?GEQRS
2023-11-12 10:54:39 +01:00
Martin Kroeker
b6144f70ff
Change ?GECON to return INFO=1 if RCOND is NaN (Reference-LAPACK PR 926)
2023-11-11 23:41:18 +01:00
Martin Kroeker
00ef1bb58a
Merge pull request #4306 from angsch/develop
...
Improve matcopy interface
2023-11-11 23:19:10 +01:00
Martin Kroeker
9324520d0e
typo fix
2023-11-11 23:14:58 +01:00
Martin Kroeker
ff6437f2d7
Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14
2023-11-11 21:30:32 +01:00
Martin Kroeker
9c3c1cfbd6
Merge pull request #4304 from martin-frbg/issue4277
...
Move clang/gfortran OpenMP dependency rewriting out of f_check
2023-11-11 20:58:21 +01:00
Martin Kroeker
cad10a3caa
Merge pull request #4303 from martin-frbg/ryzen-avx512
...
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker
95ed8125fa
Merge pull request #4290 from martin-frbg/issue4275-2
...
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Angelika Schwarz
5ffbe646e1
Improve matcopy interface
...
* rows = 0 or cols = 0 is now a legal input and
takes quick return path
* Follow BLAS/LAPACK convention that the leading
dimensions must be at least 1.
2023-11-11 11:16:10 +01:00
Ralf Gommers
8613632dc5
Trigger AWS Graviton 3 CI job only for develop and release branches
2023-11-10 16:34:47 +01:00
Ralf Gommers
2418a20f1f
Cancel running CI jobs when new changes are pushed to a PR
...
The `group` expression ensures that the cancel-in-progress
behavior is to only cancel if a new commit is pushed to the PR for
which the job is running, not other PRs.
This is a fairly standard snippet, used also in CI jobs for NumPy
and other projects.
2023-11-10 16:34:47 +01:00
Ralf Gommers
1d4ed20c2f
Add conditions to all CI jobs to only run on main repo by default
...
This is a follow-up to gh-4271. At the moment, when a contributor
pushes the latest `develop` to their own branch to bring their own
fork in sync with `main`, or if they push another branch, this triggers
30 CI jobs to run. Most will complete silently and only burn CPU
time unnecessarily. If there's a failure, this may result in unexpected
failure notifications. And the AWS Graviton3 run won't complete at all
and time out, since the Cirun hook will only work when triggered from
the main repo.
2023-11-10 16:34:47 +01:00
Martin Kroeker
1a308a0066
Move OpenMP dependency handling for clang/gfortran combo
2023-11-10 15:27:46 +01:00
Martin Kroeker
3ad27007fc
rebase
2023-11-10 15:19:11 +01:00
Martin Kroeker
778e3b746a
Enable autodetection of current AMD cpus as their AVX512 Intel counterparts
2023-11-10 11:55:29 +01:00
Martin Kroeker
273f4e8794
Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
...
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner
3ada8fbf8a
Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles
2023-11-08 11:17:45 -06:00
Martin Kroeker
0a609cc689
Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
...
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner
4eecccd49b
Fix __builtin_cpu_is for AIX.
2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-07 20:58:34 -06:00
Chip Kerchner
206e76187e
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX.
2023-11-07 18:08:57 -06:00
Martin Kroeker
45da7e6c91
Merge pull request #4288 from martin-frbg/fix3915
...
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker
c14b1ebd30
Merge pull request #4295 from martin-frbg/lapack928
...
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker
fea1d4f66c
Merge pull request #4294 from martin-frbg/lapack909
...
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker
6f11992dbb
Merge pull request #4293 from martin-frbg/lapack927
...
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker
176cc6348e
Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928)
2023-11-06 21:55:19 +01:00
Martin Kroeker
f6ec777701
Fix accumulation (Reference-LAPACK PR 909)
2023-11-06 21:40:50 +01:00
Martin Kroeker
cf8295da5c
Fix potential overflow in the calculation of MAXIT
2023-11-06 21:22:26 +01:00
Martin Kroeker
ac7efc61fd
Put more build information into Makefile.conf_last
2023-11-06 00:03:33 +01:00
Martin Kroeker
cd8eb83bae
Fix allocations and compiler warnings in ZROTG ( #4289 )
...
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker
3a86fde5a7
Delete misplaced file from PR 3915 after applying in the correct place
2023-11-04 23:52:46 +01:00
Martin Kroeker
04bc801999
(Re)apply fixes for supporting only a subset of precision types from PR 3915
2023-11-04 23:48:59 +01:00
Martin Kroeker
25096e5afe
Merge pull request #4287 from martin-frbg/issue4282
...
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker
9019bc4945
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 22:10:06 +01:00
Martin Kroeker
4c6ee3f18e
Merge pull request #4285 from martin-frbg/cortexfix
...
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker
3bfa4d4dcc
Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE
2023-11-03 14:55:31 +01:00
Martin Kroeker
59444415ac
Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
...
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner
7dcb2d67f2
Have POWER7 return arch=POWER6.
2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8
Remove POWER7 from cpu list.
2023-11-01 14:53:55 -05:00
Chip-Kerchner
d8e6e2b04d
Merge branch 'develop' into dynamicDispatchAIXandClang
2023-11-01 14:22:06 -05:00
Martin Kroeker
0de786cfa6
Merge pull request #4278 from martin-frbg/issue4277
...
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker
9f7c35a4a8
Merge pull request #4279 from martin-frbg/issue4269
...
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker
728788f667
typo fix
2023-10-31 11:08:22 +01:00
Martin Kroeker
d003ad630b
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 10:26:38 +01:00
Martin Kroeker
dc1c880782
fix libgfortran path on bsd
2023-10-28 23:14:36 +02:00
Martin Kroeker
289a5f6d9b
work around libgfortran install issue on FreeBSD
2023-10-28 18:44:58 +02:00
Martin Kroeker
1cec1c0fc7
Add FreeBSD clang/gfortran build with OpenMP
2023-10-28 14:43:19 +02:00