Martin Kroeker
eef4d15369
Merge pull request #4308 from martin-frbg/issue4277-2
...
Add workaround for omp_get_max_threads hanging on FreeBSD/LLVM14
2023-11-12 13:08:43 +01:00
Martin Kroeker
58427ff74d
Deprecate ?GELQS and ?GEQRS from TESTING/LIN (Reference-LAPACK PR 900) (#4307)
...
* Move ?GELQS and ?GEQRS from TESTING/LIN to DEPRECATED (Reference-LAPACK PR 900)
* Add f2c-converted versions of ?GELQS and ?GEQRS
2023-11-12 10:54:39 +01:00
Martin Kroeker
00ef1bb58a
Merge pull request #4306 from angsch/develop
...
Improve matcopy interface
2023-11-11 23:19:10 +01:00
Martin Kroeker
9324520d0e
typo fix
2023-11-11 23:14:58 +01:00
Martin Kroeker
ff6437f2d7
Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14
2023-11-11 21:30:32 +01:00
Martin Kroeker
9c3c1cfbd6
Merge pull request #4304 from martin-frbg/issue4277
...
Move clang/gfortran OpenMP dependency rewriting out of f_check
2023-11-11 20:58:21 +01:00
Martin Kroeker
cad10a3caa
Merge pull request #4303 from martin-frbg/ryzen-avx512
...
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker
95ed8125fa
Merge pull request #4290 from martin-frbg/issue4275-2
...
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Angelika Schwarz
5ffbe646e1
Improve matcopy interface
...
* rows = 0 or cols = 0 is now a legal input and takes the quick return path
* Follow the BLAS/LAPACK convention that the leading dimensions must be at least 1.
2023-11-11 11:16:10 +01:00
Martin Kroeker
1a308a0066
Move OpenMP dependency handling for clang/gfortran combo
2023-11-10 15:27:46 +01:00
Martin Kroeker
3ad27007fc
rebase
2023-11-10 15:19:11 +01:00
Martin Kroeker
778e3b746a
Enable autodetection of current AMD cpus as their AVX512 Intel counterparts
2023-11-10 11:55:29 +01:00
Martin Kroeker
273f4e8794
Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
...
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner
3ada8fbf8a
Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles
2023-11-08 11:17:45 -06:00
Martin Kroeker
0a609cc689
Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
...
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner
4eecccd49b
Fix __builtin_cpu_is for AIX.
2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-07 20:58:34 -06:00
Chip Kerchner
206e76187e
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX.
2023-11-07 18:08:57 -06:00
Martin Kroeker
45da7e6c91
Merge pull request #4288 from martin-frbg/fix3915
...
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker
c14b1ebd30
Merge pull request #4295 from martin-frbg/lapack928
...
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker
fea1d4f66c
Merge pull request #4294 from martin-frbg/lapack909
...
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker
6f11992dbb
Merge pull request #4293 from martin-frbg/lapack927
...
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker
176cc6348e
Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928)
2023-11-06 21:55:19 +01:00
Martin Kroeker
f6ec777701
Fix accumulation (Reference-LAPACK PR 909)
2023-11-06 21:40:50 +01:00
Martin Kroeker
cf8295da5c
Fix potential overflow in the calculation of MAXIT
2023-11-06 21:22:26 +01:00
Martin Kroeker
ac7efc61fd
Put more build information into Makefile.conf_last
2023-11-06 00:03:33 +01:00
Martin Kroeker
cd8eb83bae
Fix allocations and compiler warnings in ZROTG (#4289)
...
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker
3a86fde5a7
Delete misplaced file from PR 3915 after applying in the correct place
2023-11-04 23:52:46 +01:00
Martin Kroeker
04bc801999
(Re)apply fixes for supporting only a subset of precision types from PR 3915
2023-11-04 23:48:59 +01:00
Martin Kroeker
25096e5afe
Merge pull request #4287 from martin-frbg/issue4282
...
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker
9019bc4945
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 22:10:06 +01:00
Martin Kroeker
4c6ee3f18e
Merge pull request #4285 from martin-frbg/cortexfix
...
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker
3bfa4d4dcc
Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE
2023-11-03 14:55:31 +01:00
Martin Kroeker
59444415ac
Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
...
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner
7dcb2d67f2
Have POWER7 return arch=POWER6.
2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8
Remove POWER7 from cpu list.
2023-11-01 14:53:55 -05:00
Chip-Kerchner
d8e6e2b04d
Merge branch 'develop' into dynamicDispatchAIXandClang
2023-11-01 14:22:06 -05:00
Martin Kroeker
0de786cfa6
Merge pull request #4278 from martin-frbg/issue4277
...
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker
9f7c35a4a8
Merge pull request #4279 from martin-frbg/issue4269
...
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker
728788f667
typo fix
2023-10-31 11:08:22 +01:00
Martin Kroeker
d003ad630b
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 10:26:38 +01:00
Martin Kroeker
dc1c880782
fix libgfortran path on bsd
2023-10-28 23:14:36 +02:00
Martin Kroeker
289a5f6d9b
work around libgfortran install issue on FreeBSD
2023-10-28 18:44:58 +02:00
Martin Kroeker
1cec1c0fc7
Add FreeBSD clang/gfortran build with OpenMP
2023-10-28 14:43:19 +02:00
Martin Kroeker
9d425a5fe7
Merge pull request #4276 from martin-frbg/issue4275
...
Clarify make/make install in the README and update the TARGET list there
2023-10-28 14:34:17 +02:00
Martin Kroeker
f5e1f20f4d
Update target list
2023-10-27 17:10:37 +02:00
Martin Kroeker
a7f73c764c
Clarify "make" options and the need to repeat them in the install step
2023-10-27 16:48:47 +02:00
Chip Kerchner
badfb2e60f
Merge branch 'develop' into XLC-AIX
2023-10-26 09:19:31 -05:00
Martin Kroeker
96f8bb1eb9
Merge pull request #4272 from RajalakshmiSR/AIX_AS
...
POWER: AIX: Make use of power10 optimization
2023-10-24 12:08:51 +02:00
Rajalakshmi Srinivasaraghavan
980f702f72
POWER: AIX: Make use of power10 optimization
...
POWER10 optimizations were disabled when using the default AIX assembler.
As many issues have been fixed recently, enable the optimization path
for the default assembler.
2023-10-19 18:48:19 -05:00