Commit Graph

7549 Commits

Author SHA1 Message Date
Martin Kroeker
ff6437f2d7 Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14 2023-11-11 21:30:32 +01:00
Martin Kroeker
9c3c1cfbd6 Merge pull request #4304 from martin-frbg/issue4277
Move clang/gfortran OpenMP dependency rewriting out of f_check
2023-11-11 20:58:21 +01:00
Martin Kroeker
cad10a3caa Merge pull request #4303 from martin-frbg/ryzen-avx512
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker
95ed8125fa Merge pull request #4290 from martin-frbg/issue4275-2
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Martin Kroeker
1a308a0066 Move OpenMP dependency handling for clang/gfortran combo 2023-11-10 15:27:46 +01:00
Martin Kroeker
3ad27007fc rebase 2023-11-10 15:19:11 +01:00
Martin Kroeker
778e3b746a Enable autodetection of current AMD cpus as their AVX512 Intel counterparts 2023-11-10 11:55:29 +01:00
Martin Kroeker
273f4e8794 Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
Fix FCOMMON_OPT for power.  Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner
3ada8fbf8a Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles 2023-11-08 11:17:45 -06:00
Martin Kroeker
0a609cc689 Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner
4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Chip Kerchner
206e76187e Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX. 2023-11-07 18:08:57 -06:00
Martin Kroeker
45da7e6c91 Merge pull request #4288 from martin-frbg/fix3915
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker
c14b1ebd30 Merge pull request #4295 from martin-frbg/lapack928
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker
fea1d4f66c Merge pull request #4294 from martin-frbg/lapack909
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker
6f11992dbb Merge pull request #4293 from martin-frbg/lapack927
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker
176cc6348e Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928) 2023-11-06 21:55:19 +01:00
Martin Kroeker
f6ec777701 Fix accumulation (Reference-LAPACK PR 909) 2023-11-06 21:40:50 +01:00
Martin Kroeker
cf8295da5c Fix potential overflow in the calculation of MAXIT 2023-11-06 21:22:26 +01:00
Martin Kroeker
ac7efc61fd Put more build information into Makefile.conf_last 2023-11-06 00:03:33 +01:00
Martin Kroeker
cd8eb83bae Fix allocations and compiler warnings in ZROTG (#4289)
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker
3a86fde5a7 Delete misplaced file from PR 3915 after applying in the correct place 2023-11-04 23:52:46 +01:00
Martin Kroeker
04bc801999 (Re)apply fixes for supporting only a subset of precision types from PR 3915 2023-11-04 23:48:59 +01:00
Martin Kroeker
25096e5afe Merge pull request #4287 from martin-frbg/issue4282
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker
9019bc4945 Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well 2023-11-04 22:10:06 +01:00
Martin Kroeker
4c6ee3f18e Merge pull request #4285 from martin-frbg/cortexfix
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker
3bfa4d4dcc Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE 2023-11-03 14:55:31 +01:00
Martin Kroeker
59444415ac Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner
7dcb2d67f2 Have POWER7 return arch=POWER6. 2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8 Remove POWER7 from cpu list. 2023-11-01 14:53:55 -05:00
Chip-Kerchner
d8e6e2b04d Merge branch 'develop' into dynamicDispatchAIXandClang 2023-11-01 14:22:06 -05:00
Martin Kroeker
0de786cfa6 Merge pull request #4278 from martin-frbg/issue4277
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker
9f7c35a4a8 Merge pull request #4279 from martin-frbg/issue4269
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker
728788f667 typo fix 2023-10-31 11:08:22 +01:00
Martin Kroeker
d003ad630b Increase the default GEMM buffer size on modern ARM server cpus 2023-10-31 10:26:38 +01:00
Martin Kroeker
dc1c880782 fix libgfortran path on bsd 2023-10-28 23:14:36 +02:00
Martin Kroeker
289a5f6d9b work around libgfortran install issue on FreeBSD 2023-10-28 18:44:58 +02:00
Martin Kroeker
1cec1c0fc7 Add FreeBSD clang/gfortran build with OpenMP 2023-10-28 14:43:19 +02:00
Martin Kroeker
9d425a5fe7 Merge pull request #4276 from martin-frbg/issue4275
Clarify make/make install in the README and update the TARGET list there
2023-10-28 14:34:17 +02:00
Martin Kroeker
f5e1f20f4d Update target list 2023-10-27 17:10:37 +02:00
Martin Kroeker
a7f73c764c Clarify "make" options and the need to repeat them in the install step 2023-10-27 16:48:47 +02:00
Chip Kerchner
badfb2e60f Merge branch 'develop' into XLC-AIX 2023-10-26 09:19:31 -05:00
Martin Kroeker
96f8bb1eb9 Merge pull request #4272 from RajalakshmiSR/AIX_AS
POWER: AIX: Make use of power10 optimization
2023-10-24 12:08:51 +02:00
Rajalakshmi Srinivasaraghavan
980f702f72 POWER: AIX: Make use of power10 optimization
POWER10 optimizations are disabled when using default AIX assembler.
As we have fixed many issues recently, enabling optimization path
for default assembler.
2023-10-19 18:48:19 -05:00
Martin Kroeker
68906a98c7 Merge pull request #4271 from rgommers/homebrew-nightly-on-main-repo
Run nightly Homebrew cron job only on the main repo, not on forks
2023-10-19 13:28:24 +02:00
Ralf Gommers
6b8379d6d9 Run nightly Homebrew cron job only on the main repo, not on forks
I noticed this because GitHub emailed me that it would disable the
nightly job because it hadn't changed for 3 months. It currently takes
30-50 minutes daily, and by default runs on all forks of the main
repository that have the relevant workflow yaml file. That serves little
purpose and wastes quite a bit of energy - so disable the runs outside
of the main repo.

This will not disable the runs on forks already made in the past that
contain this workflow file, but it does save 3 months worth of runs on
every new fork that is created.

[skip ci]
2023-10-19 11:38:26 +02:00
Martin Kroeker
0799b0d215 Merge pull request #4266 from martin-frbg/gh-mingw-ucrt
GH Workflows: Switch MINGW-W64 jobs to UCRT
2023-10-18 18:58:32 +02:00
Martin Kroeker
5c411ac7a8 Merge pull request #4268 from martin-frbg/issue4267
Fix unwanted "hard" fallback to Prescott in runtime detection of Intel cpus
2023-10-18 17:47:33 +02:00
Martin Kroeker
e12aaed13d Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure 2023-10-18 16:28:54 +02:00