Commit Graph

7546 Commits

Author SHA1 Message Date
Martin Kroeker cad10a3caa
Merge pull request #4303 from martin-frbg/ryzen-avx512
Enable autodetection of Zen 3/4 cpus as their AVX512 Intel counterparts
2023-11-11 18:36:24 +01:00
Martin Kroeker 95ed8125fa
Merge pull request #4290 from martin-frbg/issue4275-2
Put more build information into Makefile.conf_last
2023-11-11 15:28:57 +01:00
Martin Kroeker 3ad27007fc
rebase 2023-11-10 15:19:11 +01:00
Martin Kroeker 778e3b746a
Enable autodetection of current AMD cpus as their AVX512 Intel counterparts 2023-11-10 11:55:29 +01:00
Martin Kroeker 273f4e8794
Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
Fix FCOMMON_OPT for power.  Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner 3ada8fbf8a Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles 2023-11-08 11:17:45 -06:00
Martin Kroeker 0a609cc689
Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner 4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner 5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Chip Kerchner 206e76187e Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX. 2023-11-07 18:08:57 -06:00
Martin Kroeker 45da7e6c91
Merge pull request #4288 from martin-frbg/fix3915
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker c14b1ebd30
Merge pull request #4295 from martin-frbg/lapack928
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker fea1d4f66c
Merge pull request #4294 from martin-frbg/lapack909
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker 6f11992dbb
Merge pull request #4293 from martin-frbg/lapack927
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker 176cc6348e
Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928) 2023-11-06 21:55:19 +01:00
Martin Kroeker f6ec777701
Fix accumulation (Reference-LAPACK PR 909) 2023-11-06 21:40:50 +01:00
Martin Kroeker cf8295da5c
Fix potential overflow in the calculation of MAXIT 2023-11-06 21:22:26 +01:00
Martin Kroeker ac7efc61fd
Put more build information into Makefile.conf_last 2023-11-06 00:03:33 +01:00
Martin Kroeker cd8eb83bae
Fix allocations and compiler warnings in ZROTG (#4289)
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker 3a86fde5a7
Delete misplaced file from PR 3915 after applying in the correct place 2023-11-04 23:52:46 +01:00
Martin Kroeker 04bc801999
(Re)apply fixes for supporting only a subset of precision types from PR 3915 2023-11-04 23:48:59 +01:00
Martin Kroeker 25096e5afe
Merge pull request #4287 from martin-frbg/issue4282
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker 9019bc4945
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well 2023-11-04 22:10:06 +01:00
Martin Kroeker 4c6ee3f18e
Merge pull request #4285 from martin-frbg/cortexfix
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker 3bfa4d4dcc
Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE 2023-11-03 14:55:31 +01:00
Martin Kroeker 59444415ac
Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner 7dcb2d67f2 Have POWER7 return arch=POWER6. 2023-11-01 15:23:28 -05:00
Chip-Kerchner c8882bd9d8 Remove POWER7 from cpu list. 2023-11-01 14:53:55 -05:00
Chip-Kerchner d8e6e2b04d Merge branch 'develop' into dynamicDispatchAIXandClang 2023-11-01 14:22:06 -05:00
Martin Kroeker 0de786cfa6
Merge pull request #4278 from martin-frbg/issue4277
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker 9f7c35a4a8
Merge pull request #4279 from martin-frbg/issue4269
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker 728788f667
typo fix 2023-10-31 11:08:22 +01:00
Martin Kroeker d003ad630b
Increase the default GEMM buffer size on modern ARM server cpus 2023-10-31 10:26:38 +01:00
Martin Kroeker dc1c880782
fix libgfortran path on bsd 2023-10-28 23:14:36 +02:00
Martin Kroeker 289a5f6d9b
work around libgfortran install issue on FreeBSD 2023-10-28 18:44:58 +02:00
Martin Kroeker 1cec1c0fc7
Add FreeBSD clang/gfortran build with OpenMP 2023-10-28 14:43:19 +02:00
Martin Kroeker 9d425a5fe7
Merge pull request #4276 from martin-frbg/issue4275
Clarify make/make install in the README and update the TARGET list there
2023-10-28 14:34:17 +02:00
Martin Kroeker f5e1f20f4d
Update target list 2023-10-27 17:10:37 +02:00
Martin Kroeker a7f73c764c
Clarify "make" options and the need to repeat them in the install step 2023-10-27 16:48:47 +02:00
Chip Kerchner badfb2e60f Merge branch 'develop' into XLC-AIX 2023-10-26 09:19:31 -05:00
Martin Kroeker 96f8bb1eb9
Merge pull request #4272 from RajalakshmiSR/AIX_AS
POWER: AIX: Make use of power10 optimization
2023-10-24 12:08:51 +02:00
Rajalakshmi Srinivasaraghavan 980f702f72 POWER: AIX: Make use of power10 optimization
POWER10 optimizations are disabled when using default AIX assembler.
As we have fixed many issues recently, enabling optimization path
for default assembler.
2023-10-19 18:48:19 -05:00
Martin Kroeker 68906a98c7
Merge pull request #4271 from rgommers/homebrew-nightly-on-main-repo
Run nightly Homebrew cron job only on the main repo, not on forks
2023-10-19 13:28:24 +02:00
Ralf Gommers 6b8379d6d9 Run nightly Homebrew cron job only on the main repo, not on forks
I noticed this because GitHub emailed me that it would disable the
nightly job because it hadn't changed for 3 months. It currently takes
30-50 minutes daily, and by default runs on all forks of the main
repository that have the relevant workflow yaml file. That serves little
purpose and wastes quite a bit of energy - so disable the runs outside
of the main repo.

This will not disable the runs on forks already made in the past that
contain this workflow file, but it does save 3 months worth of runs on
every new fork that is created.

[skip ci]
2023-10-19 11:38:26 +02:00
Martin Kroeker 0799b0d215
Merge pull request #4266 from martin-frbg/gh-mingw-ucrt
GH Workflows: Switch MINGW-W64 jobs to UCRT
2023-10-18 18:58:32 +02:00
Martin Kroeker 5c411ac7a8
Merge pull request #4268 from martin-frbg/issue4267
Fix unwanted "hard" fallback to Prescott in runtime detection of Intel cpus
2023-10-18 17:47:33 +02:00
Martin Kroeker e12aaed13d
Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure 2023-10-18 16:28:54 +02:00
Martin Kroeker f8c230c21c
Switch MINGW-W64 jobs to UCRT 2023-10-18 11:58:54 +02:00
Martin Kroeker c28d71c6fb
Merge pull request #4265 from martin-frbg/issue4228
Fix compilation with (the fortran compiler from) Cray CCE
2023-10-17 15:08:30 +02:00
Martin Kroeker b41cab0875
Need to use override to actually strip down the already defined FFLAGS for NAG and CCE Fortran 2023-10-16 22:20:59 +02:00