Martin Kroeker
778e3b746a
Enable autodetection of current AMD cpus as their AVX512 Intel counterparts
2023-11-10 11:55:29 +01:00
Martin Kroeker
273f4e8794
Merge pull request #4300 from ChipKerchner/powerFortranAIXMakefiles
...
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos on AIX
2023-11-08 23:33:17 +01:00
Chip Kerchner
3ada8fbf8a
Merge remote-tracking branch 'origin/develop' into powerFortranAIXMakefiles
2023-11-08 11:17:45 -06:00
Martin Kroeker
0a609cc689
Merge pull request #4298 from ChipKerchner/fixBuiltinsDynamicArch
...
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-08 15:57:57 +01:00
Chip-Kerchner
4eecccd49b
Fix __builtin_cpu_is for AIX.
2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-07 20:58:34 -06:00
Chip Kerchner
206e76187e
Fix FCOMMON_OPT for power. Error out for certain C and Fortran compiler combos in AIX.
2023-11-07 18:08:57 -06:00
Martin Kroeker
45da7e6c91
Merge pull request #4288 from martin-frbg/fix3915
...
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker
c14b1ebd30
Merge pull request #4295 from martin-frbg/lapack928
...
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker
fea1d4f66c
Merge pull request #4294 from martin-frbg/lapack909
...
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker
6f11992dbb
Merge pull request #4293 from martin-frbg/lapack927
...
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker
176cc6348e
Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928)
2023-11-06 21:55:19 +01:00
Martin Kroeker
f6ec777701
Fix accumulation (Reference-LAPACK PR 909)
2023-11-06 21:40:50 +01:00
Martin Kroeker
cf8295da5c
Fix potential overflow in the calculation of MAXIT
2023-11-06 21:22:26 +01:00
Martin Kroeker
ac7efc61fd
Put more build information into Makefile.conf_last
2023-11-06 00:03:33 +01:00
Martin Kroeker
cd8eb83bae
Fix allocations and compiler warnings in ZROTG ( #4289 )
...
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker
3a86fde5a7
Delete misplaced file from PR 3915 after applying in the correct place
2023-11-04 23:52:46 +01:00
Martin Kroeker
04bc801999
(Re)apply fixes for supporting only a subset of precision types from PR 3915
2023-11-04 23:48:59 +01:00
Martin Kroeker
25096e5afe
Merge pull request #4287 from martin-frbg/issue4282
...
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker
9019bc4945
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 22:10:06 +01:00
Martin Kroeker
4c6ee3f18e
Merge pull request #4285 from martin-frbg/cortexfix
...
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker
3bfa4d4dcc
Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE
2023-11-03 14:55:31 +01:00
Martin Kroeker
59444415ac
Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
...
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner
7dcb2d67f2
Have POWER7 return arch=POWER6.
2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8
Remove POWER7 from cpu list.
2023-11-01 14:53:55 -05:00
Chip-Kerchner
d8e6e2b04d
Merge branch 'develop' into dynamicDispatchAIXandClang
2023-11-01 14:22:06 -05:00
Martin Kroeker
0de786cfa6
Merge pull request #4278 from martin-frbg/issue4277
...
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker
9f7c35a4a8
Merge pull request #4279 from martin-frbg/issue4269
...
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker
728788f667
typo fix
2023-10-31 11:08:22 +01:00
Martin Kroeker
d003ad630b
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 10:26:38 +01:00
Martin Kroeker
dc1c880782
fix libgfortran path on bsd
2023-10-28 23:14:36 +02:00
Martin Kroeker
289a5f6d9b
work around libgfortran install issue on FreeBSD
2023-10-28 18:44:58 +02:00
Martin Kroeker
1cec1c0fc7
Add FreeBSD clang/gfortran build with OpenMP
2023-10-28 14:43:19 +02:00
Martin Kroeker
9d425a5fe7
Merge pull request #4276 from martin-frbg/issue4275
...
Clarify make/make install in the README and update the TARGET list there
2023-10-28 14:34:17 +02:00
Martin Kroeker
f5e1f20f4d
Update target list
2023-10-27 17:10:37 +02:00
Martin Kroeker
a7f73c764c
Clarify "make" options and the need to repeat them in the install step
2023-10-27 16:48:47 +02:00
Chip Kerchner
badfb2e60f
Merge branch 'develop' into XLC-AIX
2023-10-26 09:19:31 -05:00
Martin Kroeker
96f8bb1eb9
Merge pull request #4272 from RajalakshmiSR/AIX_AS
...
POWER: AIX: Make use of power10 optimization
2023-10-24 12:08:51 +02:00
Rajalakshmi Srinivasaraghavan
980f702f72
POWER: AIX: Make use of power10 optimization
...
POWER10 optimizations are disabled when using default AIX assembler.
As we have fixed many issues recently, enabling optimization path
for default assembler.
2023-10-19 18:48:19 -05:00
Martin Kroeker
68906a98c7
Merge pull request #4271 from rgommers/homebrew-nightly-on-main-repo
...
Run nightly Homebrew cron job only on the main repo, not on forks
2023-10-19 13:28:24 +02:00
Ralf Gommers
6b8379d6d9
Run nightly Homebrew cron job only on the main repo, not on forks
...
I noticed this because GitHub emailed me that it would disable the
nightly job because it hadn't changed for 3 months. It currently takes
30-50 minutes daily, and by default runs on all forks of the main
repository that have the relevant workflow yaml file. That serves little
purpose and wastes quite a bit of energy - so disable the runs outside
of the main repo.
This will not disable the runs on forks already made in the past that
contain this workflow file, but it does save 3 months worth of runs on
every new fork that is created.
[skip ci]
2023-10-19 11:38:26 +02:00
Martin Kroeker
0799b0d215
Merge pull request #4266 from martin-frbg/gh-mingw-ucrt
...
GH Workflows: Switch MINGW-W64 jobs to UCRT
2023-10-18 18:58:32 +02:00
Martin Kroeker
5c411ac7a8
Merge pull request #4268 from martin-frbg/issue4267
...
Fix unwanted "hard" fallback to Prescott in runtime detection of Intel cpus
2023-10-18 17:47:33 +02:00
Martin Kroeker
e12aaed13d
Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure
2023-10-18 16:28:54 +02:00
Martin Kroeker
f8c230c21c
Switch MINGW-W64 jobs to UCRT
2023-10-18 11:58:54 +02:00
Martin Kroeker
c28d71c6fb
Merge pull request #4265 from martin-frbg/issue4228
...
Fix compilation with (the fortran compiler from) Cray CCE
2023-10-17 15:08:30 +02:00
Martin Kroeker
b41cab0875
Need to use override to actually strip down the already defined FFLAGS for NAG and CCE Fortran
2023-10-16 22:20:59 +02:00
Martin Kroeker
301e2ecc49
Cray Fortran uses -O in combinations like -O omp so don't filter that out
2023-10-16 22:15:46 +02:00
Martin Kroeker
66c2c41e99
Merge pull request #4260 from RajalakshmiSR/AIX-M4
...
POWER: Increase macro size limit for AIX
2023-10-13 10:51:23 +02:00
Martin Kroeker
425bcc1f8b
Merge pull request #4256 from ChipKerchner/fixBfloat16BitsStruct
...
Fix bfloat16_bits union so that it always the sizeof unsigned short for AIX.
2023-10-12 22:01:50 +02:00