7537 Commits

Author SHA1 Message Date
Chip-Kerchner
4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Martin Kroeker
45da7e6c91 Merge pull request #4288 from martin-frbg/fix3915
(Re)apply Makefile fix for supporting only a subset of precision types
2023-11-07 17:12:46 +01:00
Martin Kroeker
c14b1ebd30 Merge pull request #4295 from martin-frbg/lapack928
Correct constant ALPHA in ?ORBDB6 (Reference-LAPACK PR 928)
2023-11-07 14:44:08 +01:00
Martin Kroeker
fea1d4f66c Merge pull request #4294 from martin-frbg/lapack909
Fix accumulation in LAPACK ?LASSQ (Reference-LAPACK PR 909)
2023-11-07 12:23:16 +01:00
Martin Kroeker
6f11992dbb Merge pull request #4293 from martin-frbg/lapack927
Fix potential integer overflow in LAPACK C/ZBDSQR (Reference-LAPACK PR 927)
2023-11-07 10:53:28 +01:00
Martin Kroeker
176cc6348e Correct constant ALPHA to match changed algorithm (Reference-LAPACK PR 928) 2023-11-06 21:55:19 +01:00
Martin Kroeker
f6ec777701 Fix accumulation (Reference-LAPACK PR 909) 2023-11-06 21:40:50 +01:00
Martin Kroeker
cf8295da5c Fix potential overflow in the calculation of MAXIT 2023-11-06 21:22:26 +01:00
Martin Kroeker
cd8eb83bae Fix allocations and compiler warnings in ZROTG (#4289)
* Clean up ZROTG
2023-11-05 18:13:37 +01:00
Martin Kroeker
3a86fde5a7 Delete misplaced file from PR 3915 after applying in the correct place 2023-11-04 23:52:46 +01:00
Martin Kroeker
04bc801999 (Re)apply fixes for supporting only a subset of precision types from PR 3915 2023-11-04 23:48:59 +01:00
Martin Kroeker
25096e5afe Merge pull request #4287 from martin-frbg/issue4282
Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well
2023-11-04 23:26:17 +01:00
Martin Kroeker
9019bc4945 Use SkylakeX ?ASUM microkernel for Cooperlake/Sapphirerapids as well 2023-11-04 22:10:06 +01:00
Martin Kroeker
4c6ee3f18e Merge pull request #4285 from martin-frbg/cortexfix
Fix outdated KERNEL files for SVE Cortex by aliasing to ARMV8SVE
2023-11-03 21:32:49 +01:00
Martin Kroeker
3bfa4d4dcc Fix outdated SVE kernel definitions for Cortex cpus by aliasing to ARMV8SVE 2023-11-03 14:55:31 +01:00
Martin Kroeker
59444415ac Merge pull request #4280 from ChipKerchner/dynamicDispatchAIXandClang
Add dynamic dispatch to AIX and clang for Power
2023-11-02 23:43:59 +01:00
Chip-Kerchner
7dcb2d67f2 Have POWER7 return arch=POWER6. 2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8 Remove POWER7 from cpu list. 2023-11-01 14:53:55 -05:00
Chip-Kerchner
d8e6e2b04d Merge branch 'develop' into dynamicDispatchAIXandClang 2023-11-01 14:22:06 -05:00
Martin Kroeker
0de786cfa6 Merge pull request #4278 from martin-frbg/issue4277
CirrusCI: Add FreeBSD clang/gfortran build with OpenMP
2023-11-01 19:45:09 +01:00
Martin Kroeker
9f7c35a4a8 Merge pull request #4279 from martin-frbg/issue4269
Increase the default GEMM buffer size on modern ARM server cpus
2023-10-31 15:41:25 +01:00
Martin Kroeker
728788f667 typo fix 2023-10-31 11:08:22 +01:00
Martin Kroeker
d003ad630b Increase the default GEMM buffer size on modern ARM server cpus 2023-10-31 10:26:38 +01:00
Martin Kroeker
dc1c880782 fix libgfortran path on bsd 2023-10-28 23:14:36 +02:00
Martin Kroeker
289a5f6d9b work around libgfortran install issue on FreeBSD 2023-10-28 18:44:58 +02:00
Martin Kroeker
1cec1c0fc7 Add FreeBSD clang/gfortran build with OpenMP 2023-10-28 14:43:19 +02:00
Martin Kroeker
9d425a5fe7 Merge pull request #4276 from martin-frbg/issue4275
Clarify make/make install in the README and update the TARGET list there
2023-10-28 14:34:17 +02:00
Martin Kroeker
f5e1f20f4d Update target list 2023-10-27 17:10:37 +02:00
Martin Kroeker
a7f73c764c Clarify "make" options and the need to repeat them in the install step 2023-10-27 16:48:47 +02:00
Chip Kerchner
badfb2e60f Merge branch 'develop' into XLC-AIX 2023-10-26 09:19:31 -05:00
Martin Kroeker
96f8bb1eb9 Merge pull request #4272 from RajalakshmiSR/AIX_AS
POWER: AIX: Make use of power10 optimization
2023-10-24 12:08:51 +02:00
Rajalakshmi Srinivasaraghavan
980f702f72 POWER: AIX: Make use of power10 optimization
POWER10 optimizations are disabled when using default AIX assembler.
As we have fixed many issues recently, enabling optimization path
for default assembler.
2023-10-19 18:48:19 -05:00
Martin Kroeker
68906a98c7 Merge pull request #4271 from rgommers/homebrew-nightly-on-main-repo
Run nightly Homebrew cron job only on the main repo, not on forks
2023-10-19 13:28:24 +02:00
Ralf Gommers
6b8379d6d9 Run nightly Homebrew cron job only on the main repo, not on forks
I noticed this because GitHub emailed me that it would disable the
nightly job because it hadn't changed for 3 months. It currently takes
30-50 minutes daily, and by default runs on all forks of the main
repository that have the relevant workflow yaml file. That serves little
purpose and wastes quite a bit of energy - so disable the runs outside
of the main repo.

This will not disable the runs on forks already made in the past that
contain this workflow file, but it does save 3 months worth of runs on
every new fork that is created.

[skip ci]
2023-10-19 11:38:26 +02:00
Martin Kroeker
0799b0d215 Merge pull request #4266 from martin-frbg/gh-mingw-ucrt
GH Workflows: Switch MINGW-W64 jobs to UCRT
2023-10-18 18:58:32 +02:00
Martin Kroeker
5c411ac7a8 Merge pull request #4268 from martin-frbg/issue4267
Fix unwanted "hard" fallback to Prescott in runtime detection of Intel cpus
2023-10-18 17:47:33 +02:00
Martin Kroeker
e12aaed13d Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure 2023-10-18 16:28:54 +02:00
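
The fallthrough described in the entry above can be illustrated with a minimal C sketch. This is not the OpenBLAS detection code: the enum, the function names, and the single model number are hypothetical placeholders, chosen only to show how a missing `break` after the Family 6 branch sends an unrecognized model into the Family 15 branch, and how adding the `break` lets it reach a generic "unknown" result instead.

```c
/* Hypothetical CPU kinds; illustrative names, not OpenBLAS identifiers. */
enum cpu_kind { CPU_UNKNOWN, CPU_HASWELL, CPU_PRESCOTT };

/* Problematic shape: there is no break after the Family 6 model switch,
   so a Family 6 model that is not recognized continues into case 15. */
enum cpu_kind detect_with_fallthrough(int family, int model)
{
    switch (family) {
    case 6:
        switch (model) {
        case 0x3C: return CPU_HASWELL;  /* placeholder model number */
        }
        /* falls through */
    case 15:
        return CPU_PRESCOTT;
    }
    return CPU_UNKNOWN;
}

/* Corrected shape: the break ends the Family 6 branch, so an unrecognized
   model reaches the generic return at the bottom of the function. */
enum cpu_kind detect_fixed(int family, int model)
{
    switch (family) {
    case 6:
        switch (model) {
        case 0x3C: return CPU_HASWELL;  /* placeholder model number */
        }
        break;
    case 15:
        return CPU_PRESCOTT;
    }
    return CPU_UNKNOWN;
}
```

With these definitions, an unrecognized Family 6 model yields `CPU_PRESCOTT` from the first function and `CPU_UNKNOWN` from the second, which leaves the caller free to apply a more suitable fallback.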
Martin Kroeker
f8c230c21c Switch MINGW-W64 jobs to UCRT 2023-10-18 11:58:54 +02:00
Martin Kroeker
c28d71c6fb Merge pull request #4265 from martin-frbg/issue4228
Fix compilation with (the fortran compiler from) Cray CCE
2023-10-17 15:08:30 +02:00
Martin Kroeker
b41cab0875 Need to use override to actually strip down the already defined FFLAGS for NAG and CCE Fortran 2023-10-16 22:20:59 +02:00
Martin Kroeker
301e2ecc49 Cray Fortran uses -O in combinations like -O omp so don't filter that out 2023-10-16 22:15:46 +02:00
Martin Kroeker
66c2c41e99 Merge pull request #4260 from RajalakshmiSR/AIX-M4
POWER: Increase macro size limit for AIX
2023-10-13 10:51:23 +02:00
Martin Kroeker
425bcc1f8b Merge pull request #4256 from ChipKerchner/fixBfloat16BitsStruct
Fix bfloat16_bits union so that it is always the size of unsigned short for AIX.
2023-10-12 22:01:50 +02:00
Martin Kroeker
789cdcc94f Merge pull request #4259 from martin-frbg/azureosxclang
AzureCI: move OSX-Clang jobs to macOS-12 to resolve setup/build timeouts
2023-10-12 20:04:28 +02:00
Rajalakshmi Srinivasaraghavan
9f42570e33 POWER: Increase macro size limit for AIX
This patch increases the macro size limit from 4096 to 16384 to
allow compiling larger assembly files in AIX.
Tested with GCC and IBM Open XL C.
2023-10-12 12:37:40 -05:00
Martin Kroeker
9f49aef91b Merge pull request #4255 from RajalakshmiSR/AIX-P10
POWER10: Fix compilation issues with Open XL C
2023-10-12 18:59:17 +02:00
Martin Kroeker
fe75c88a2c AzureCI: move OSX-Clang jobs to macOS-12 to resolve setup/build timeouts 2023-10-12 18:20:09 +02:00
Chip-Kerchner
d46eba06a7 Pack structure only on AIX. 2023-10-12 09:41:33 -05:00
Martin Kroeker
90231bfc4e Merge pull request #4258 from martin-frbg/issue4257
Fix build on Fujitsu A64FX
2023-10-12 16:38:28 +02:00