Commit Graph

6596 Commits

Author SHA1 Message Date
Martin Kroeker
f703846ad9 Add function prototypes 2022-08-13 11:38:27 +02:00
Martin Kroeker
8a5ad01acd Merge pull request #3725 from FlyGoat/alpha
Alpha build fixes
2022-08-11 20:38:07 +02:00
Martin Kroeker
0cca127907 Merge pull request #3726 from martin-frbg/fixc99
Fix C99-style declaration in converted LAPACK code
2022-08-11 20:36:49 +02:00
Martin Kroeker
be82449c3f Merge pull request #3727 from martin-frbg/traviswait-2
Double the wait time for ppc jobs in Travis CI
2022-08-11 19:49:19 +02:00
Martin Kroeker
3fc06b6e7e Double the wait time for ppc jobs in Travis CI 2022-08-11 16:42:02 +02:00
Martin Kroeker
704a024df4 Fix C99-style declaration of loop variable 2022-08-11 16:37:23 +02:00
Jiaxun Yang
407af4b6aa Document alpha targets
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:48 +01:00
Jiaxun Yang
a03ed065e1 Wire up alpha in new build system
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:46 +01:00
Jiaxun Yang
50c4eeb97d alpha: Remove include of version.h
It will be defined by preprocessor argument.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang
fa14bdb26d Entitle missing declearation for alpha
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang
99f82f1f81 alpha: Use mb for rmb
Alpha never have a rmb instruction.
Just use mb for rmb.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Martin Kroeker
0946b1ea7e Merge pull request #3719 from ivan-pi/lsame
Add const attribute to C LAPACK lsame
2022-08-09 22:15:26 +02:00
Martin Kroeker
5d577ad7b9 Merge pull request #3721 from isuruf/patch-1
Fix building tests when NOFORTRAN is not given at all
2022-08-09 20:24:40 +02:00
Martin Kroeker
b94944777b Merge pull request #3718 from martin-frbg/fixsbtest
Fix SBGEMM test to work with INTERFACE64
2022-08-09 09:45:22 +02:00
Isuru Fernando
d54971fe20 Fix building tests when NOFORTRAN is not given at all 2022-08-08 11:20:38 -05:00
Martin Kroeker
3d338b57de remove spurious loops 2022-08-08 17:09:45 +02:00
Ivan Pribec
802e71bf05 Add const attribute to lsame 2022-08-08 15:15:52 +02:00
Martin Kroeker
d9dc015cfc Use blasint for INTERFACE64 compatibility 2022-08-08 14:52:10 +02:00
Martin Kroeker
e671d0386b Update version to 0.3.21.dev 2022-08-07 23:03:20 +02:00
Martin Kroeker
1dd979959d set version to 0.3.21.dev 2022-08-07 23:02:36 +02:00
Martin Kroeker
94cba8e3c5 Merge pull request #3716 from martin-frbg/0321changes
Update Changelog for 0.3.21
2022-08-07 22:30:58 +02:00
Martin Kroeker
25ce2e2a63 Update with 0.3.21 changes 2022-08-07 22:21:23 +02:00
Martin Kroeker
9f89b62b25 Merge pull request #3715 from martin-frbg/issue3648
Increase thresholds for STFSM and CTFSM in the LAPACK testsuite
2022-08-07 08:45:06 +02:00
Martin Kroeker
9c4e91a77d Increase threshold 2022-08-07 00:03:50 +02:00
Martin Kroeker
1fe06caf49 Increase threshold 2022-08-07 00:03:20 +02:00
Martin Kroeker
ff58e9a7f1 Merge pull request #3609 from martin-frbg/lapack3101
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-08-06 14:31:56 +02:00
Martin Kroeker
f6a1854ce9 resync gensymbol with develop 2022-08-06 09:29:09 +02:00
Martin Kroeker
2bee490287 Merge pull request #3714 from martin-frbg/crosscmake
Add more x86_64 target definitions for CMAKE cross-compiling
2022-08-04 23:58:21 +02:00
Martin Kroeker
85fd3c4279 Support compilation with the Cray C and Fortran compilers (#3712)
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
3784b3d45c Add more x86_64 target definitions for cross-compiling 2022-08-04 19:18:32 +02:00
Martin Kroeker
096ae6f2bd Merge pull request #3709 from nursik/develop
Add TCORE Generic
2022-08-03 15:43:27 +02:00
Martin Kroeker
19fefd100e Merge pull request #3703 from martin-frbg/omp_adaptive
Add env variable OMP_ADAPTIVE to control OMP threadpool behaviour
2022-08-03 15:38:39 +02:00
Martin Kroeker
2e51a61914 Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1
corrected indentation of for and if statement dgemv_thread_safety.cpp
2022-08-03 15:38:14 +02:00
Nursultan Zarlyk
a7ac252fd9 Add TCORE Generic in prebuild.cmake
During the cross-compilation on x64 host with MSVC for ARMv8, the
build fails as there is no define directives for Generic core.
2022-08-02 10:50:58 +02:00
Martin Kroeker
648a69a67e Merge pull request #3707 from martin-frbg/getarch_risc
Fix crash in RISCV autodetection when pmodel is not present in /proc/cpuinfo
2022-07-31 10:13:38 +02:00
Martin Kroeker
ef9c976a94 Really fix compilation; fix crash when pmodel is not present in cpuinfo 2022-07-31 00:41:04 +02:00
Martin Kroeker
f727235be4 Merge pull request #3706 from martin-frbg/czifunding
Acknowledge past CZI EOSS 1/EOSS 3 funding
2022-07-30 14:11:45 +02:00
Martin Kroeker
880bc1d1db Acknowledge past CZI EOSS 1/EOSS 3 funding 2022-07-30 12:34:09 +02:00
Martin Kroeker
d0ba257de0 Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 20:31:20 +02:00
Martin Kroeker
78da6a750a Merge pull request #3705 from RajalakshmiSR/bf16ppc
POWER: Enable bfloat16 kernels by default
2022-07-28 18:38:14 +02:00
Rajalakshmi Srinivasaraghavan
1d97405c02 POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
gxw
fbfe1daf6e LoongArch64: Add DYNAMIC_ARCH support 2022-07-28 14:28:45 +08:00
Martin Kroeker
80cdfed7b2 Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size 2022-07-27 23:43:20 +02:00
Martin Kroeker
08e3754b39 Add environment variable OMP_ADAPTIVE 2022-07-27 23:41:47 +02:00
Martin Kroeker
047a279f09 Merge pull request #3702 from martin-frbg/issue3687
Add openblas_getaffinity() extension (Linux-only)
2022-07-27 20:57:50 +02:00
Martin Kroeker
30473b6a9d add openblas_getaffinity() 2022-07-27 19:15:18 +02:00
Martin Kroeker
8668571040 add openblas_getaffinity() 2022-07-27 19:14:36 +02:00
Martin Kroeker
daca01622b fix detection of Neoverse V1 and user-enforced selection of N2 in ARM64 DYNAMIC_ARCH (#3700)
* fix detection of Neoverse V1 and user-enforced selection of N2
2022-07-27 09:17:43 +02:00
Martin Kroeker
c322aab685 Merge pull request #3684 from imzhuhl/neoversen2_dynamic_arch
Neoverse N2: DYNAMIC_ARCH
2022-07-26 20:06:26 +02:00
Martin Kroeker
cf796aee8c Merge pull request #3699 from martin-frbg/issue3692
Add c_check recognition of Fujitsu fcc for Fugaku A64FX
2022-07-26 16:36:43 +02:00