Commit Graph

7452 Commits

Author SHA1 Message Date
Martin Kroeker 0cca127907
Merge pull request #3726 from martin-frbg/fixc99
Fix C99-style declaration in converted LAPACK code
2022-08-11 20:36:49 +02:00
Martin Kroeker be82449c3f
Merge pull request #3727 from martin-frbg/traviswait-2
Double the wait time for ppc jobs in Travis CI
2022-08-11 19:49:19 +02:00
Martin Kroeker 3fc06b6e7e
Double the wait time for ppc jobs in Travis CI 2022-08-11 16:42:02 +02:00
Martin Kroeker 704a024df4
Fix C99-style declaration of loop variable 2022-08-11 16:37:23 +02:00
Jiaxun Yang 407af4b6aa Document alpha targets
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:48 +01:00
Jiaxun Yang a03ed065e1 Wire up alpha in new build system
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:46 +01:00
Jiaxun Yang 50c4eeb97d alpha: Remove include of version.h
It will be defined by preprocessor argument.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang fa14bdb26d Entitle missing declearation for alpha
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang 99f82f1f81 alpha: Use mb for rmb
Alpha never have a rmb instruction.
Just use mb for rmb.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang 1569a43f7e GitHub Actions: Add cross compile tests
Add cross compile tests without running checks. Currently
only mips64el, riscv64, mipsel, alpha is wired up.
Just help us make sure those less popular CPUs are not
messed up by changes.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:01:34 +01:00
Jiaxun Yang aaaecdbf19 Allow CC to contain space
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:01:30 +01:00
haanhvu 38e9c969c4 Clarify the installation guide in the output of make 2022-08-10 16:28:50 +07:00
Martin Kroeker 0946b1ea7e
Merge pull request #3719 from ivan-pi/lsame
Add const attribute to C LAPACK lsame
2022-08-09 22:15:26 +02:00
Martin Kroeker 5d577ad7b9
Merge pull request #3721 from isuruf/patch-1
Fix building tests when NOFORTRAN is not given at all
2022-08-09 20:24:40 +02:00
Martin Kroeker b94944777b
Merge pull request #3718 from martin-frbg/fixsbtest
Fix SBGEMM test to work with INTERFACE64
2022-08-09 09:45:22 +02:00
Isuru Fernando d54971fe20
Fix building tests when NOFORTRAN is not given at all 2022-08-08 11:20:38 -05:00
Martin Kroeker 3d338b57de
remove spurious loops 2022-08-08 17:09:45 +02:00
Ivan Pribec 802e71bf05 Add const attribute to lsame 2022-08-08 15:15:52 +02:00
Martin Kroeker d9dc015cfc
Use blasint for INTERFACE64 compatibility 2022-08-08 14:52:10 +02:00
Martin Kroeker e671d0386b
Update version to 0.3.21.dev 2022-08-07 23:03:20 +02:00
Martin Kroeker 1dd979959d
set version to 0.3.21.dev 2022-08-07 23:02:36 +02:00
Martin Kroeker b89fb708ca
Update version to 0.3.21 2022-08-07 22:36:26 +02:00
Martin Kroeker 9a34217cc6
Merge pull request #3717 from xianyi/develop
Update from develop for 0.3.21 release
2022-08-07 22:35:20 +02:00
Martin Kroeker 79f54f266d
Update version to 0.3.21 2022-08-07 22:32:11 +02:00
Martin Kroeker 94cba8e3c5
Merge pull request #3716 from martin-frbg/0321changes
Update Changelog for 0.3.21
2022-08-07 22:30:58 +02:00
Martin Kroeker 25ce2e2a63
Update with 0.3.21 changes 2022-08-07 22:21:23 +02:00
Jiaxun Yang b633eb79f2 Use $at as temporary register for mips/loongson CPUCFG read
Some compilers (namely LLVM) are not happy with clobbering
registers in inline assembly.
Use $at as temporary register and explicitly use noat
hint.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-07 13:22:32 +01:00
Martin Kroeker 9f89b62b25
Merge pull request #3715 from martin-frbg/issue3648
Increase thresholds for STFSM and CTFSM in the LAPACK testsuite
2022-08-07 08:45:06 +02:00
Martin Kroeker 9c4e91a77d
Increase threshold 2022-08-07 00:03:50 +02:00
Martin Kroeker 1fe06caf49
Increase threshold 2022-08-07 00:03:20 +02:00
Martin Kroeker ff58e9a7f1
Merge pull request #3609 from martin-frbg/lapack3101
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-08-06 14:31:56 +02:00
Martin Kroeker f6a1854ce9
resync gensymbol with develop 2022-08-06 09:29:09 +02:00
Martin Kroeker 2bee490287
Merge pull request #3714 from martin-frbg/crosscmake
Add more x86_64 target definitions for CMAKE cross-compiling
2022-08-04 23:58:21 +02:00
Martin Kroeker 85fd3c4279
Support compilation with the Cray C and Fortran compilers (#3712)
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker 3784b3d45c
Add more x86_64 target definitions for cross-compiling 2022-08-04 19:18:32 +02:00
Martin Kroeker 096ae6f2bd
Merge pull request #3709 from nursik/develop
Add TCORE Generic
2022-08-03 15:43:27 +02:00
Martin Kroeker 19fefd100e
Merge pull request #3703 from martin-frbg/omp_adaptive
Add env variable OMP_ADAPTIVE to control OMP threadpool behaviour
2022-08-03 15:38:39 +02:00
Martin Kroeker 2e51a61914
Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1
corrected indentation of for and if statement dgemv_thread_safety.cpp
2022-08-03 15:38:14 +02:00
Nursultan Zarlyk a7ac252fd9 Add TCORE Generic in prebuild.cmake
During the cross-compilation on x64 host with MSVC for ARMv8, the
build fails as there is no define directives for Generic core.
2022-08-02 10:50:58 +02:00
Jiaxun Yang 19d4f90c44 Use auvx to detect CPUCFG on mips/loongson
It's safer and easier than SIGILL.

Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-07-31 19:41:59 +01:00
Martin Kroeker 648a69a67e
Merge pull request #3707 from martin-frbg/getarch_risc
Fix crash in RISCV autodetection when pmodel is not present in /proc/cpuinfo
2022-07-31 10:13:38 +02:00
Martin Kroeker ef9c976a94
Really fix compilation; fix crash when pmodel is not present in cpuinfo 2022-07-31 00:41:04 +02:00
Martin Kroeker f727235be4
Merge pull request #3706 from martin-frbg/czifunding
Acknowledge past CZI EOSS 1/EOSS 3 funding
2022-07-30 14:11:45 +02:00
Martin Kroeker 880bc1d1db
Acknowledge past CZI EOSS 1/EOSS 3 funding 2022-07-30 12:34:09 +02:00
Martin Kroeker d0ba257de0
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 20:31:20 +02:00
Martin Kroeker 78da6a750a
Merge pull request #3705 from RajalakshmiSR/bf16ppc
POWER: Enable bfloat16 kernels by default
2022-07-28 18:38:14 +02:00
Rajalakshmi Srinivasaraghavan 1d97405c02 POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
gxw fbfe1daf6e LoongArch64: Add DYNAMIC_ARCH support 2022-07-28 14:28:45 +08:00
Martin Kroeker 80cdfed7b2
Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size 2022-07-27 23:43:20 +02:00
Martin Kroeker 08e3754b39
Add environment variable OMP_ADAPTIVE 2022-07-27 23:41:47 +02:00