Martin Kroeker
0cca127907
Merge pull request #3726 from martin-frbg/fixc99
...
Fix C99-style declaration in converted LAPACK code
2022-08-11 20:36:49 +02:00
Martin Kroeker
be82449c3f
Merge pull request #3727 from martin-frbg/traviswait-2
...
Double the wait time for ppc jobs in Travis CI
2022-08-11 19:49:19 +02:00
Martin Kroeker
3fc06b6e7e
Double the wait time for ppc jobs in Travis CI
2022-08-11 16:42:02 +02:00
Martin Kroeker
704a024df4
Fix C99-style declaration of loop variable
2022-08-11 16:37:23 +02:00
Jiaxun Yang
407af4b6aa
Document alpha targets
...
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:48 +01:00
Jiaxun Yang
a03ed065e1
Wire up alpha in new build system
...
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:08:46 +01:00
Jiaxun Yang
50c4eeb97d
alpha: Remove include of version.h
...
It will be defined by a preprocessor argument.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang
fa14bdb26d
Entitle missing declaration for alpha
...
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang
99f82f1f81
alpha: Use mb for rmb
...
Alpha has never had an rmb instruction.
Just use mb for rmb.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:02:58 +01:00
Jiaxun Yang
1569a43f7e
GitHub Actions: Add cross compile tests
...
Add cross-compile tests without running checks. Currently
only mips64el, riscv64, mipsel, and alpha are wired up.
This helps us make sure those less popular CPUs are not
broken by changes.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:01:34 +01:00
Jiaxun Yang
aaaecdbf19
Allow CC to contain spaces
...
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-11 15:01:30 +01:00
haanhvu
38e9c969c4
Clarify the installation guide in the output of make
2022-08-10 16:28:50 +07:00
Martin Kroeker
0946b1ea7e
Merge pull request #3719 from ivan-pi/lsame
...
Add const attribute to C LAPACK lsame
2022-08-09 22:15:26 +02:00
Martin Kroeker
5d577ad7b9
Merge pull request #3721 from isuruf/patch-1
...
Fix building tests when NOFORTRAN is not given at all
2022-08-09 20:24:40 +02:00
Martin Kroeker
b94944777b
Merge pull request #3718 from martin-frbg/fixsbtest
...
Fix SBGEMM test to work with INTERFACE64
2022-08-09 09:45:22 +02:00
Isuru Fernando
d54971fe20
Fix building tests when NOFORTRAN is not given at all
2022-08-08 11:20:38 -05:00
Martin Kroeker
3d338b57de
remove spurious loops
2022-08-08 17:09:45 +02:00
Ivan Pribec
802e71bf05
Add const attribute to lsame
2022-08-08 15:15:52 +02:00
Martin Kroeker
d9dc015cfc
Use blasint for INTERFACE64 compatibility
2022-08-08 14:52:10 +02:00
Martin Kroeker
e671d0386b
Update version to 0.3.21.dev
2022-08-07 23:03:20 +02:00
Martin Kroeker
1dd979959d
set version to 0.3.21.dev
2022-08-07 23:02:36 +02:00
Martin Kroeker
b89fb708ca
Update version to 0.3.21
2022-08-07 22:36:26 +02:00
Martin Kroeker
9a34217cc6
Merge pull request #3717 from xianyi/develop
...
Update from develop for 0.3.21 release
2022-08-07 22:35:20 +02:00
Martin Kroeker
79f54f266d
Update version to 0.3.21
2022-08-07 22:32:11 +02:00
Martin Kroeker
94cba8e3c5
Merge pull request #3716 from martin-frbg/0321changes
...
Update Changelog for 0.3.21
2022-08-07 22:30:58 +02:00
Martin Kroeker
25ce2e2a63
Update with 0.3.21 changes
2022-08-07 22:21:23 +02:00
Jiaxun Yang
b633eb79f2
Use $at as temporary register for mips/loongson CPUCFG read
...
Some compilers (namely LLVM) are not happy with clobbering
registers in inline assembly.
Use $at as the temporary register and explicitly use the noat
hint.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-07 13:22:32 +01:00
Martin Kroeker
9f89b62b25
Merge pull request #3715 from martin-frbg/issue3648
...
Increase thresholds for STFSM and CTFSM in the LAPACK testsuite
2022-08-07 08:45:06 +02:00
Martin Kroeker
9c4e91a77d
Increase threshold
2022-08-07 00:03:50 +02:00
Martin Kroeker
1fe06caf49
Increase threshold
2022-08-07 00:03:20 +02:00
Martin Kroeker
ff58e9a7f1
Merge pull request #3609 from martin-frbg/lapack3101
...
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-08-06 14:31:56 +02:00
Martin Kroeker
f6a1854ce9
resync gensymbol with develop
2022-08-06 09:29:09 +02:00
Martin Kroeker
2bee490287
Merge pull request #3714 from martin-frbg/crosscmake
...
Add more x86_64 target definitions for CMAKE cross-compiling
2022-08-04 23:58:21 +02:00
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers (#3712)
...
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
3784b3d45c
Add more x86_64 target definitions for cross-compiling
2022-08-04 19:18:32 +02:00
Martin Kroeker
096ae6f2bd
Merge pull request #3709 from nursik/develop
...
Add TCORE Generic
2022-08-03 15:43:27 +02:00
Martin Kroeker
19fefd100e
Merge pull request #3703 from martin-frbg/omp_adaptive
...
Add env variable OMP_ADAPTIVE to control OMP threadpool behaviour
2022-08-03 15:38:39 +02:00
Martin Kroeker
2e51a61914
Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1
...
Corrected indentation of for and if statements in dgemv_thread_safety.cpp
2022-08-03 15:38:14 +02:00
Nursultan Zarlyk
a7ac252fd9
Add TCORE Generic in prebuild.cmake
...
During cross-compilation on an x64 host with MSVC for ARMv8, the
build fails because there are no define directives for the Generic core.
2022-08-02 10:50:58 +02:00
Jiaxun Yang
19d4f90c44
Use auxv to detect CPUCFG on mips/loongson
...
It's safer and easier than SIGILL.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-07-31 19:41:59 +01:00
Martin Kroeker
648a69a67e
Merge pull request #3707 from martin-frbg/getarch_risc
...
Fix crash in RISCV autodetection when pmodel is not present in /proc/cpuinfo
2022-07-31 10:13:38 +02:00
Martin Kroeker
ef9c976a94
Really fix compilation; fix crash when pmodel is not present in cpuinfo
2022-07-31 00:41:04 +02:00
Martin Kroeker
f727235be4
Merge pull request #3706 from martin-frbg/czifunding
...
Acknowledge past CZI EOSS 1/EOSS 3 funding
2022-07-30 14:11:45 +02:00
Martin Kroeker
880bc1d1db
Acknowledge past CZI EOSS 1/EOSS 3 funding
2022-07-30 12:34:09 +02:00
Martin Kroeker
d0ba257de0
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
...
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 20:31:20 +02:00
Martin Kroeker
78da6a750a
Merge pull request #3705 from RajalakshmiSR/bf16ppc
...
POWER: Enable bfloat16 kernels by default
2022-07-28 18:38:14 +02:00
Rajalakshmi Srinivasaraghavan
1d97405c02
POWER: Enable bfloat16 kernels by default
...
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
gxw
fbfe1daf6e
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 14:28:45 +08:00
Martin Kroeker
80cdfed7b2
Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size
2022-07-27 23:43:20 +02:00
Martin Kroeker
08e3754b39
Add environment variable OMP_ADAPTIVE
2022-07-27 23:41:47 +02:00