Martin Kroeker
c4b52ef46e
Merge pull request #3624 from ioraff/no-perl
...
rewrite perl scripts in universal shell
2022-05-22 18:21:17 +02:00
Martin Kroeker
d0c3504255
Merge pull request #3631 from martin-frbg/revertdynskx
...
Revert selection of a different DGEMM kernel for SkylakeX in DYNAMIC_ARCH builds
2022-05-20 13:47:09 +02:00
Martin Kroeker
dac14a5f7d
revert "switch DGEMM parameters for SkylakeX if DYNAMIC_ARCH"
2022-05-20 11:28:23 +02:00
Martin Kroeker
dc49edd4e6
Revert "roll back DGEMM kernel ... for DYNAMIC_ARCH"
2022-05-20 11:23:30 +02:00
Martin Kroeker
faf58d2b3f
Merge pull request #3630 from martin-frbg/fixpr3629
...
Fix compilation of cpuid_riscv
2022-05-20 06:37:37 +02:00
Martin Kroeker
30df29c0b3
Fix compilation
2022-05-19 18:57:46 +02:00
Zhang Xianyi
a720e2ca8a
Merge pull request #3629 from Rabenda/riscv-c910
...
riscv: Fix machine recognition for c910v
2022-05-19 17:57:19 +08:00
Han Gao
8dd4579480
riscv: Fix machine recognition for c910v
...
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-05-19 17:32:48 +08:00
Owen Rafferty
42c7a27e6b
rewrite perl scripts in universal shell
2022-05-18 19:00:15 -05:00
Martin Kroeker
ed8b377318
Merge pull request #3628 from martin-frbg/issue3620
...
DYNAMIC_ARCH: Improve mapping for future AMD cpus
2022-05-19 00:03:55 +02:00
Martin Kroeker
2c62096fce
Expand cpu mapping for future Zen cpus and use feature-based fallback for unknown AMD family codes
2022-05-18 15:35:30 +02:00
Martin Kroeker
dbdb686128
Merge pull request #3625 from RajalakshmiSR/P10_store
...
POWER10: Changing store instructions for Level1 functions
2022-05-15 00:24:35 +02:00
Rajalakshmi Srinivasaraghavan
b62173c5a0
POWER10: Changing store instructions for Level1 functions
...
This patch changes 32-byte stores to two 16-byte stores
to fix a recent degradation caused by 32-byte stores.
2022-05-12 11:17:33 -05:00
martin-frbg
cf61d48470
fix C_LAPACK build options for files that must be built unoptimized
2022-05-12 09:43:23 +02:00
martin-frbg
884e6da73b
restore OpenBLAS-specific build file fixes
2022-05-12 00:20:54 +02:00
martin-frbg
56d238ff0f
Merge branch 'lapack3101' of https://github.com/martin-frbg/OpenBLAS into lapack3101
2022-05-11 18:45:02 +02:00
martin-frbg
ffb4f04fbb
Update the LAPACK testsuite to match 3.10.1
2022-05-11 18:40:55 +02:00
Martin Kroeker
ce814e84dc
Merge pull request #3619 from martin-frbg/fixup-3613
...
Initial attempt at proper cpu detection on RISCV
2022-05-04 15:12:22 +02:00
Martin Kroeker
46d22150de
Initial attempt at proper cpu detection on RISCV
2022-05-04 08:58:56 +02:00
Martin Kroeker
8f13ab94d2
Merge pull request #3613 from Rabenda/fix-riscv
...
Fix riscv64 detect
2022-05-04 07:22:47 +02:00
Martin Kroeker
7d6c0ae223
Merge pull request #3618 from martin-frbg/issue3606
...
Automatically downgrade C910V to RISCV64_GENERIC if the compiler lacks vector support
2022-05-04 07:22:25 +02:00
Martin Kroeker
18427f3759
Have getarch downgrade the RISCV C910V target to GENERIC if compiler lacks vector support
2022-05-03 23:29:55 +02:00
Martin Kroeker
d328636e58
Add compiler check for RISCV vector support
2022-05-03 23:27:50 +02:00
Martin Kroeker
06d1dd6ba8
Merge pull request #3616 from martin-frbg/issue3615
...
Fix CMAKE generator rules for ?laswp_ncopy and ?neg_tcopy kernels
2022-05-01 00:09:20 +02:00
Martin Kroeker
75cdc8cba5
Merge branch 'develop' into lapack3101
2022-04-30 21:43:45 +02:00
Martin Kroeker
567d8853d1
rename lapack subtarget to lapack_overrides to avoid name clash with netlib in case-insensitive settings
2022-04-30 20:38:09 +02:00
Martin Kroeker
d502326e54
Merge pull request #3614 from martin-frbg/clapackfix
...
Makefile fixes related to C_LAPACK, plus Travis CI fixes
2022-04-30 20:35:17 +02:00
Martin Kroeker
96ddadf164
Update .travis.yml
2022-04-30 18:49:04 +02:00
Martin Kroeker
adaf03835b
try to fix assembler errors on z13
2022-04-30 18:33:00 +02:00
Martin Kroeker
84cb58b7fb
Fix generator rules for ?laswp_ncopy and ?neg_tcopy
2022-04-30 15:28:38 +02:00
Martin Kroeker
11f023b712
fix arch tags
2022-04-27 22:18:22 +02:00
Martin Kroeker
b4b9ccdbfa
Remove leftover debug output
2022-04-27 21:59:45 +02:00
Martin Kroeker
24e99eca31
Avoid adding -lgfortran with NOFORTRAN
2022-04-27 20:31:42 +02:00
Martin Kroeker
3dd09e87bf
Update NOFORTRAN message for fallback to C_LAPACK
2022-04-27 20:26:45 +02:00
Han Gao
8123324c99
Fix riscv64 arch detect
...
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-04-27 02:29:43 +08:00
Han Gao
3fc52ebcfb
Fix other arch build in detect.
...
When CORE is empty, -march=loongson3a is used. Fix it.
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-04-27 01:34:55 +08:00
Martin Kroeker
ee4244538c
Merge pull request #3612 from nsait-linaro/fix-windows-make-build
...
build: minor fixes to build on windows with make
2022-04-25 15:51:34 +02:00
Niyas Sait
3f5d145cd4
build: minor fixes to build on windows with make
...
This patch contains the following fixes:
1. Fix to build without the PIC flag
2. Define LAPACK_COMPLEX_STRUCTURE for Windows. Builds fail
without it, and the change is consistent with the CMake rules defined
in system.cmake (line 576)
2022-04-25 00:01:12 +01:00
martin-frbg
510c7222ef
Add dummy C files for 3.10.1 modules
2022-04-22 13:09:12 +02:00
martin-frbg
ee7f42217b
Add dummy C versions of ?roundup_lwork for C_LAPACK
2022-04-22 12:28:15 +02:00
martin-frbg
aea3d366f5
Add dummy C versions of the 3.10.1 ?laqz codes for C_LAPACK
2022-04-22 09:27:37 +02:00
martin-frbg
920199c577
fix module build and d/sroundup_lwork
2022-04-21 23:31:02 +02:00
martin-frbg
7c84a9f7be
update suffix rules
2022-04-21 22:43:29 +02:00
martin-frbg
329cdc31b3
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-04-21 20:39:29 +02:00
Martin Kroeker
18b19d135b
C_LAPACK: Fixes to make it compile with MSVC ( #3605 )
...
* Fix f2c-like support functions to compile with MSVC, and
re-enable C_LAPACK for MSVC in CMake
* Add an MSVC and flang build to Azure CI in order to check C_LAPACK correctness
2022-04-17 17:49:38 +02:00
Martin Kroeker
d09b9dd0cc
Merge pull request #3607 from martin-frbg/issue3603
...
Fix undefined PREFETCHSIZEs in PPC440 GEMV kernels
2022-04-16 12:54:35 +02:00
Martin Kroeker
05dcfa176e
fix undefined prefetchsizes
2022-04-16 10:04:27 +02:00
Martin Kroeker
2bbb9f05c7
fix undefined prefetchsize
2022-04-16 10:00:10 +02:00
Martin Kroeker
8144a498ad
Merge pull request #3604 from mmuetzel/ci
...
Adapt commands for tests with GNU make.
2022-04-11 19:31:26 +02:00
Markus Mützel
430bea00ba
Adapt commands for tests with GNU make.
2022-04-11 11:45:05 +02:00