Commit Graph

7452 Commits

Author SHA1 Message Date
Martin Kroeker 4dbdc28c25
Merge pull request #3590 from mmuetzel/ci-msys2
Add action for MSYS2 builds.
2022-04-03 19:53:38 +02:00
Markus Mützel 67714aea71 Add action for MSYS2 builds. 2022-03-31 11:07:18 +02:00
Martin Kroeker ad4598143b
Merge pull request #3575 from mmuetzel/lapacke-win64
Fix LAPACKE with 64-bit indexing on Windows.
2022-03-29 20:04:04 +02:00
Martin Kroeker d5ed695632
Merge pull request #3588 from martin-frbg/fix3586
Fix mistaken declaration of CortexX1 as ArmV9 in PR#3586
2022-03-29 19:36:27 +02:00
Martin Kroeker 1d4e703f50
Merge pull request #3589 from e4t/Exclude_paramter.c_symbols_with_DYNAMIC_ARCH
Do not include symbols defined in driver/others/parameter.c in DYNAMI…
2022-03-29 19:35:56 +02:00
Egbert Eich 5e6d160020 Do not include symbols defined in driver/others/parameter.c in DYNAMIC_ARCH
driver/others/parameter.c does not get built during DYNAMIC_ARCH, thus,
do not declare its symbols. This will make the build fail early and in
an obvious way if functions are trying to use these symbols.

Signed-off-by: Egbert Eich <eich@suse.com>
2022-03-29 10:01:28 +02:00
Martin Kroeker a55a06c269
Update param.h 2022-03-28 18:10:08 +02:00
Martin Kroeker abbc65cff4
Cortex X1 is only Arm8.2 2022-03-28 17:40:27 +02:00
Martin Kroeker d93cf7f23c
fix defines for CORTEX-X 2022-03-28 17:37:06 +02:00
Martin Kroeker 48e421934f
CortexX1 is only ArmV8 2022-03-28 17:31:26 +02:00
Martin Kroeker 115bc9b98f
CortexX1 is ARMV8 like A7x 2022-03-28 17:28:29 +02:00
Martin Kroeker 848722926c
CortexX1 is only ARMV8 2022-03-28 17:18:56 +02:00
Martin Kroeker 8002048217
Merge pull request #3587 from e4t/fix_avx512
Use CC and full command line instead of hard-coding gcc for AVX512 ch…
2022-03-28 14:58:32 +02:00
Egbert Eich 0b69fa6ddf Use CC and full command line instead of hard-coding gcc for AVX512 checking
Hard-coding gcc may provide incorrect results when a different compiler
for the target build is used. To remain in sync with the main call to c_check,
pass the full command line.

Signed-off-by: Egbert Eich <eich@suse.com>
2022-03-28 08:22:07 +02:00
Martin Kroeker 5d2f451d09
Merge pull request #3586 from martin-frbg/arm64cpus
Initial support for M1 on Linux, Phytium FT2000 series, ARMV9 Cortex X1,X2,A510,A710
2022-03-27 18:12:21 +02:00
Martin Kroeker b3b4672c30
Add initial support for Phytium FT2000 series and ARMV9 Cortex 510/710/X1/X2 2022-03-27 15:29:20 +02:00
Martin Kroeker 57dd92a662
Add initial support for ARMV9 Cortex 510/710/X1/X2 2022-03-27 15:26:42 +02:00
Martin Kroeker 09b8545fc5
Add initial support for M1 on Linux, Phytium FT2xxx series, ARM Cortex 510/710/X1/X2 2022-03-27 15:24:40 +02:00
Martin Kroeker a3d2514b9a
Merge pull request #3585 from martin-frbg/issue3581
Revert accidental change of generic ARMV8 DGEMM parameters from #3425
2022-03-27 15:19:26 +02:00
Martin Kroeker 8d0f7f0176
Revert accidental change of generic ARMV8 DGEMM parameters from #3425 2022-03-27 13:10:47 +02:00
Martin Kroeker 153f8bc8da
Merge pull request #3584 from martin-frbg/ctestskip
Add a (CMAKE) option to skip the LAPACK testsuite and use it in Azure CI
2022-03-25 14:35:15 +01:00
Markus Mützel aeb561d234 Add support for Intel Fortran compilers.
Port changes from upstream Reference-LAPACK.
2022-03-25 13:37:15 +01:00
Martin Kroeker 6c3842a891
Disable the LAPACK testsuite for the Windows clang/flang build as it takes too long 2022-03-24 21:25:16 +01:00
Martin Kroeker 4199ca728e
Add LAPACK-like option to omit the LAPACK testsuite 2022-03-24 21:23:28 +01:00
Larson, Eric 8fe3555792 ILP support
longs on Windows are 4 bytes (MSVS, Intel compilers). Use int64_t and int32_t
to ensure 8 byte integers for ILP interface.

support 8 byte integer flag for intel ifort compiler
2022-03-24 19:09:23 +01:00
Aisha Tammy 3efbf968f1 create INDEX64 target 2022-03-24 19:09:23 +01:00
Martin Kroeker 34ecd967a5
Merge pull request #3580 from martin-frbg/dynx86_sbgemm
Remove extraneous (and wrong) definition of sbgemm_r on x86_64
2022-03-24 11:33:00 +01:00
Martin Kroeker 2519c9d93f
Merge pull request #3579 from martin-frbg/issue3557-2
Fix malfunctioning AVX512 check
2022-03-24 08:28:37 +01:00
Martin Kroeker 40302558ed
Remove extraneous (and wrong) definition of sbgemm_r on x86_64 2022-03-23 20:05:32 +01:00
Martin Kroeker b79b99d695
Merge branch 'xianyi:develop' into issue3557-2 2022-03-23 19:13:54 +01:00
Martin Kroeker c87a4dbf35
Fix checks for AVX512 and atomics 2022-03-23 15:48:58 +01:00
Martin Kroeker 93a81856ae
Revert AVX512 capability check from PR #1980 (moved to build) 2022-03-23 15:22:13 +01:00
Martin Kroeker 9fbeb88fb8
Utilize compiler AVX512 capability info from c_check when building getarch 2022-03-23 15:19:55 +01:00
Martin Kroeker 4cb302a596
Merge pull request #3561 from AlessioZanga/patch-msvc
Remove MSVC limitation
2022-03-23 11:28:13 +01:00
Martin Kroeker f67977a323
Merge pull request #3576 from martin-frbg/cmaketestbom
Skip BLAS tests if Windows powershell added a BOM
2022-03-23 07:19:15 +01:00
Martin Kroeker 0ee2d15fdb
Merge pull request #3577 from martin-frbg/azure_win2022
Update Windows jobs in Azure CI to use Windows2022
2022-03-23 07:18:45 +01:00
Martin Kroeker a0e86adf93
Update Windows jobs in Azure CI to use Windows2022 2022-03-22 21:51:09 +01:00
Martin Kroeker 2408315d10
Skip tests if Windows powershell added a BOM 2022-03-22 21:37:55 +01:00
Martin Kroeker 694f6c5c8d
Merge pull request #3574 from AdamNiederer/fix-dynamic-list-compilation
Fix broken elif in dynamic.c
2022-03-19 09:21:56 +01:00
Adam Niederer 69f2ac4ea2 Fix broken elif in dynamic.c
This fixes compilation in the following case:

$(MAKE) USE_OPENMP=1 USE_THREAD=1 NO_LAPACK=0 DYNAMIC_ARCH=1 \
DYNAMIC_LIST="HASWELL SKYLAKEX ATOM COOPERLAKE SAPPHIRERAPIDS ZEN"
2022-03-17 20:04:37 -04:00
Martin Kroeker 501bf31e3e
Merge pull request #3567 from cenewcombe/develop
Fix unsafe read of Y in zsymv_L_sse2.S
2022-03-12 13:40:17 +01:00
Caroline Newcombe 5cc1111383 fix unsafe read of Y in assembly kernel 2022-03-11 11:56:33 -06:00
Martin Kroeker 8d5a9c2f98
Merge pull request #3565 from jonaszhou1/develop
Support Zhaoxin/Centaur kh40000 as ZEN
2022-03-11 14:29:30 +01:00
Martin Kroeker 9dcd8aeb7a
Merge pull request #3566 from martin-frbg/configtls
Report USE_TLS in get_config output if set
2022-03-11 14:27:27 +01:00
Martin Kroeker bf4642eb7e
Report USE_TLS if set 2022-03-10 16:19:29 +01:00
JonasZhou 2d0ad89b0d Support Zhaoxin/Centaur kh40000 as ZEN
Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
2022-03-10 15:08:38 +08:00
AlessioZanga ed2871cb71
Change `BUILD_WITHOUT_LAPACK` to `OFF` by default 2022-03-05 23:35:29 +01:00
Alessio Zanga ed8c028f7f
Remove MSVC limitation 2022-03-05 14:06:21 +01:00
Martin Kroeker 1ef97c470c
Merge pull request #3550 from guowangy/smatrix-mask-fix
Small Matrix: use proper inline asm input constraint for AVX512 mask
2022-02-28 08:28:02 +01:00
Xianyi Zhang 45786b05da Merge branch 'develop' into risc-v 2022-02-28 11:48:02 +08:00