Johannes Schönberger
ec4700087e
Wrap variables in cmake/prebuild in quotes to handle spaces in paths
2023-03-19 16:09:47 +01:00
Martin Kroeker
de937b3194
Add clang option to avoid running out of registers in AVX512 assembly
2023-03-17 21:22:37 +01:00
Martin Kroeker
50c263716e
Correct DGEMM_UNROLL_M value for A53 in cross-compile
2023-03-10 11:53:24 +01:00
Martin Kroeker
8272dfc552
Handle unrecognized ASM compiler (from Arm Compiler 22.1)
2023-03-03 00:21:59 +01:00
xctan
6a0de3aa39
Add missing RISC-V architecture in arch.cmake
...
RISC-V support exists in Makefile.system but is missing in arch.cmake. This patch adds riscv64 platform support to cmake building system just like 039e27545f/Makefile.system (L830-L832) did.
2023-02-26 20:21:57 +08:00
Martin Kroeker
fa5ff7d199
slarmm/dlarmm are needed by COMPLEX/COMPLEX16-only builds too
2023-02-22 00:25:12 +01:00
Martin Kroeker
19a696f8fe
fix nested conditionals
2023-02-02 19:59:49 +01:00
Martin Kroeker
e964ebd0d0
Add compiler option for AVX512-capable Ryzen(4)
2023-02-02 19:04:05 +01:00
Martin Kroeker
afff3f5510
Fix typo that could clear EXTRALIB settings
2023-02-02 10:43:44 +01:00
Martin Kroeker
5ddcc5ffa2
Update arch.cmake
2022-12-25 10:37:39 +01:00
Martin Kroeker
18cac5a34c
SVE-enabled ARM64 targets in DYNAMIC_ARCH require a recent compiler
2022-12-25 10:25:13 +01:00
Martin Kroeker
88cd91c490
Fix stray character
2022-11-19 23:15:20 +01:00
Martin Kroeker
5ff46f4092
Add ?GELST (Reference-LAPACK PR739)
2022-11-19 22:49:31 +01:00
Martin Kroeker
e4a31c0d23
add ?LANGB interface (Reference-LAPACK PR725)
2022-11-17 18:15:04 +01:00
Martin Kroeker
258a83a642
change line endings from CRLF to LF
2022-11-16 21:57:42 +01:00
Martin Kroeker
f16aa1ce7a
Merge pull request #3821 from martin-frbg/lapack651
...
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
2022-11-16 13:59:02 +01:00
Martin Kroeker
b6d74b7fff
Add f2c-converted files for the BLAS3-based Sylvester solver
2022-11-15 16:26:44 +01:00
Martin Kroeker
bb652f65a3
Typo fix
2022-11-14 16:35:13 +01:00
Martin Kroeker
fb42a0cf8b
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
2022-11-14 14:06:50 +01:00
Martin Kroeker
90d7451df5
Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742)
2022-11-13 15:10:00 +01:00
Martin Kroeker
be546ec1ad
Add gcc options for Neoverse cpus
2022-11-09 11:00:41 +01:00
Martin Kroeker
747ade5adf
fix INTERFACE64/USE64BITINT reporting
2022-10-18 17:28:07 +02:00
Martin Kroeker
32566bfb44
Disable the gfortran tree vectorizer for netlib LAPACK
2022-10-13 14:04:25 +02:00
Martin Kroeker
a0a4f7c447
Add -mfma to -mavx2 for clang, and add AVX2 declaration for Zen in DYNAMIC_ARCH builds
2022-09-13 22:47:00 +02:00
Martin Kroeker
ff58e9a7f1
Merge pull request #3609 from martin-frbg/lapack3101
...
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-08-06 14:31:56 +02:00
Martin Kroeker
2bee490287
Merge pull request #3714 from martin-frbg/crosscmake
...
Add more x86_64 target definitions for CMAKE cross-compiling
2022-08-04 23:58:21 +02:00
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers ( #3712 )
...
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
3784b3d45c
Add more x86_64 target definitions for cross-compiling
2022-08-04 19:18:32 +02:00
Nursultan Zarlyk
a7ac252fd9
Add TCORE Generic in prebuild.cmake
...
During the cross-compilation on x64 host with MSVC for ARMv8, the
build fails as there is no define directives for Generic core.
2022-08-02 10:50:58 +02:00
Nursultan Zarlyk
1bb7993a97
Fix MSVC ARM64 build. Add generic kernel for ARM64
2022-06-02 16:53:54 +02:00
Martin Kroeker
5cb0d23027
Support USE_PERL fallback for gensymbol
2022-05-22 18:36:24 +02:00
Owen Rafferty
42c7a27e6b
rewrite perl scripts in universal shell
2022-05-18 19:00:15 -05:00
martin-frbg
329cdc31b3
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
2022-04-21 20:39:29 +02:00
Martin Kroeker
18b19d135b
C_LAPACK: Fixes to make it compile with MSVC ( #3605 )
...
* Fix f2c-like support functions to compile with MSVC, and
re-enable C_LAPACK for MSVC in CMAKE
* Add MSVC&flang build to Azure CI in order to check C_LAPACK correctness
2022-04-17 17:49:38 +02:00
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available ( #3539 )
...
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
2022-04-09 22:38:58 +02:00
Martin Kroeker
ad4598143b
Merge pull request #3575 from mmuetzel/lapacke-win64
...
Fix LAPACKE with 64-bit indexing on Windows.
2022-03-29 20:04:04 +02:00
Martin Kroeker
abbc65cff4
Cortex X1 is only Arm8.2
2022-03-28 17:40:27 +02:00
Martin Kroeker
57dd92a662
Add initial support for ARMV9 Cortex 510/710/X1/X2
2022-03-27 15:26:42 +02:00
Markus Mützel
aeb561d234
Add support for Intel Fortran compilers.
...
Port changes from upstream Reference-LAPACK.
2022-03-25 13:37:15 +01:00
Markus Mützel
00f44bfff7
cmake: Check if Fortran compiler is usable before enabling it.
2022-01-21 13:27:17 +01:00
Martin Kroeker
a9e297e476
Fix handling of ifdef/ifndef
2022-01-09 23:31:59 +01:00
Martin Kroeker
b6b024232d
Merge pull request #3508 from snadampal/v1_n2
...
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
2022-01-09 14:50:26 +01:00
Sunita Nadampalli
19c8f615dc
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
2022-01-07 00:28:17 +00:00
jgillis
ea3db69faa
Fix cmake crosscompilation for core2 target
...
Missing HAVE_SSE* cmake variables cause cc.cmake to forget about `-msse*` flags
2021-12-29 22:50:20 +01:00
Rafael Cardoso Fernandes Sousa
d38110a5ce
Use CMake variables instead of as
2021-12-10 17:46:53 -06:00
Rafael Cardoso Fernandes Sousa
214fbcee15
Fix cmake for power
2021-12-09 08:28:17 -06:00
Martin Kroeker
454edd741c
Merge pull request #3425 from binebrank/arm_sve_dgemm
...
Add dgemm kernel for arm64 SVE
2021-11-26 16:14:55 +01:00
Martin Kroeker
bcfbdc81b2
Merge pull request #3459 from rafaelcfsousa/fix_cmake
...
Fix issues when building OpenBLAS with cmake
2021-11-26 15:19:24 +01:00
Bine Brank
1af73ce38e
Adapt CMake for SVE
2021-11-26 10:35:01 +01:00
Rafael Cardoso Fernandes Sousa
d5c9353f1b
Modify the order that cmake set the KERNEL variables (generic now is fallback)
2021-11-24 20:08:35 -06:00
Rafael Cardoso Fernandes Sousa
fb891f33da
Fix the cmake parser to identify more patterns
2021-11-24 14:07:28 -06:00
Martin Kroeker
a3cd36acff
Add CMAKE support for cross-compiling to MIPS32
2021-11-20 17:34:28 +01:00
Markus Mützel
de2ed66596
cmake: Set SUFFIX64 also for NOFORTRAN
2021-11-15 08:53:52 +01:00
Martin Kroeker
02ea3db8e7
Merge pull request #3404 from guowangy/spr-build
...
Initial build support for Sapphire Rapids
2021-10-17 23:05:11 +02:00
مهدي شينون (Mehdi Chinoune)
efd7ac241d
Fix MinGW/Clang 64 bits detection.
...
CMAKE_COMPILER_IS_GNUCC is only valid for GCC.
2021-10-16 08:02:27 +01:00
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
2021-10-12 01:30:40 -07:00
Martin Kroeker
e02df9fc55
Propagate BUILD_BFLOAT16 to CFLAGS
2021-09-14 16:12:27 +02:00
Martin Kroeker
1c0a8a714a
Add defaults for SBGEMV kernels
2021-09-14 16:10:58 +02:00
Martin Kroeker
af19cda65a
Add "recursive" option for IBM xlf compiler ( #3359 )
...
* Add correct "recursive" option for xlf (from reference-lapack issue 606)
2021-09-04 18:26:59 +02:00
Martin Kroeker
bec9d9f63d
Merge pull request #3335 from guowangy/small-matrix-latest
...
Add GEMM optimization for small matrix and single/double kernel for skylakex
2021-08-29 22:33:33 +02:00
cianciosa
4c766cd11f
Fix a small syntax error. A ( was accidently deleted.
2021-08-11 12:08:34 -04:00
cianciosa
c28560129f
Check the total number of arguments passed insead of if the ARGV# is defined. This fixes a problem when compling openblas as a subproject of another code.
2021-08-11 12:00:07 -04:00
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
...
If no customized GEMM_SMALL_M_PERMIT kernel defined, it will just by pass to normal path.
2021-08-05 02:59:36 +00:00
Wangyang Guo
fee5abd84b
Small Matrix: support cmake build
2021-08-04 08:50:15 +00:00
gxw
0b8f7c8c10
Add cmake support for LOONGARCH64
2021-08-02 10:00:41 +08:00
Martin Kroeker
47ba85f314
Fix regex to match kernels suffixed with cpuname too
2021-07-22 17:24:15 +02:00
Martin Kroeker
30f23be0f9
Rework setting of -mfma to only apply it where necessary
2021-07-22 12:00:03 +02:00
User User-User
91e2b11d3c
add to cmake listings too
2021-06-20 15:32:42 +02:00
Martin Kroeker
13fa9f737d
Modify defines for CR and RC to work around name collision on Windows
2021-06-16 12:17:25 +02:00
Martin Kroeker
db50b24a4a
Add entries for the new Householder Reconstruction functions from 3.9.1
2021-05-02 19:55:15 +02:00
Martin Kroeker
40000d1f64
Add entries for Householder reconstruction functions from 3.9.1
2021-05-02 19:21:59 +02:00
刘雨培
725432efaa
pass NO_AVX512 macro def
2021-04-07 00:10:41 +08:00
Jake Arkinstall
d7a77091a3
Addressed issue #3100 , removing an unnecessary write to the include directory
2021-02-10 12:11:17 +00:00
Martin Kroeker
33b5670122
Merge pull request #3096 from martin-frbg/fixclangcmake
...
Fix Cooperlake/DYNAMIC_ARCH builds with clang on Windows
2021-02-02 13:33:15 +01:00
Martin Kroeker
95e19e2e23
fix case in compiler name check
...
Co-authored-by: xoviat <49173759+xoviat@users.noreply.github.com>
2021-02-02 10:53:46 +01:00
Martin Kroeker
99ac042702
remove spurious lines (probably editor malfunction)
2021-02-01 21:02:53 +01:00
Martin Kroeker
774b9f8653
handle AppleClang in Cooperlake support condition
2021-02-01 20:18:53 +01:00
Martin Kroeker
eb1d2344f7
Fix compiler version check for Intel Cooperlake support (clang-cl does not accept -dumpversion)
2021-02-01 19:45:25 +01:00
Martin Kroeker
0cc36770f1
Merge pull request #3073 from xoviat/embedded
...
add embedded option
2021-01-31 18:02:41 +01:00
Martin Kroeker
cb61d3b46b
Add DYNAMIC_LIST support for ARM64
2021-01-25 13:13:20 +01:00
xoviat
b60de4447a
add cortex-m platform
2021-01-19 08:57:44 -06:00
Martin Kroeker
89ae305e11
Workaround for cmake having its own C_COMPILER variable
2021-01-13 12:30:26 +01:00
Martin Kroeker
ec4d77c47c
Add -mfma for HAVE_FMA3 in the non-DYNAMIC_ARCH case as well
2020-11-13 09:16:34 +01:00
Martin Kroeker
a29338aaa6
Remove extraneous quotes that caused a cmake policy warning
2020-11-07 20:27:42 +01:00
Martin Kroeker
438a8e5624
Fix placement of getarch call and spurious cpu property accumulation in DYNAMIC_ARCH builds
2020-11-07 20:26:12 +01:00
Martin Kroeker
0155cd53a3
Add -msse3 where needed for DYNAMIC_ARCH builds
2020-11-03 23:45:49 +01:00
Martin Kroeker
a9f9354296
Fix target test
2020-11-02 23:17:46 +01:00
Martin Kroeker
b9bc76aec4
Add files via upload
2020-11-02 22:43:50 +01:00
Martin Kroeker
e5f8c2bf8a
typo fix
2020-11-01 22:25:43 +01:00
Martin Kroeker
6baf8af658
Disable EXPRECISION for the combination of DYNAMIC_CORE and GENERIC target
2020-11-01 22:11:48 +01:00
Chen, Guobing
a7b1f9b1bb
Implementation of BF16 based gemv
...
1. Add a new API -- sbgemv to support bfloat16 based gemv
2. Implement a generic kernel for sbgemv
3. Implement an avx512-bf16 based kernel for sbgemv
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
2020-10-29 02:08:23 +08:00
Martin Kroeker
eddc65c7b7
Add POWER10 support flag (unconditionally for now)
2020-10-20 01:09:49 +02:00
Martin Kroeker
f5902ab0a1
Support cross-compiling for Apple Vortex
2020-10-18 19:10:58 +02:00
Martin Kroeker
f64243ff57
Add compiler options for sse/sse2/ssse3/sse4.1
2020-10-16 10:47:06 +02:00
Martin Kroeker
786c0a3ce8
Add sse options for use of intrinics with older compilers
2020-10-16 10:41:53 +02:00
Martin Kroeker
756802df61
Merge pull request #2890 from martin-frbg/s-d-sum
...
Revert special handling of Windows xNRM2 and enable C+intrinsics kern…
2020-10-14 09:02:03 +02:00
Martin Kroeker
75e3a92df6
Add express -mavx and -msse options (and fix a stray = for cooperlake)
2020-10-14 01:01:58 +02:00
Martin Kroeker
e3a29f6b58
Change "HALF" and "sh" to "BFLOAT16" and "sb"
2020-10-12 00:07:37 +02:00
Martin Kroeker
68e6823d36
Adapt for supporting only a subset of variable types
2020-10-11 15:01:32 +02:00
Martin Kroeker
88928650c4
Merge pull request #2883 from martin-frbg/issue2872
...
Minor CMAKE fixes
2020-10-11 10:30:33 +02:00