384 Commits

Author SHA1 Message Date
Martin Kroeker
49689fbef7 Add support for compiling SVE kernels with the NVIDIA HPC compiler 2023-08-25 17:11:04 +02:00
Martin Kroeker
562ef5fdca Merge pull request #4169 from felixonmars/patch-1
Use defined variable for riscv64 in arch.cmake
2023-08-12 17:20:56 +02:00
Martin Kroeker
0e5d56ae4a Merge pull request #4170 from felixonmars/patch-2
Fix 64-bit fortran options for riscv64
2023-08-12 09:21:05 +02:00
Markus Mützel
57256623f4 fc.cmake: Add support for LLVM Flang. 2023-08-05 13:16:06 +02:00
Felix Yan
f5506b002c Add 64-bit flag on INTERFACE64 only 2023-07-28 16:19:14 +03:00
Felix Yan
4ed6414c17 Fix 64-bit fortran options for riscv64
64-bit builds are currently broken without this flag.

Makefiles have done this already: 5720fa02c5/Makefile.system (L831)
2023-07-28 04:53:27 +03:00
Felix Yan
007cd834c1 Use defined variable for riscv64 in arch.cmake
It's defined in #4137
2023-07-28 04:50:16 +03:00
Chris Sidebottom
f971ef55f2 Add ARMV8SVE to AArch64 Dynamic Dispatch
In order to enable support for future cores which have similar tunings
(in this case I'm doing this for the Arm(R) Neoverse(TM) V2 core), this generically detects SVE support and enables it. This should better manage the size and complexity of dynamic dispatch rather than just copy-pasting the same parameters.

To make `ARMV8SVE` more representative of the common 128-bit SVE case,
I've split it and similar parameters from A64FX which has the wider
512-bit SVE.
2023-07-25 18:35:15 +01:00
Martin Kroeker
b61e64da6f Merge pull request #4142 from exyntech/armv8-as-arm64
Fix armv8 detection in system_check.cmake
2023-07-15 23:15:49 +02:00
Martin Kroeker
f82a197143 Merge pull request #4137 from felixonmars/patch-1
Fix riscv64 detection in system_check.cmake
2023-07-15 19:41:06 +02:00
Andy Mroczkowski
45b2cd2fb2 treat armv8 CMAKE_SYSTEM_PROCESSOR as arm64
The cmake scripts incorrectly treated armv8 as 32-bit arm, causing
compilation issues. This just adds 'armv8' to the arm64 condition check.
2023-07-12 09:37:45 -04:00
Martin Kroeker
35dedb68ce Add C versions of C/ZRSCL 2023-07-11 17:07:30 +02:00
Felix Yan
a721fccfdc Fix riscv64 detection in system_check.cmake 2023-07-11 16:34:20 +03:00
Martin Kroeker
22ad23abb1 Merge pull request #4126 from martin-frbg/lapack839
Add C/ZRSCL for reciprocal scaling of a complex vector (Reference-LAPACK PR 839)
2023-07-10 01:49:33 +02:00
Martin Kroeker
e3277e134c Add CRSCL/ZRSCL (Reference-LAPACK PR 839) 2023-07-05 14:22:59 +02:00
Martin Kroeker
2f9f70584e Fix target list syntax for ARM64 DYNAMIC_ARCH 2023-07-05 09:48:54 +02:00
Martin Kroeker
ac698cedad Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds 2023-07-05 09:47:49 +02:00
Martin Kroeker
fa6d06359a correct list placement of zgedmd/zgedmdq 2023-06-21 17:17:31 +02:00
Martin Kroeker
2bfd7ff126 Add dummy C sources for ?GEDMD 2023-06-20 21:39:29 +02:00
Martin Kroeker
defafd1353 Add functions for Dynamic Mode Decomposition (Reference-LAPACK PR 736) 2023-06-20 15:07:53 +02:00
Martin Kroeker
83d6ce1289 Add interfaces for Dynamic Mode Decomposition (Reference-LAPACK PR 736) 2023-06-20 14:55:26 +02:00
Martin Kroeker
b6a28adeaf Add ?TRSYL3 (Reference-LAPACK PR 651) 2023-06-20 11:33:03 +02:00
Martin Kroeker
c3a2d407a0 Merge pull request #4048 from imzhuhl/spr_sbgemm_fix
Sapphire Rapids sbgemm fix
2023-06-17 20:47:09 +02:00
Martin Kroeker
d2144b2981 Add NVHPC 2023-06-09 19:01:15 +02:00
Martin Kroeker
f8c8b22740 Add NVHPC(pgf95) 2023-06-09 18:58:52 +02:00
Martin Kroeker
2993a73262 Add NVHPC(nvc) 2023-06-09 18:56:13 +02:00
Martin Kroeker
f981e37dda Treat f95 as gfortran 2023-06-05 20:34:31 +02:00
Honglin Zhu
d96d386d13 Add CMake dynamic build for Sapphire Rapids 2023-05-20 14:13:49 +08:00
Martin Kroeker
a789211a2e Add interfaces for [cz]unhr_col and [sd]orhr_col (Reference-LAPACK PR 827) 2023-05-18 23:56:27 +02:00
Martin Kroeker
57bdc36c84 add conditionals for BUILD_LAPACK_DEPRECATED 2023-04-11 22:38:38 +02:00
Martin Kroeker
1659750861 Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959)
* Add ARMV5
2023-03-26 00:22:06 +01:00
Pieter Pas
a79ae116e2 Add ARMV6 to cmake/prebuild.cmake,
based on cpuid_arm.c and param.h
2023-03-25 16:30:42 +01:00
Johannes Schönberger
fbfe474583 f 2023-03-19 16:23:44 +01:00
Johannes Schönberger
ec4700087e Wrap variables in cmake/prebuild in quotes to handle spaces in paths 2023-03-19 16:09:47 +01:00
Martin Kroeker
de937b3194 Add clang option to avoid running out of registers in AVX512 assembly 2023-03-17 21:22:37 +01:00
Martin Kroeker
50c263716e Correct DGEMM_UNROLL_M value for A53 in cross-compile 2023-03-10 11:53:24 +01:00
Martin Kroeker
8272dfc552 Handle unrecognized ASM compiler (from Arm Compiler 22.1) 2023-03-03 00:21:59 +01:00
xctan
6a0de3aa39 Add missing RISC-V architecture in arch.cmake
RISC-V support exists in Makefile.system but is missing in arch.cmake. This patch adds riscv64 platform support to the CMake build system, just as 039e27545f/Makefile.system (L830-L832) did.
2023-02-26 20:21:57 +08:00
Martin Kroeker
fa5ff7d199 slarmm/dlarmm are needed by COMPLEX/COMPLEX16-only builds too 2023-02-22 00:25:12 +01:00
Martin Kroeker
19a696f8fe fix nested conditionals 2023-02-02 19:59:49 +01:00
Martin Kroeker
e964ebd0d0 Add compiler option for AVX512-capable Ryzen(4) 2023-02-02 19:04:05 +01:00
Martin Kroeker
afff3f5510 Fix typo that could clear EXTRALIB settings 2023-02-02 10:43:44 +01:00
Martin Kroeker
5ddcc5ffa2 Update arch.cmake 2022-12-25 10:37:39 +01:00
Martin Kroeker
18cac5a34c SVE-enabled ARM64 targets in DYNAMIC_ARCH require a recent compiler 2022-12-25 10:25:13 +01:00
Martin Kroeker
88cd91c490 Fix stray character 2022-11-19 23:15:20 +01:00
Martin Kroeker
5ff46f4092 Add ?GELST (Reference-LAPACK PR739) 2022-11-19 22:49:31 +01:00
Martin Kroeker
e4a31c0d23 add ?LANGB interface (Reference-LAPACK PR725) 2022-11-17 18:15:04 +01:00
Martin Kroeker
258a83a642 change line endings from CRLF to LF 2022-11-16 21:57:42 +01:00
Martin Kroeker
f16aa1ce7a Merge pull request #3821 from martin-frbg/lapack651
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
2022-11-16 13:59:02 +01:00
Martin Kroeker
b6d74b7fff Add f2c-converted files for the BLAS3-based Sylvester solver 2022-11-15 16:26:44 +01:00