OpenBLAS/kernel
Martin Kroeker 2efa3b70dc
Add workaround for NVIDIA HPC
2021-01-12 16:49:39 +01:00
alpha Add implementations of ssum/dsum and csum/zsum 2019-03-30 22:05:11 +01:00
arm Support NVIDIA HPC compiler 2021-01-12 16:36:12 +01:00
arm64 Add workaround for NVIDIA HPC 2021-01-12 16:49:39 +01:00
generic Add the support for RISC-V Vector. 2020-10-15 16:09:02 +08:00
ia64 Add ia64 implementation of ?sum 2019-03-30 22:18:03 +01:00
mips Add msa support for loongson 2020-12-09 10:28:46 +08:00
mips64 Add msa support for loongson 2020-12-09 10:28:46 +08:00
power Optimize swap function for POWER10 2021-01-08 08:01:36 -06:00
riscv64 Refs #2899 2020-11-10 09:38:04 +08:00
simd fix the CI failure of lack the head 2020-11-12 17:35:17 +08:00
sparc Work around DOT and SWAP test failures 2020-12-06 19:15:37 +01:00
x86 Enable COOPERLAKE build target 2020-08-13 06:18:00 +08:00
x86_64 Disable FMA intrinsics in the srot kernel when the compiler is PGI/NVIDIA 2020-12-19 22:15:58 +01:00
zarch s390x: fix cscal and zscal implementations 2020-09-21 13:10:05 +02:00
CMakeLists.txt Change "HALF" and "sh" to "BFLOAT16" and "sb" 2020-10-12 00:00:55 +02:00
Makefile Amend SkylakeX options to support the NVIDIA compiler 2020-12-19 22:11:49 +01:00
Makefile.L1 Conditionally add -mfma to compiler options where needed 2020-12-17 11:34:05 +01:00
Makefile.L2 Implementation of BF16 based gemv 2020-10-29 02:08:23 +08:00
Makefile.L3 Add msa support for loongson 2020-12-09 10:28:46 +08:00
Makefile.LA Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
setparam-ref.c Add msa support for loongson 2020-12-09 10:28:46 +08:00