OpenBLAS/kernel
Martin Kroeker 8e4c209002
Merge pull request #3398 from kavanabhat/aix_p10_gnuas
Big Endian Changes for Power10 kernels
2021-10-05 18:59:47 +02:00
alpha Add implementations of ssum/dsum and csum/zsum 2019-03-30 22:05:11 +01:00
arm Typo fix 2021-02-23 13:14:35 +01:00
arm64 Move alphaI to x22 to leave x18 unused (reserved on OSX) 2021-09-17 09:53:18 +02:00
generic Move a conditionally used variable 2021-09-11 14:37:44 +02:00
ia64 Add ia64 implementation of ?sum 2019-03-30 22:18:03 +01:00
loongarch64 Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
mips Add msa support for loongson 2020-12-09 10:28:46 +08:00
mips64 Add msa support for loongson 2020-12-09 10:28:46 +08:00
power AIX changes for P10 with GNU Compiler 2021-10-01 05:18:35 -05:00
riscv64 riscv64: Add Makefile 2021-06-07 22:55:56 +00:00
simd fix the CI failure of lack the head 2020-11-12 17:35:17 +08:00
sparc Work around DOT and SWAP test failures 2020-12-06 19:15:37 +01:00
x86 Enable COOPERLAKE build target 2020-08-13 06:18:00 +08:00
x86_64 x86_64: BFLOAT16: fix build warning 2021-09-28 18:30:06 +08:00
zarch s390x: fix cscal and zscal implementations 2020-09-21 13:10:05 +02:00
CMakeLists.txt Add dedicated entries for BFLOAT16 kernels 2021-09-14 16:17:18 +02:00
Makefile Amend SkylakeX options to support the NVIDIA compiler 2020-12-19 22:11:49 +01:00
Makefile.L1 Conditionally add -mfma to compiler options where needed 2020-12-17 11:34:05 +01:00
Makefile.L2 Empirical workaround for numpy SVD NaN problem from issue 3318 2021-07-18 22:19:19 +02:00
Makefile.L3 Add workaround for Windows10 macro name clash 2021-09-01 21:36:50 +02:00
Makefile.LA Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
setparam-ref.c Small Matrix: support BFLOAT16 data type 2021-08-30 17:40:20 +08:00