OpenBLAS/kernel
Martin Kroeker b0a590f4fe
Merge pull request #3475 from wjc404/optimize-A53-dgemm
optimize cgemm on ARM cortex A53 & cortex A55
2021-12-12 19:09:08 +01:00
..
alpha Add implementations of ssum/dsum and csum/zsum 2019-03-30 22:05:11 +01:00
arm Typo fix 2021-02-23 13:14:35 +01:00
arm64 optimize cgemm on ARM cortex A53 & cortex A55 2021-12-12 17:22:52 +08:00
generic Move a conditionally used variable 2021-09-11 14:37:44 +02:00
ia64 Add ia64 implementation of ?sum 2019-03-30 22:18:03 +01:00
loongarch64 Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
mips Add generic mips32 target 2021-11-20 17:31:51 +01:00
mips64 Add msa support for loongson 2020-12-09 10:28:46 +08:00
power [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
riscv64 riscv64: Add Makefile 2021-06-07 22:55:56 +00:00
simd fix the CI failure of lack the head 2020-11-12 17:35:17 +08:00
sparc Work around DOT and SWAP test failures 2020-12-06 19:15:37 +01:00
x86 initial support for Sapphire Rapids platform 2021-10-12 01:30:40 -07:00
x86_64 roll back DGEMM kernels to 4x8 when compiling for DYNAMIC_ARCH 2021-12-06 19:43:54 +01:00
zarch s390x: fix cscal and zscal implementations 2020-09-21 13:10:05 +02:00
CMakeLists.txt Fix error cmake (small kernels) 2021-12-09 09:57:39 -06:00
Makefile initial support for Sapphire Rapids platform 2021-10-12 01:30:40 -07:00
Makefile.L1 Conditionally add -mfma to compiler options where needed 2020-12-17 11:34:05 +01:00
Makefile.L2 Empirical workaround for numpy SVD NaN problem from issue 3318 2021-07-18 22:19:19 +02:00
Makefile.L3 adjust Makefile.L3 for SVE 2021-12-11 16:35:08 +01:00
Makefile.LA Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
setparam-ref.c initial support for Sapphire Rapids platform 2021-10-12 01:30:40 -07:00