OpenBLAS/kernel
Martin Kroeker 93d975d8fd
Merge pull request #4593 from XiWeiGu/loongarch_add_buffer_offset
loongarch: Optimizing the performance of the GEMM on servers
2024-04-10 14:23:31 +02:00
alpha alpha: Remove include of version.h 2022-08-11 15:02:58 +01:00
arm fix loop condition for incx < 0 2024-03-12 15:46:23 +01:00
arm64 Add support for Cortex-A76 2024-04-02 19:41:44 +02:00
csky Add CSKY support 2024-01-16 23:45:06 +08:00
e2k Add default KERNEL file for Elbrus E2K arch 2022-01-22 18:59:36 +01:00
generic Optimized cgemm kernel 16x4 LASX for LoongArch 2024-02-21 15:28:47 +08:00
ia64 Add ia64 implementation of ?sum 2019-03-30 22:18:03 +01:00
loongarch64 loongarch: Fixed dzamax 2024-03-25 23:17:53 -04:00
mips fix loop condition for incx < 0 2024-03-12 15:48:00 +01:00
mips64 Merge pull request #4185 from XiWeiGu/mips_enable_msa 2024-02-05 15:50:16 +01:00
power Revert "Merge pull request #4532 from austinpagan/cgemm_zgemm_c_code" 2024-03-01 07:57:39 -06:00
riscv64 Update nrm2_rvv.c 2024-03-13 13:07:26 +01:00
simd fix the CI failure of lack the head 2020-11-12 17:35:17 +08:00
sparc Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:58:57 +02:00
x86 Handle NAN 2024-01-08 16:11:25 +01:00
x86_64 Add CSUM and ZSUM kernels (trivially derived from their existing ASUM counterparts) 2024-02-24 23:57:50 +01:00
zarch Fix erroneous mapping of SUM kernels to ASUM 2024-02-27 11:28:50 +01:00
CMakeLists.txt Adding USE_GEMM3M macro to kernel targets, so that the *gemm3m functions and parameters can be included into the gotoblas structure. Fixes #4500 2024-02-12 02:29:58 +01:00
Makefile powerpc: Fix build errors with Open XL C 2023-10-04 14:04:03 -05:00
Makefile.L1 Conditionally add -mfma to compiler options where needed 2020-12-17 11:34:05 +01:00
Makefile.L2 make SSYMV available to BUILD_DOUBLE-only builds 2023-02-22 00:30:20 +01:00
Makefile.L3 (Re)apply fixes for supporting only a subset of precision types from PR 3915 2023-11-04 23:48:59 +01:00
Makefile.LA Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
setparam-ref.c Merge pull request #4593 from XiWeiGu/loongarch_add_buffer_offset 2024-04-10 14:23:31 +02:00