OpenBLAS/kernel/arm64
Martin Kroeker 456ee2e1f0
Merge pull request #2357 from chenxuqiang/dgemm_beta_zero
kernel/arm64/dgemm_beta.S: add beta == zero branch
2020-01-02 22:28:36 +01:00
KERNEL declare DGEMM_BETA in KERNEL.ARMV8 rather than the generic KERNEL 2019-12-20 10:11:50 +08:00
KERNEL.ARMV8 update 2020-01-02 11:01:57 +08:00
KERNEL.CORTEXA53 Simplifying ARMv8 build parameters 2018-11-19 16:41:49 +00:00
KERNEL.CORTEXA57 ARM64: Enable DYNAMIC_ARCH 2018-10-22 01:49:35 -07:00
KERNEL.CORTEXA72 Simplifying ARMv8 build parameters 2018-11-19 16:41:49 +00:00
KERNEL.CORTEXA73 Simplifying ARMv8 build parameters 2018-11-19 16:41:49 +00:00
KERNEL.FALKOR Simplifying ARMv8 build parameters 2018-11-19 16:41:49 +00:00
KERNEL.THUNDERX ARM64: Enable DYNAMIC_ARCH 2018-10-22 01:49:35 -07:00
KERNEL.THUNDERX2T99 ARM64: Enable DYNAMIC_ARCH 2018-10-22 01:49:35 -07:00
KERNEL.TSV110 update 2020-01-02 11:01:57 +08:00
Makefile added experimental support for ARMV8 2013-11-24 15:47:00 +01:00
amax.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
asum.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
axpy.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
casum.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
casum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
cgemm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
cgemm_kernel_8x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
cgemm_kernel_8x4_thunderx2t99.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
copy.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
copy_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
csum.S Add ARM64 implementations of ?sum 2019-03-30 22:13:36 +01:00
ctrmm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
ctrmm_kernel_8x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
daxpy_thunderx.c ARM64: Rename kernel files to have consistent naming 2017-01-24 14:53:34 +05:30
daxpy_thunderx2t99.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
ddot_thunderx.c ARM64: Rename kernel files to have consistent naming 2017-01-24 14:53:34 +05:30
dgemm_beta.S kernel/arm64/dgemm_beta.S: add beta == zero branch 2020-01-01 21:50:45 -05:00
dgemm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_kernel_4x8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_kernel_8x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_kernel_8x4_thunderx2t99.S ARM64: Move parameters from parameter.c to param.h 2018-10-22 01:45:51 -07:00
dgemm_ncopy_4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_ncopy_8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_tcopy_4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dgemm_tcopy_8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dot.S ARM64: Fix utest dsdot errors 2018-02-27 10:47:55 +00:00
dot_thunderx.c ARM64: Rename kernel files to have consistent naming 2017-01-24 14:53:34 +05:30
dot_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
dtrmm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dtrmm_kernel_4x8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dtrmm_kernel_8x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
dznrm2_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
dznrm2_thunderx2t99_fast.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
gemv_n.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
gemv_t.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
iamax.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
iamax_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
izamax.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
izamax_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
nrm2.S Fix accidental duplication of jump instruction 2019-10-08 08:09:26 +02:00
rot.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
scal.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
scnrm2_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
sgemm_beta.S Use arm neon instructions to optimize sgemm_beta operation 2019-12-31 10:42:03 +08:00
sgemm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sgemm_kernel_8x8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sgemm_kernel_16x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sgemm_kernel_16x4_thunderx2t99.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sgemm_ncopy_4.S Use arm neon instructions to optimize ncopy operation 2019-12-31 17:06:35 +08:00
sgemm_tcopy_16.S [WIP] Use arm neon instructions to optimize tcopy operation 2019-12-31 10:21:23 +08:00
strmm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
strmm_kernel_8x8.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
strmm_kernel_16x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
sum.S Add ARM64 implementations of ?sum 2019-03-30 22:13:36 +01:00
swap.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
swap_thunderx2t99.S THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations 2017-02-03 03:55:06 -08:00
zamax.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zasum.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
zaxpy.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zdot.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zdot_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 2018-06-11 10:17:16 +01:00
zgemm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zgemm_kernel_4x4_thunderx2t99.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zgemv_n.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zgemv_t.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
znrm2.S Remove automatic label postfixes from macro included only once 2019-10-08 08:37:50 +02:00
zrot.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zscal.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00
zsum.S Add ARM64 implementations of ?sum 2019-03-30 22:13:36 +01:00
ztrmm_kernel_4x4.S ARM64: Convert all labels to local labels 2017-10-24 11:40:05 +00:00