.. |
KERNEL
|
declare DGEMM_BETA in KERNEL.ARMV8 rather than the generic KERNEL
|
2019-12-20 10:11:50 +08:00 |
KERNEL.ARMV8
|
update
|
2020-01-02 11:01:57 +08:00 |
KERNEL.CORTEXA53
|
Simplifying ARMv8 build parameters
|
2018-11-19 16:41:49 +00:00 |
KERNEL.CORTEXA57
|
ARM64: Enable DYNAMIC_ARCH
|
2018-10-22 01:49:35 -07:00 |
KERNEL.CORTEXA72
|
Simplifying ARMv8 build parameters
|
2018-11-19 16:41:49 +00:00 |
KERNEL.CORTEXA73
|
Simplifying ARMv8 build parameters
|
2018-11-19 16:41:49 +00:00 |
KERNEL.FALKOR
|
Simplifying ARMv8 build parameters
|
2018-11-19 16:41:49 +00:00 |
KERNEL.THUNDERX
|
ARM64: Enable DYNAMIC_ARCH
|
2018-10-22 01:49:35 -07:00 |
KERNEL.THUNDERX2T99
|
ARM64: Enable DYNAMIC_ARCH
|
2018-10-22 01:49:35 -07:00 |
KERNEL.TSV110
|
update
|
2020-01-02 11:01:57 +08:00 |
Makefile
|
added experimental support for ARMV8
|
2013-11-24 15:47:00 +01:00 |
amax.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
asum.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
axpy.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
casum.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
casum_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
cgemm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
cgemm_kernel_8x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
cgemm_kernel_8x4_thunderx2t99.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
copy.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
copy_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
csum.S
|
Add ARM64 implementations of ?sum
|
2019-03-30 22:13:36 +01:00 |
ctrmm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
ctrmm_kernel_8x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dasum_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
daxpy_thunderx.c
|
ARM64: Rename kernel files to have consistent naming
|
2017-01-24 14:53:34 +05:30 |
daxpy_thunderx2t99.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
ddot_thunderx.c
|
ARM64: Rename kernel files to have consistent naming
|
2017-01-24 14:53:34 +05:30 |
dgemm_beta.S
|
kernel/arm64/dgemm_beta.S: add beta == zero branch
|
2020-01-01 21:50:45 -05:00 |
dgemm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_kernel_4x8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_kernel_8x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_kernel_8x4_thunderx2t99.S
|
ARM64: Move parameters from parameter.c to param.h
|
2018-10-22 01:45:51 -07:00 |
dgemm_ncopy_4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_ncopy_8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_tcopy_4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dgemm_tcopy_8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dot.S
|
ARM64: Fix utest dsdot errors
|
2018-02-27 10:47:55 +00:00 |
dot_thunderx.c
|
ARM64: Rename kernel files to have consistent naming
|
2017-01-24 14:53:34 +05:30 |
dot_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
dtrmm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dtrmm_kernel_4x8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dtrmm_kernel_8x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
dznrm2_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
dznrm2_thunderx2t99_fast.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
gemv_n.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
gemv_t.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
iamax.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
iamax_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
izamax.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
izamax_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
nrm2.S
|
Fix accidental duplication of jump instruction
|
2019-10-08 08:09:26 +02:00 |
rot.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sasum_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
scal.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
scnrm2_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
sgemm_beta.S
|
Use arm neon instructions to optimize sgemm_beta operation
|
2019-12-31 10:42:03 +08:00 |
sgemm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sgemm_kernel_8x8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sgemm_kernel_16x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sgemm_kernel_16x4_thunderx2t99.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sgemm_ncopy_4.S
|
Use arm neon instructions to optimize ncopy operation
|
2019-12-31 17:06:35 +08:00 |
sgemm_tcopy_16.S
|
[WIP] Use arm neon instructions to optimize tcopy operation
|
2019-12-31 10:21:23 +08:00 |
strmm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
strmm_kernel_8x8.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
strmm_kernel_16x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
sum.S
|
Add ARM64 implementations of ?sum
|
2019-03-30 22:13:36 +01:00 |
swap.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
swap_thunderx2t99.S
|
THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations
|
2017-02-03 03:55:06 -08:00 |
zamax.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zasum.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zasum_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
zaxpy.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zdot.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zdot_thunderx2t99.c
|
Fixed a few more unnecessary calls to num_cpu_avail.
|
2018-06-11 10:17:16 +01:00 |
zgemm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zgemm_kernel_4x4_thunderx2t99.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zgemv_n.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zgemv_t.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
znrm2.S
|
Remove automatic label postfixes from macro included only once
|
2019-10-08 08:37:50 +02:00 |
zrot.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zscal.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |
zsum.S
|
Add ARM64 implementations of ?sum
|
2019-03-30 22:13:36 +01:00 |
ztrmm_kernel_4x4.S
|
ARM64: Convert all labels to local labels
|
2017-10-24 11:40:05 +00:00 |