| KERNEL | LoongArch64: Add DYNAMIC_ARCH support | 2022-07-28 14:28:45 +08:00 |
| KERNEL.LOONGSON2K1000 | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| KERNEL.LOONGSON3R5 | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| KERNEL.generic | LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2 | 2023-08-07 11:20:42 +08:00 |
| Makefile | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| amax.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| amin.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| asum.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| cnrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00 |
| copy.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| damax_lasx.S | loongarch64: Add optimizations for amax. | 2023-12-07 14:36:07 +08:00 |
| damax_lsx.S | loongarch64: Add optimizations for amax. | 2023-12-07 14:36:07 +08:00 |
| damin_lasx.S | loongarch64: Add optimization for amin. | 2023-12-07 14:36:07 +08:00 |
| damin_lsx.S | loongarch64: Add optimization for amin. | 2023-12-07 14:36:07 +08:00 |
| dasum_lasx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| dasum_lsx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| daxpby_lasx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| daxpby_lsx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| daxpy_lasx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| daxpy_lsx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| dcopy_lasx.S | loongarch64: Add optimizations for copy. | 2023-12-07 14:36:26 +08:00 |
| dcopy_lsx.S | loongarch64: Add optimizations for copy. | 2023-12-07 14:36:26 +08:00 |
| dgemm_kernel_16x4.S | LoongArch64: Update dgemm kernel | 2023-08-07 11:06:52 +08:00 |
| dgemm_ncopy_4.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00 |
| dgemm_ncopy_16.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00 |
| dgemm_tcopy_4.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00 |
| dgemm_tcopy_16.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00 |
| dgemv_n_8_lasx.S | LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH | 2023-09-27 10:05:27 +08:00 |
| dgemv_t_8_lasx.S | LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH | 2023-09-27 10:05:27 +08:00 |
| dmax_lasx.S | loongarch64: Add optimization for max. | 2023-12-07 14:36:07 +08:00 |
| dmax_lsx.S | loongarch64: Add optimization for max. | 2023-12-07 14:36:07 +08:00 |
| dmin_lasx.S | loongarch64: Add optimization for min. | 2023-12-07 14:36:07 +08:00 |
| dmin_lsx.S | loongarch64: Add optimization for min. | 2023-12-07 14:36:07 +08:00 |
| dnrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00 |
| dnrm2_lasx.S | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| dnrm2_lsx.S | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| dot.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| dot_lasx.S | loongarch: Add optimization for dsdot kernel. | 2023-11-28 20:24:16 +08:00 |
| dot_lsx.S | loongarch: Add LSX optimization for dot. | 2023-11-28 20:24:18 +08:00 |
| drot_lasx.S | loongarch64: Add optimizations for rot. | 2023-12-07 14:36:26 +08:00 |
| drot_lsx.S | loongarch64: Add optimizations for rot. | 2023-12-07 14:36:26 +08:00 |
| dscal_lasx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00 |
| dscal_lsx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00 |
| dsum_lasx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| dsum_lsx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| dswap_lasx.S | loongarch64: Add optimizations for swap. | 2023-12-07 14:36:26 +08:00 |
| dswap_lsx.S | loongarch64: Add optimizations for swap. | 2023-12-07 14:36:26 +08:00 |
| dtrsm_kernel_LN_16x4_lasx.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00 |
| dtrsm_kernel_LT_16x4_lasx.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00 |
| dtrsm_kernel_RN_16x4_lasx.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00 |
| dtrsm_kernel_RT_16x4_lasx.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00 |
| dtrsm_kernel_macro.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00 |
| gemm_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| gemv_n.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| gemv_t.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| iamax.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| iamin.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| idamax_lasx.S | loongarch64: Add optimizations for iamax. | 2023-12-07 14:36:07 +08:00 |
| idamax_lsx.S | loongarch64: Add optimizations for iamax. | 2023-12-07 14:36:07 +08:00 |
| idamin_lasx.S | loongarch64: Add optimizations for iamin. | 2023-12-07 14:36:07 +08:00 |
| idamin_lsx.S | loongarch64: Add optimizations for iamin. | 2023-12-07 14:36:07 +08:00 |
| idmax_lasx.S | loongarch64: Add optimizations for imax. | 2023-12-07 14:36:07 +08:00 |
| idmax_lsx.S | loongarch64: Add optimizations for imax. | 2023-12-07 14:36:07 +08:00 |
| idmin_lasx.S | loongarch64: Add optimizations for imin. | 2023-12-07 14:36:07 +08:00 |
| idmin_lsx.S | loongarch64: Add optimizations for imin. | 2023-12-07 14:36:07 +08:00 |
| isamax_lasx.S | loongarch64: Add optimizations for iamax. | 2023-12-07 14:36:07 +08:00 |
| isamax_lsx.S | loongarch64: Add optimizations for iamax. | 2023-12-07 14:36:07 +08:00 |
| isamin_lasx.S | loongarch64: Add optimizations for iamin. | 2023-12-07 14:36:07 +08:00 |
| isamin_lsx.S | loongarch64: Add optimizations for iamin. | 2023-12-07 14:36:07 +08:00 |
| ismax_lasx.S | loongarch64: Add optimizations for imax. | 2023-12-07 14:36:07 +08:00 |
| ismax_lsx.S | loongarch64: Add optimizations for imax. | 2023-12-07 14:36:07 +08:00 |
| ismin_lasx.S | loongarch64: Add optimizations for imin. | 2023-12-07 14:36:07 +08:00 |
| ismin_lsx.S | loongarch64: Add optimizations for imin. | 2023-12-07 14:36:07 +08:00 |
| izamax.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| izamin.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| loongarch64_asm.S | LoongArch64: Compatible with early internal toolchain | 2023-08-31 16:55:29 +08:00 |
| max.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| min.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| samax_lasx.S | loongarch64: Add optimizations for amax. | 2023-12-07 14:36:07 +08:00 |
| samax_lsx.S | loongarch64: Add optimizations for amax. | 2023-12-07 14:36:07 +08:00 |
| samin_lasx.S | loongarch64: Add optimization for amin. | 2023-12-07 14:36:07 +08:00 |
| samin_lsx.S | loongarch64: Add optimization for amin. | 2023-12-07 14:36:07 +08:00 |
| sasum_lasx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| sasum_lsx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| saxpby_lasx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| saxpby_lsx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| saxpy_lasx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| saxpy_lsx.S | loongarch64: Add optimizations for axpy and axpby. | 2023-12-07 14:36:26 +08:00 |
| scal.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| scopy_lasx.S | loongarch64: Add optimizations for copy. | 2023-12-07 14:36:26 +08:00 |
| scopy_lsx.S | loongarch64: Add optimizations for copy. | 2023-12-07 14:36:26 +08:00 |
| sgemm_kernel_16x8_lasx.S | LoongArch64: Compatible with early internal toolchain | 2023-08-31 16:55:29 +08:00 |
| sgemm_ncopy_8_lasx.S | LoongArch64: Add sgemm_kernel | 2023-08-23 16:08:43 +08:00 |
| sgemm_ncopy_16_lasx.S | LoongArch64: Add sgemm_kernel | 2023-08-23 16:08:43 +08:00 |
| sgemm_tcopy_8_lasx.S | LoongArch64: Add sgemm_kernel | 2023-08-23 16:08:43 +08:00 |
| sgemm_tcopy_16_lasx.S | LoongArch64: Add sgemm_kernel | 2023-08-23 16:08:43 +08:00 |
| sgemv_n_8_lasx.S | LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH | 2023-09-27 10:05:27 +08:00 |
| sgemv_t_8_lasx.S | LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH | 2023-09-27 10:05:27 +08:00 |
| smax_lasx.S | loongarch64: Add optimization for max. | 2023-12-07 14:36:07 +08:00 |
| smax_lsx.S | loongarch64: Add optimization for max. | 2023-12-07 14:36:07 +08:00 |
| smin_lasx.S | loongarch64: Add optimization for min. | 2023-12-07 14:36:07 +08:00 |
| smin_lsx.S | loongarch64: Add optimization for min. | 2023-12-07 14:36:07 +08:00 |
| snrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00 |
| snrm2_lasx.S | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| snrm2_lsx.S | loongarch64: Add optimizations for nrm2. | 2023-12-07 14:36:26 +08:00 |
| srot_lasx.S | loongarch64: Add optimizations for rot. | 2023-12-07 14:36:26 +08:00 |
| srot_lsx.S | loongarch64: Add optimizations for rot. | 2023-12-07 14:36:26 +08:00 |
| sscal_lasx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00 |
| sscal_lsx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00 |
| ssum_lasx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| ssum_lsx.S | loongarch64: Add optimizations for sum and asum. | 2023-12-07 14:36:26 +08:00 |
| sswap_lasx.S | loongarch64: Add optimizations for swap. | 2023-12-07 14:36:26 +08:00 |
| sswap_lsx.S | loongarch64: Add optimizations for swap. | 2023-12-07 14:36:26 +08:00 |
| swap.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| trsm_kernel_LN.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| trsm_kernel_LT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| trsm_kernel_RT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zamax.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zamin.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zasum.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zcopy.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| zdot.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| zgemm3m_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zgemm_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| zgemv_n.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| zgemv_t.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| znrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00 |
| zscal.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00 |
| ztrsm_kernel_LT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |
| ztrsm_kernel_RT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00 |