OpenBLAS/kernel/loongarch64
yancheng d32f38fb37 loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
KERNEL LoongArch64: Add DYNAMIC_ARCH support 2022-07-28 14:28:45 +08:00
KERNEL.LOONGSON2K1000 loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
KERNEL.LOONGSON3R5 loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
KERNEL.generic LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2 2023-08-07 11:20:42 +08:00
Makefile Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
amax.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
amin.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
asum.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
cnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:50:44 +02:00
copy.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
damax_lasx.S loongarch64: Add optimizations for amax. 2023-12-07 14:36:07 +08:00
damax_lsx.S loongarch64: Add optimizations for amax. 2023-12-07 14:36:07 +08:00
damin_lasx.S loongarch64: Add optimization for amin. 2023-12-07 14:36:07 +08:00
damin_lsx.S loongarch64: Add optimization for amin. 2023-12-07 14:36:07 +08:00
dasum_lasx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
dasum_lsx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
daxpby_lasx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
daxpby_lsx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
daxpy_lasx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
daxpy_lsx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
dcopy_lasx.S loongarch64: Add optimizations for copy. 2023-12-07 14:36:26 +08:00
dcopy_lsx.S loongarch64: Add optimizations for copy. 2023-12-07 14:36:26 +08:00
dgemm_kernel_16x4.S LoongArch64: Update dgemm kernel 2023-08-07 11:06:52 +08:00
dgemm_ncopy_4.S loongarch64: Optimize dgemm_kernel 2021-12-21 09:33:06 +08:00
dgemm_ncopy_16.S loongarch64: Optimize dgemm_kernel 2021-12-21 09:33:06 +08:00
dgemm_tcopy_4.S loongarch64: Optimize dgemm_kernel 2021-12-21 09:33:06 +08:00
dgemm_tcopy_16.S loongarch64: Optimize dgemm_kernel 2021-12-21 09:33:06 +08:00
dgemv_n_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2023-09-27 10:05:27 +08:00
dgemv_t_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2023-09-27 10:05:27 +08:00
dmax_lasx.S loongarch64: Add optimization for max. 2023-12-07 14:36:07 +08:00
dmax_lsx.S loongarch64: Add optimization for max. 2023-12-07 14:36:07 +08:00
dmin_lasx.S loongarch64: Add optimization for min. 2023-12-07 14:36:07 +08:00
dmin_lsx.S loongarch64: Add optimization for min. 2023-12-07 14:36:07 +08:00
dnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:50:44 +02:00
dnrm2_lasx.S loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
dnrm2_lsx.S loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
dot.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
dot_lasx.S loongarch: Add optimization for dsdot kernel. 2023-11-28 20:24:16 +08:00
dot_lsx.S loongarch: Add LSX optimization for dot. 2023-11-28 20:24:18 +08:00
drot_lasx.S loongarch64: Add optimizations for rot. 2023-12-07 14:36:26 +08:00
drot_lsx.S loongarch64: Add optimizations for rot. 2023-12-07 14:36:26 +08:00
dscal_lasx.S loongarch64: Add optimizations for scal. 2023-12-07 14:36:07 +08:00
dscal_lsx.S loongarch64: Add optimizations for scal. 2023-12-07 14:36:07 +08:00
dsum_lasx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
dsum_lsx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
dswap_lasx.S loongarch64: Add optimizations for swap. 2023-12-07 14:36:26 +08:00
dswap_lsx.S loongarch64: Add optimizations for swap. 2023-12-07 14:36:26 +08:00
dtrsm_kernel_LN_16x4_lasx.S LoongArch64: Add dtrsm kernel 2023-09-26 15:45:14 +08:00
dtrsm_kernel_LT_16x4_lasx.S LoongArch64: Add dtrsm kernel 2023-09-26 15:45:14 +08:00
dtrsm_kernel_RN_16x4_lasx.S LoongArch64: Add dtrsm kernel 2023-09-26 15:45:14 +08:00
dtrsm_kernel_RT_16x4_lasx.S LoongArch64: Add dtrsm kernel 2023-09-26 15:45:14 +08:00
dtrsm_kernel_macro.S LoongArch64: Add dtrsm kernel 2023-09-26 15:45:14 +08:00
gemm_kernel.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
gemv_n.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
gemv_t.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
iamax.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
iamin.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
idamax_lasx.S loongarch64: Add optimizations for iamax. 2023-12-07 14:36:07 +08:00
idamax_lsx.S loongarch64: Add optimizations for iamax. 2023-12-07 14:36:07 +08:00
idamin_lasx.S loongarch64: Add optimizations for iamin. 2023-12-07 14:36:07 +08:00
idamin_lsx.S loongarch64: Add optimizations for iamin. 2023-12-07 14:36:07 +08:00
idmax_lasx.S loongarch64: Add optimizations for imax. 2023-12-07 14:36:07 +08:00
idmax_lsx.S loongarch64: Add optimizations for imax. 2023-12-07 14:36:07 +08:00
idmin_lasx.S loongarch64: Add optimizations for imin. 2023-12-07 14:36:07 +08:00
idmin_lsx.S loongarch64: Add optimizations for imin. 2023-12-07 14:36:07 +08:00
isamax_lasx.S loongarch64: Add optimizations for iamax. 2023-12-07 14:36:07 +08:00
isamax_lsx.S loongarch64: Add optimizations for iamax. 2023-12-07 14:36:07 +08:00
isamin_lasx.S loongarch64: Add optimizations for iamin. 2023-12-07 14:36:07 +08:00
isamin_lsx.S loongarch64: Add optimizations for iamin. 2023-12-07 14:36:07 +08:00
ismax_lasx.S loongarch64: Add optimizations for imax. 2023-12-07 14:36:07 +08:00
ismax_lsx.S loongarch64: Add optimizations for imax. 2023-12-07 14:36:07 +08:00
ismin_lasx.S loongarch64: Add optimizations for imin. 2023-12-07 14:36:07 +08:00
ismin_lsx.S loongarch64: Add optimizations for imin. 2023-12-07 14:36:07 +08:00
izamax.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
izamin.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
loongarch64_asm.S LoongArch64: Compatible with early internal toolchain 2023-08-31 16:55:29 +08:00
max.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
min.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
samax_lasx.S loongarch64: Add optimizations for amax. 2023-12-07 14:36:07 +08:00
samax_lsx.S loongarch64: Add optimizations for amax. 2023-12-07 14:36:07 +08:00
samin_lasx.S loongarch64: Add optimization for amin. 2023-12-07 14:36:07 +08:00
samin_lsx.S loongarch64: Add optimization for amin. 2023-12-07 14:36:07 +08:00
sasum_lasx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
sasum_lsx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
saxpby_lasx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
saxpby_lsx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
saxpy_lasx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
saxpy_lsx.S loongarch64: Add optimizations for axpy and axpby. 2023-12-07 14:36:26 +08:00
scal.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
scopy_lasx.S loongarch64: Add optimizations for copy. 2023-12-07 14:36:26 +08:00
scopy_lsx.S loongarch64: Add optimizations for copy. 2023-12-07 14:36:26 +08:00
sgemm_kernel_16x8_lasx.S LoongArch64: Compatible with early internal toolchain 2023-08-31 16:55:29 +08:00
sgemm_ncopy_8_lasx.S LoongArch64: Add sgemm_kernel 2023-08-23 16:08:43 +08:00
sgemm_ncopy_16_lasx.S LoongArch64: Add sgemm_kernel 2023-08-23 16:08:43 +08:00
sgemm_tcopy_8_lasx.S LoongArch64: Add sgemm_kernel 2023-08-23 16:08:43 +08:00
sgemm_tcopy_16_lasx.S LoongArch64: Add sgemm_kernel 2023-08-23 16:08:43 +08:00
sgemv_n_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2023-09-27 10:05:27 +08:00
sgemv_t_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2023-09-27 10:05:27 +08:00
smax_lasx.S loongarch64: Add optimization for max. 2023-12-07 14:36:07 +08:00
smax_lsx.S loongarch64: Add optimization for max. 2023-12-07 14:36:07 +08:00
smin_lasx.S loongarch64: Add optimization for min. 2023-12-07 14:36:07 +08:00
smin_lsx.S loongarch64: Add optimization for min. 2023-12-07 14:36:07 +08:00
snrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:50:44 +02:00
snrm2_lasx.S loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
snrm2_lsx.S loongarch64: Add optimizations for nrm2. 2023-12-07 14:36:26 +08:00
srot_lasx.S loongarch64: Add optimizations for rot. 2023-12-07 14:36:26 +08:00
srot_lsx.S loongarch64: Add optimizations for rot. 2023-12-07 14:36:26 +08:00
sscal_lasx.S loongarch64: Add optimizations for scal. 2023-12-07 14:36:07 +08:00
sscal_lsx.S loongarch64: Add optimizations for scal. 2023-12-07 14:36:07 +08:00
ssum_lasx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
ssum_lsx.S loongarch64: Add optimizations for sum and asum. 2023-12-07 14:36:26 +08:00
sswap_lasx.S loongarch64: Add optimizations for swap. 2023-12-07 14:36:26 +08:00
sswap_lsx.S loongarch64: Add optimizations for swap. 2023-12-07 14:36:26 +08:00
swap.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
trsm_kernel_LN.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
trsm_kernel_LT.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
trsm_kernel_RT.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zamax.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zamin.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zasum.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zcopy.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
zdot.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
zgemm3m_kernel.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zgemm_kernel.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
zgemv_n.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
zgemv_t.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
znrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:50:44 +02:00
zscal.S Delete the macro instruction "li" and use "li.d" instead 2021-08-12 17:02:54 +08:00
ztrsm_kernel_LT.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
ztrsm_kernel_RT.S Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00