File | Commit message | Commit date
KERNEL | LoongArch64: Add DYNAMIC_ARCH support | 2022-07-28 14:28:45 +08:00
KERNEL.LOONGSON2K1000 | Optimized ssymv and dsymv kernel LSX for LoongArch | 2024-03-05 20:36:59 +08:00
KERNEL.LOONGSON3R5 | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
KERNEL.generic | LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2 | 2023-08-07 11:20:42 +08:00
Makefile | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
amax.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
amax_lasx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
amax_lsx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
amin.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
amin_lasx.S | loongarch: Fixed {s/d/sc/dz}amin LASX opt | 2024-03-16 14:52:14 +08:00
amin_lsx.S | loongarch: Fixed {s/d/sc/dz}amin LSX opt | 2024-03-19 09:18:44 +08:00
asum.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
asum_lasx.S | loongarch64: Add and Refine asum optimization functions. | 2023-12-29 17:30:57 +08:00
asum_lsx.S | loongarch64: Add and Refine asum optimization functions. | 2023-12-29 17:30:57 +08:00
axpby_lasx.S | loongarch: Fixed {s/d/c/z}axpby LASX opt | 2024-03-16 14:51:56 +08:00
axpby_lsx.S | loongarch: Fixed {s/d}axpby LSX opt | 2024-03-18 17:51:42 +08:00
axpy_lasx.S | loongarch64: Refine and add axpy optimization functions. | 2023-12-29 17:30:57 +08:00
axpy_lsx.S | loongarch64: Refine and add axpy optimization functions. | 2023-12-29 17:30:57 +08:00
camax_lasx.S | loongarch: Fixed dzamax | 2024-03-25 23:17:53 -04:00
camax_lsx.S | loongarch: Fixed dzamax | 2024-03-25 23:17:53 -04:00
camin_lasx.S | loongarch: Fixed {s/d/sc/dz}amin LASX opt | 2024-03-16 14:52:14 +08:00
camin_lsx.S | loongarch: Fixed {s/d/sc/dz}amin LSX opt | 2024-03-19 09:18:44 +08:00
casum_lasx.S | loongarch64: Add and Refine asum optimization functions. | 2023-12-29 17:30:57 +08:00
casum_lsx.S | loongarch64: Add and Refine asum optimization functions. | 2023-12-29 17:30:57 +08:00
caxpby_lasx.S | loongarch: Fixed {s/d/c/z}axpby LASX opt | 2024-03-16 14:51:56 +08:00
caxpby_lsx.S | LoongArch64: Opt {c/z}axpby | 2024-02-04 11:23:31 +08:00
caxpy_lasx.S | loongarch64: Refine and add axpy optimization functions. | 2023-12-29 17:30:57 +08:00
caxpy_lsx.S | loongarch64: Refine and add axpy optimization functions. | 2023-12-29 17:30:57 +08:00
ccopy_lasx.S | loongarch64: Add c/zcopy optimization functions. | 2023-12-29 17:30:57 +08:00
ccopy_lsx.S | loongarch64: Add c/zcopy optimization functions. | 2023-12-29 17:30:57 +08:00
cdot_lasx.S | loongarch64: Add c/zdot optimization functions. | 2023-12-29 17:30:57 +08:00
cdot_lsx.S | loongarch64: Add c/zdot optimization functions. | 2023-12-29 17:30:57 +08:00
cgemm_kernel_2x2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_kernel_2x2_lsx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_kernel_8x4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
cgemm_kernel_16x4_lasx.S | Optimized cgemm kernel 16x4 LASX for LoongArch | 2024-02-21 15:28:47 +08:00
cgemm_ncopy_2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_ncopy_2_lsx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_ncopy_4_lasx.S | Optimized cgemm kernel 16x4 LASX for LoongArch | 2024-02-21 15:28:47 +08:00
cgemm_ncopy_4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
cgemm_ncopy_8_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
cgemm_ncopy_16_lasx.S | Optimized cgemm kernel 16x4 LASX for LoongArch | 2024-02-21 15:28:47 +08:00
cgemm_tcopy_2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_tcopy_2_lsx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
cgemm_tcopy_4_lasx.S | Optimized cgemm kernel 16x4 LASX for LoongArch | 2024-02-21 15:28:47 +08:00
cgemm_tcopy_4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
cgemm_tcopy_8_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
cgemm_tcopy_16_lasx.S | Optimized cgemm kernel 16x4 LASX for LoongArch | 2024-02-21 15:28:47 +08:00
cgemv_n_4_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
cgemv_n_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
cgemv_t_4_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
cgemv_t_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
cnrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00
cnrm2_lasx.S | loongarch64: Add c/znrm2 optimization functions. | 2023-12-29 17:30:57 +08:00
cnrm2_lsx.S | loongarch64: Add c/znrm2 optimization functions. | 2023-12-29 17:30:57 +08:00
copy.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
copy_lasx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
copy_lsx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
crot_lasx.S | loongarch64: Add c/zrot optimization functions. | 2023-12-29 17:30:57 +08:00
crot_lsx.S | loongarch: Fixed zrot LSX opt | 2024-03-19 10:08:11 +08:00
cscal_lasx.S | LoongArch64: Handle NAN and INF | 2024-01-30 17:17:30 +08:00
cscal_lsx.S | LoongArch64: Handle NAN and INF | 2024-01-30 17:17:30 +08:00
csum_lasx.S | loongarch: Fixed {s/d/c/z}sum LASX opt | 2024-03-16 14:52:10 +08:00
csum_lsx.S | loongarch64: Add {c/z}swap and {c/z}sum optimization | 2023-12-29 17:30:57 +08:00
cswap_lasx.S | loongarch64: Add {c/z}swap and {c/z}sum optimization | 2023-12-29 17:30:57 +08:00
cswap_lsx.S | loongarch64: Add {c/z}swap and {c/z}sum optimization | 2023-12-29 17:30:57 +08:00
dgemm_kernel_8x4.S | Add dgemm_kernel_8x4.S file. | 2023-12-29 17:30:57 +08:00
dgemm_kernel_16x4.S | LoongArch64: Update dgemm kernel | 2023-08-07 11:06:52 +08:00
dgemm_kernel_16x6.S | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
dgemm_ncopy_4.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00
dgemm_ncopy_4_lsx.S | Optimize copy functions with lsx. | 2023-12-29 17:30:57 +08:00
dgemm_ncopy_8_lsx.S | loongarch64: Fixed utest fork:safety | 2024-05-10 14:16:36 +08:00
dgemm_ncopy_16.S | loongarch64: Fixed utest fork:safety | 2024-05-10 14:16:36 +08:00
dgemm_small_kernel_nn_lasx.S | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
dgemm_small_kernel_nt_lasx.S | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
dgemm_small_kernel_tn_lasx.S | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
dgemm_small_kernel_tt_lasx.S | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
dgemm_small_matrix_permit.c | LoongArch: DGEMM small matrix opt | 2024-06-04 16:52:45 +08:00
dgemm_tcopy_4.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00
dgemm_tcopy_4_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dgemm_tcopy_6.S | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
dgemm_tcopy_8_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dgemm_tcopy_16.S | loongarch64: Optimize dgemm_kernel | 2021-12-21 09:33:06 +08:00
dgemv_n_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dgemv_n_lsx.S | Optimized sgemv and dgemv kernel LSX for LoongArch | 2024-02-28 18:07:27 +08:00
dgemv_t_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dgemv_t_lsx.S | Optimized sgemv and dgemv kernel LSX for LoongArch | 2024-02-28 18:07:27 +08:00
dnrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00
dnrm2_lasx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
dnrm2_lsx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
dot.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
dot_lasx.S | remove stray comma | 2024-04-23 21:52:26 +02:00
dot_lsx.S | remove stray comma | 2024-05-09 12:33:19 +02:00
dscal_lasx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00
dscal_lsx.S | loongarch64: Add optimizations for scal. | 2023-12-07 14:36:07 +08:00
dsymv_L_lasx.S | Optimized ssymv and dsymv kernel LASX for LoongArch | 2024-03-04 16:18:39 +08:00
dsymv_L_lsx.S | Optimized ssymv and dsymv kernel LSX for LoongArch | 2024-03-05 20:36:59 +08:00
dsymv_U_lasx.S | Optimized ssymv and dsymv kernel LASX for LoongArch | 2024-03-04 16:18:39 +08:00
dsymv_U_lsx.S | Optimized ssymv and dsymv kernel LSX for LoongArch | 2024-03-05 20:36:59 +08:00
dtrsm_kernel_LN_16x4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dtrsm_kernel_LT_16x4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dtrsm_kernel_RN_16x4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dtrsm_kernel_RT_16x4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
dtrsm_kernel_macro.S | LoongArch64: Add dtrsm kernel | 2023-09-26 15:45:14 +08:00
gemm_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
gemm_ncopy_6.prefx.c | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
gemv_n.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
gemv_t.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
iamax.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
iamax_lasx.S | loongarch64: Add and refine iamax optimization functions. | 2023-12-29 17:30:57 +08:00
iamax_lsx.S | loongarch64: Add and refine iamax optimization functions. | 2023-12-29 17:30:57 +08:00
iamin.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
iamin_lasx.S | loongarch: Fixed i{s/c/z}amin LASX opt | 2024-03-16 14:52:06 +08:00
iamin_lsx.S | loongarch64: Refine iamin optimization. | 2023-12-29 17:30:57 +08:00
icamax_lasx.S | loongarch64: Add and refine iamax optimization functions. | 2023-12-29 17:30:57 +08:00
icamax_lsx.S | loongarch64: Fixed icamax_lsx | 2024-05-10 14:16:40 +08:00
icamin_lasx.S | loongarch: Fixed i{s/c/z}amin LASX opt | 2024-03-16 14:52:06 +08:00
icamin_lsx.S | loongarch: Fixed i{c/z}amin LSX opt | 2024-03-18 17:15:58 +08:00
imax_lasx.S | loongarch64: Refine imax optimization. | 2023-12-29 17:30:57 +08:00
imax_lsx.S | loongarch64: Refine imax optimization. | 2023-12-29 17:30:57 +08:00
imin_lasx.S | loongarch64: Refine imin optimization. | 2023-12-29 17:30:57 +08:00
imin_lsx.S | loongarch64: Refine imin optimization. | 2023-12-29 17:30:57 +08:00
izamax.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
izamin.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
loongarch64_asm.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
max.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
max_lasx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
max_lsx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
min.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
min_lasx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
min_lsx.S | loongarch64: Refine amax,amin,max,min optimization. | 2023-12-29 17:30:57 +08:00
rot_lasx.S | LoongArch64: Fixed utest kernel_regress:skx_avx | 2024-02-26 02:04:37 -05:00
rot_lsx.S | LoongArch64: Fixed utest kernel_regress:skx_avx | 2024-02-26 02:04:37 -05:00
scal.S | LoongArch: Fixed numpy CI failure | 2024-07-15 11:43:08 +08:00
scal_lasx.S | LoongArch: Fixed numpy CI failure | 2024-07-15 11:43:08 +08:00
scal_lsx.S | LoongArch: Fixed numpy CI failure | 2024-07-15 11:43:08 +08:00
sgemm_kernel_16x8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemm_ncopy_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemm_ncopy_16_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemm_tcopy_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemm_tcopy_16_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemv_n_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemv_n_lsx.S | Optimized sgemv and dgemv kernel LSX for LoongArch | 2024-02-28 18:07:27 +08:00
sgemv_t_8_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
sgemv_t_lsx.S | Optimized sgemv and dgemv kernel LSX for LoongArch | 2024-02-28 18:07:27 +08:00
snrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00
snrm2_lasx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
snrm2_lsx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
ssymv_L_lasx.S | Optimized ssymv and dsymv kernel LASX for LoongArch | 2024-03-04 16:18:39 +08:00
ssymv_L_lsx.S | Optimized ssymv and dsymv kernel LSX for LoongArch | 2024-03-05 20:36:59 +08:00
ssymv_U_lasx.S | Optimized ssymv and dsymv kernel LASX for LoongArch | 2024-03-04 16:18:39 +08:00
ssymv_U_lsx.S | Optimized ssymv and dsymv kernel LSX for LoongArch | 2024-03-05 20:36:59 +08:00
sum_lasx.S | loongarch: Fixed {s/d/c/z}sum LASX opt | 2024-03-16 14:52:10 +08:00
sum_lsx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
swap.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
swap_lasx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
swap_lsx.S | loongarch64: Refine copy,swap,nrm2,sum optimization. | 2023-12-29 17:30:57 +08:00
trsm_kernel_LN.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
trsm_kernel_LN_UNROLLN6.c | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
trsm_kernel_LT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
trsm_kernel_LT_UNROLLN6.c | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
trsm_kernel_RN_UNROLLN6.c | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
trsm_kernel_RT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
trsm_kernel_RT_UNROLLN6.c | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 2024-05-08 10:10:26 +08:00
zamax.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
zamin.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
zasum.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
zcopy.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
zdot.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
zgemm3m_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
zgemm_kernel.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
zgemm_kernel_2x2.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
zgemm_kernel_2x2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
zgemm_kernel_4x4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_kernel_8x4_lasx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_ncopy_2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
zgemm_ncopy_4_lasx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_ncopy_4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_ncopy_8_lasx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_tcopy_2_lasx.S | loongarch64: Add zgemm and cgemm optimization | 2023-12-29 18:06:26 +08:00
zgemm_tcopy_4_lasx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_tcopy_4_lsx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemm_tcopy_8_lasx.S | Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch | 2024-02-06 11:49:01 +08:00
zgemv_n.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
zgemv_n_2_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
zgemv_n_4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
zgemv_t.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
zgemv_t_2_lsx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
zgemv_t_4_lasx.S | loongarch64: Fixed clang compilation issues | 2024-04-23 19:19:11 +08:00
znrm2.S | Allow negative INCX (API change from version 3.10 of the reference implementation) | 2023-08-10 16:50:44 +02:00
znrm2_lasx.S | loongarch64: Add c/znrm2 optimization functions. | 2023-12-29 17:30:57 +08:00
znrm2_lsx.S | loongarch64: Add c/znrm2 optimization functions. | 2023-12-29 17:30:57 +08:00
zscal.S | Delete the macro instruction "li" and use "li.d" instead | 2021-08-12 17:02:54 +08:00
ztrsm_kernel_LT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00
ztrsm_kernel_RT.S | Add support for LOONGARCH64 | 2021-07-27 15:29:12 +08:00