| .. |
|
KERNEL
|
LoongArch64: Add DYNAMIC_ARCH support
|
2022-07-28 14:28:45 +08:00 |
|
KERNEL.LOONGSON2K1000
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
|
KERNEL.LOONGSON3R5
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
KERNEL.generic
|
LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2
|
2023-08-07 11:20:42 +08:00 |
|
Makefile
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
amax.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
amax_lasx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
amax_lsx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
amin.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
amin_lasx.S
|
loongarch: Fixed {s/d/sc/dz}amin LASX opt
|
2024-03-16 14:52:14 +08:00 |
|
amin_lsx.S
|
loongarch: Fixed {s/d/sc/dz}amin LSX opt
|
2024-03-19 09:18:44 +08:00 |
|
asum.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
asum_lasx.S
|
loongarch64: Add and Refine asum optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
asum_lsx.S
|
loongarch64: Add and Refine asum optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
axpby_lasx.S
|
loongarch: Fixed {s/d/c/z}axpby LASX opt
|
2024-03-16 14:51:56 +08:00 |
|
axpby_lsx.S
|
loongarch: Fixed {s/d}axpby LSX opt
|
2024-03-18 17:51:42 +08:00 |
|
axpy_lasx.S
|
loongarch64: Refine and add axpy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
axpy_lsx.S
|
loongarch64: Refine and add axpy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
camax_lasx.S
|
loongarch: Fixed dzamax
|
2024-03-25 23:17:53 -04:00 |
|
camax_lsx.S
|
loongarch: Fixed dzamax
|
2024-03-25 23:17:53 -04:00 |
|
camin_lasx.S
|
loongarch: Fixed {s/d/sc/dz}amin LASX opt
|
2024-03-16 14:52:14 +08:00 |
|
camin_lsx.S
|
loongarch: Fixed {s/d/sc/dz}amin LSX opt
|
2024-03-19 09:18:44 +08:00 |
|
casum_lasx.S
|
loongarch64: Add and Refine asum optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
casum_lsx.S
|
loongarch64: Add and Refine asum optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
caxpby_lasx.S
|
loongarch: Fixed {s/d/c/z}axpby LASX opt
|
2024-03-16 14:51:56 +08:00 |
|
caxpby_lsx.S
|
LoongArch64: Opt {c/z}axpby
|
2024-02-04 11:23:31 +08:00 |
|
caxpy_lasx.S
|
loongarch64: Refine and add axpy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
caxpy_lsx.S
|
loongarch64: Refine and add axpy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
ccopy_lasx.S
|
loongarch64: Add c/zcopy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
ccopy_lsx.S
|
loongarch64: Add c/zcopy optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
cdot_lasx.S
|
loongarch64: Add c/zdot optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
cdot_lsx.S
|
loongarch64: Add c/zdot optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
cgemm_kernel_2x2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_kernel_2x2_lsx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_kernel_8x4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
cgemm_kernel_16x4_lasx.S
|
Optimized cgemm kernel 16x4 LASX for LoongArch
|
2024-02-21 15:28:47 +08:00 |
|
cgemm_ncopy_2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_ncopy_2_lsx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_ncopy_4_lasx.S
|
Optimized cgemm kernel 16x4 LASX for LoongArch
|
2024-02-21 15:28:47 +08:00 |
|
cgemm_ncopy_4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
cgemm_ncopy_8_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
cgemm_ncopy_16_lasx.S
|
Optimized cgemm kernel 16x4 LASX for LoongArch
|
2024-02-21 15:28:47 +08:00 |
|
cgemm_tcopy_2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_tcopy_2_lsx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
cgemm_tcopy_4_lasx.S
|
Optimized cgemm kernel 16x4 LASX for LoongArch
|
2024-02-21 15:28:47 +08:00 |
|
cgemm_tcopy_4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
cgemm_tcopy_8_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
cgemm_tcopy_16_lasx.S
|
Optimized cgemm kernel 16x4 LASX for LoongArch
|
2024-02-21 15:28:47 +08:00 |
|
cgemv_n_4_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
cgemv_n_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
cgemv_t_4_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
cgemv_t_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
cnrm2.S
|
Allow negative INCX (API change from version 3.10 of the reference implementation)
|
2023-08-10 16:50:44 +02:00 |
|
cnrm2_lasx.S
|
loongarch64: Add c/znrm2 optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
cnrm2_lsx.S
|
loongarch64: Add c/znrm2 optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
copy.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
copy_lasx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
copy_lsx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
crot_lasx.S
|
loongarch64: Add c/zrot optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
crot_lsx.S
|
loongarch: Fixed zrot LSX opt
|
2024-03-19 10:08:11 +08:00 |
|
cscal_lasx.S
|
LoongArch64: Handle NAN and INF
|
2024-01-30 17:17:30 +08:00 |
|
cscal_lsx.S
|
LoongArch64: Handle NAN and INF
|
2024-01-30 17:17:30 +08:00 |
|
csum_lasx.S
|
loongarch: Fixed {s/d/c/z}sum LASX opt
|
2024-03-16 14:52:10 +08:00 |
|
csum_lsx.S
|
loongarch64: Add {c/z}swap and {c/z}sum optimization
|
2023-12-29 17:30:57 +08:00 |
|
cswap_lasx.S
|
loongarch64: Add {c/z}swap and {c/z}sum optimization
|
2023-12-29 17:30:57 +08:00 |
|
cswap_lsx.S
|
loongarch64: Add {c/z}swap and {c/z}sum optimization
|
2023-12-29 17:30:57 +08:00 |
|
dgemm_kernel_8x4.S
|
Add dgemm_kernel_8x4.S file.
|
2023-12-29 17:30:57 +08:00 |
|
dgemm_kernel_16x4.S
|
LoongArch64: Update dgemm kernel
|
2023-08-07 11:06:52 +08:00 |
|
dgemm_kernel_16x6.S
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
dgemm_ncopy_4.S
|
loongarch64: Optimize dgemm_kernel
|
2021-12-21 09:33:06 +08:00 |
|
dgemm_ncopy_4_lsx.S
|
Optimize copy functions with lsx.
|
2023-12-29 17:30:57 +08:00 |
|
dgemm_ncopy_8_lsx.S
|
loongarch64: Fixed utest fork:safety
|
2024-05-10 14:16:36 +08:00 |
|
dgemm_ncopy_16.S
|
loongarch64: Fixed utest fork:safety
|
2024-05-10 14:16:36 +08:00 |
|
dgemm_small_kernel_nn_lasx.S
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
dgemm_small_kernel_nt_lasx.S
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
dgemm_small_kernel_tn_lasx.S
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
dgemm_small_kernel_tt_lasx.S
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
dgemm_small_matrix_permit.c
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
|
dgemm_tcopy_4.S
|
loongarch64: Optimize dgemm_kernel
|
2021-12-21 09:33:06 +08:00 |
|
dgemm_tcopy_4_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dgemm_tcopy_6.S
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
dgemm_tcopy_8_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dgemm_tcopy_16.S
|
loongarch64: Optimize dgemm_kernel
|
2021-12-21 09:33:06 +08:00 |
|
dgemv_n_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dgemv_n_lsx.S
|
Optimized sgemv and dgemv kernel LSX for LoongArch
|
2024-02-28 18:07:27 +08:00 |
|
dgemv_t_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dgemv_t_lsx.S
|
Optimized sgemv and dgemv kernel LSX for LoongArch
|
2024-02-28 18:07:27 +08:00 |
|
dnrm2.S
|
Allow negative INCX (API change from version 3.10 of the reference implementation)
|
2023-08-10 16:50:44 +02:00 |
|
dnrm2_lasx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
dnrm2_lsx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
dot.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
dot_lasx.S
|
remove stray comma
|
2024-04-23 21:52:26 +02:00 |
|
dot_lsx.S
|
remove stray comma
|
2024-05-09 12:33:19 +02:00 |
|
dscal_lasx.S
|
loongarch64: Add optimizations for scal.
|
2023-12-07 14:36:07 +08:00 |
|
dscal_lsx.S
|
loongarch64: Add optimizations for scal.
|
2023-12-07 14:36:07 +08:00 |
|
dsymv_L_lasx.S
|
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 16:18:39 +08:00 |
|
dsymv_L_lsx.S
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
|
dsymv_U_lasx.S
|
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 16:18:39 +08:00 |
|
dsymv_U_lsx.S
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
|
dtrsm_kernel_LN_16x4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dtrsm_kernel_LT_16x4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dtrsm_kernel_RN_16x4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dtrsm_kernel_RT_16x4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
dtrsm_kernel_macro.S
|
LoongArch64: Add dtrsm kernel
|
2023-09-26 15:45:14 +08:00 |
|
gemm_kernel.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
gemm_ncopy_6.prefx.c
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
gemv_n.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
gemv_t.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
iamax.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
iamax_lasx.S
|
loongarch64: Add and refine iamax optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
iamax_lsx.S
|
loongarch64: Add and refine iamax optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
iamin.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
iamin_lasx.S
|
loongarch: Fixed i{s/c/z}amin LASX opt
|
2024-03-16 14:52:06 +08:00 |
|
iamin_lsx.S
|
loongarch64: Refine iamin optimization.
|
2023-12-29 17:30:57 +08:00 |
|
icamax_lasx.S
|
loongarch64: Add and refine iamax optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
icamax_lsx.S
|
loongarch64: Fixed icamax_lsx
|
2024-05-10 14:16:40 +08:00 |
|
icamin_lasx.S
|
loongarch: Fixed i{s/c/z}amin LASX opt
|
2024-03-16 14:52:06 +08:00 |
|
icamin_lsx.S
|
loongarch: Fixed i{c/z}amin LSX opt
|
2024-03-18 17:15:58 +08:00 |
|
imax_lasx.S
|
loongarch64: Refine imax optimization.
|
2023-12-29 17:30:57 +08:00 |
|
imax_lsx.S
|
loongarch64: Refine imax optimization.
|
2023-12-29 17:30:57 +08:00 |
|
imin_lasx.S
|
loongarch64: Refine imin optimization.
|
2023-12-29 17:30:57 +08:00 |
|
imin_lsx.S
|
loongarch64: Refine imin optimization.
|
2023-12-29 17:30:57 +08:00 |
|
izamax.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
izamin.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
loongarch64_asm.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
max.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
max_lasx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
max_lsx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
min.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
min_lasx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
min_lsx.S
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
|
rot_lasx.S
|
LoongArch64: Fixed utest kernel_regress:skx_avx
|
2024-02-26 02:04:37 -05:00 |
|
rot_lsx.S
|
LoongArch64: Fixed utest kernel_regress:skx_avx
|
2024-02-26 02:04:37 -05:00 |
|
scal.S
|
LoongArch: Fixed issue 4728
|
2024-06-06 16:43:09 +08:00 |
|
scal_lasx.S
|
LoongArch: Fixed issue 4728
|
2024-06-06 16:43:09 +08:00 |
|
scal_lsx.S
|
LoongArch: Fixed issue 4728
|
2024-06-06 16:43:09 +08:00 |
|
sgemm_kernel_16x8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemm_ncopy_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemm_ncopy_16_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemm_tcopy_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemm_tcopy_16_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemv_n_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemv_n_lsx.S
|
Optimized sgemv and dgemv kernel LSX for LoongArch
|
2024-02-28 18:07:27 +08:00 |
|
sgemv_t_8_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
sgemv_t_lsx.S
|
Optimized sgemv and dgemv kernel LSX for LoongArch
|
2024-02-28 18:07:27 +08:00 |
|
snrm2.S
|
Allow negative INCX (API change from version 3.10 of the reference implementation)
|
2023-08-10 16:50:44 +02:00 |
|
snrm2_lasx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
snrm2_lsx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
ssymv_L_lasx.S
|
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 16:18:39 +08:00 |
|
ssymv_L_lsx.S
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
|
ssymv_U_lasx.S
|
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 16:18:39 +08:00 |
|
ssymv_U_lsx.S
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
|
sum_lasx.S
|
loongarch: Fixed {s/d/c/z}sum LASX opt
|
2024-03-16 14:52:10 +08:00 |
|
sum_lsx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
swap.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
swap_lasx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
swap_lsx.S
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
|
trsm_kernel_LN.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
trsm_kernel_LN_UNROLLN6.c
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
trsm_kernel_LT.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
trsm_kernel_LT_UNROLLN6.c
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
trsm_kernel_RN_UNROLLN6.c
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
trsm_kernel_RT.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
trsm_kernel_RT_UNROLLN6.c
|
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
|
2024-05-08 10:10:26 +08:00 |
|
zamax.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
zamin.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
zasum.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
zcopy.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
zdot.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
zgemm3m_kernel.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
zgemm_kernel.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
zgemm_kernel_2x2.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
zgemm_kernel_2x2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
zgemm_kernel_4x4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_kernel_8x4_lasx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_ncopy_2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
zgemm_ncopy_4_lasx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_ncopy_4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_ncopy_8_lasx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_tcopy_2_lasx.S
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
|
zgemm_tcopy_4_lasx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_tcopy_4_lsx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemm_tcopy_8_lasx.S
|
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
|
2024-02-06 11:49:01 +08:00 |
|
zgemv_n.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
zgemv_n_2_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
zgemv_n_4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
zgemv_t.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
zgemv_t_2_lsx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
zgemv_t_4_lasx.S
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
|
znrm2.S
|
Allow negative INCX (API change from version 3.10 of the reference implementation)
|
2023-08-10 16:50:44 +02:00 |
|
znrm2_lasx.S
|
loongarch64: Add c/znrm2 optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
znrm2_lsx.S
|
loongarch64: Add c/znrm2 optimization functions.
|
2023-12-29 17:30:57 +08:00 |
|
zscal.S
|
Delete the macro instruction "li" and use "li.d" instead
|
2021-08-12 17:02:54 +08:00 |
|
ztrsm_kernel_LT.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |
|
ztrsm_kernel_RT.S
|
Add support for LOONGARCH64
|
2021-07-27 15:29:12 +08:00 |