OpenBLAS/kernel/power
Rajalakshmi Srinivasaraghavan ad745c0bae Optimize scopy/ccopy for POWER10
This patch makes use of new POWER10 vector pair instructions for
loads and stores. Also reorganized all variants of copy functions
to make use of same kernel.
2020-10-21 09:53:45 -05:00
..
KERNEL Add proper defaults for IxMIN/IxMAX kernels 2020-02-21 11:55:52 +01:00
KERNEL.CELL Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER4 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER5 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER6 Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) 2018-04-23 19:05:49 +02:00
KERNEL.POWER8 Typo fix 2020-07-24 16:04:58 +00:00
KERNEL.POWER9 Fix cmake compilation issue - POWER9 2020-05-08 20:31:56 -05:00
KERNEL.POWER10 Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
KERNEL.PPC440 Fix endianness conditionals 2020-02-19 18:09:54 +01:00
KERNEL.PPC440FP2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PPC970 Fix endianness conditionals 2020-02-19 18:09:54 +01:00
KERNEL.PPCG4 Change complex DOT and ROT to generic kernels and switch CGEMM 2020-06-03 09:13:29 +02:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
axpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
casum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
casum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
caxpy.c caxpy and cdot are using vec_vsx_ld 2019-08-30 04:09:15 +00:00
caxpy_power8.S Update caxpy_power8.S 2020-02-13 22:44:09 +01:00
caxpy_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
ccopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ccopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ccopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
cdot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cdot_power9.S Fix SEGV in cdot_power9 2019-11-26 21:55:04 -07:00
cgemm_kernel_8x4_power8.S Update cgemm_kernel_8x4_power8.S 2020-06-30 15:16:39 +02:00
cgemm_kernel_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_kernel_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_logic_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_logic_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemm_macros_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_macros_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
cgemm_tcopy_logic_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cgemv_t.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cnrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_microk_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
crot.c Add ifdefs around call to altivec microkernel 2020-07-23 18:30:42 +00:00
cswap.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ctrmm_kernel_8x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:37:04 +02:00
ctrmm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
daxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
daxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
daxpy_microk_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
daxpy_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
dcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dcopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
ddot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ddot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
def_vsx.h added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
dgemm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:41:43 +02:00
dgemm_kernel_power9.S sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 2019-06-04 07:11:30 +00:00
dgemm_kernel_power10.c POWER10: Avoid setting accumulators to zero in gemm kernels 2020-08-28 10:42:54 -05:00
dgemm_logic_16x4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_logic_power9.S power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 2019-03-29 15:49:40 +00:00
dgemm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_macros_power9.S power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 2019-03-29 15:49:40 +00:00
dgemm_ncopy_4_power8.S dgemm_ncopy_4_ save/restore 2018-02-18 01:30:17 +00:00
dgemm_ncopy_logic_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_macros_4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_tcopy_16_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
dgemm_tcopy_logic_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_macros_16_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dgemv_n_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n_microk_power10.c dgemv optimization for POWER10 2020-07-29 18:59:32 -05:00
dgemv_n_power10.c dgemv optimization for POWER10 2020-07-29 18:59:32 -05:00
dgemv_t.c Fix compilation issues with clang on POWER 2020-07-27 14:11:07 -05:00
dgemv_t_power10.c dgemv optimization for POWER10 2020-07-29 18:59:32 -05:00
dnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
drot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
drot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dscal.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dswap.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrmm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:29:45 +02:00
dtrmm_logic_16x4_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
dtrmm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrsm_kernel_LT_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:31:01 +02:00
dtrsm_logic_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
dtrsm_macros_LT_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
exfunc.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power6.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_ncopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_hummer_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_n.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemv_t.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
icamax.c conflict resolve 2019-05-01 19:36:22 +00:00
icamax_power8.S Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 2019-09-22 22:35:22 +02:00
icamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
icamin.c conflict resolve 2019-05-01 19:36:22 +00:00
icamin_power8.S Update icamin_power8.S 2020-02-13 18:38:43 +01:00
icamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
idamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
idamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
imax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
isamax.c Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin}, 2019-01-16 15:16:21 +00:00
isamax_power8.S Update isamax_power8.S 2020-02-12 23:59:50 +01:00
isamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
isamin.c Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin}, 2019-01-16 15:16:21 +00:00
isamin_power8.S Update isamin_power8.S 2020-02-13 00:00:32 +01:00
isamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lock.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
max.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
saxpy.c Merge pull request #2721 from martin-frbg/p8align 2020-07-24 11:06:20 +02:00
sbgemm_kernel_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_ncopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_ncopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
scal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
scal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
scopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
scopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
scopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
sdot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_kernel_16x8_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
sgemm_kernel_power9.S sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 2019-06-04 07:11:30 +00:00
sgemm_kernel_power10.c POWER10: Avoid setting accumulators to zero in gemm kernels 2020-08-28 10:42:54 -05:00
sgemm_logic_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_logic_power9.S trmm fix 2019-09-29 02:28:34 +00:00
sgemm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_macros_power9.S new sgemm 8x16 2019-06-17 15:33:38 +00:00
sgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_16_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_logic_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_logic_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_tcopy_macros_16_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sgemv_n_8.c aligned 2019-08-29 23:22:23 +00:00
sgemv_t.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sgemv_t_8.c aligned 2019-08-29 23:22:23 +00:00
snrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
srot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
srot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sscal.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sswap.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strmm_kernel_16x8_power8.S Fix spurious trailing whitespace in comment 2020-10-19 09:12:12 +02:00
strmm_logic_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
swap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
swap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
symv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zasum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zasum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zaxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zaxpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zaxpy_microk_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zcopy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zcopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c Typo fix 2020-07-23 17:34:56 +00:00
zdot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zdot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power6.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power9.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_logic_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_logic_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
zgemm_logic_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemm_macros_power9.S power9 zgemm ztrmm optimized 2019-06-05 20:07:16 +00:00
zgemm_macros_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_ncopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
zgemm_tcopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_logic_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemv_n.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_n_4.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zgemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_t.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_t_4.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zgemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
znrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zrot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zscal.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zscal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zscal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
zswap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zswap.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zswap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zsymv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsymv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_logic_8x2_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
ztrmm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
ztrsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00