OpenBLAS/kernel/power
Rajalakshmi Srinivasaraghavan b06880c2cd POWER10: Improving dasum performance
Unrolling a loop in dasum micro code to help in improving
POWER10 performance.
2021-08-10 22:06:04 -05:00
KERNEL Add proper defaults for IxMIN/IxMAX kernels 2020-02-21 11:55:52 +01:00
KERNEL.CELL Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER4 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER5 fix c/zrot and sgemv for POWER5 2021-06-10 11:11:56 +02:00
KERNEL.POWER6 Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) 2018-04-23 19:05:49 +02:00
KERNEL.POWER8 Add workaround for LAPACK test failures with the NVIDIA HPC compiler 2021-03-19 11:44:31 +01:00
KERNEL.POWER9 Add workaround for LAPACK test failures with the NVIDIA HPC compiler 2021-03-19 11:46:25 +01:00
KERNEL.POWER10 POWER10: Optimized zgemv 2021-04-10 19:00:24 -05:00
KERNEL.PPC440 Fix inverted conditional for caxpy/zaxpy 2021-06-10 11:14:03 +02:00
KERNEL.PPC440FP2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PPC970 Fix endianness conditionals 2020-02-19 18:09:54 +01:00
KERNEL.PPCG4 Fix caxpy/zaxpy for big-endian 2021-06-10 11:15:48 +02:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
axpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
casum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
casum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
caxpy.c caxpy and cdot are using vec_vsx_ld 2019-08-30 04:09:15 +00:00
caxpy_microk_power10.c POWER10: Improve axpy performance 2020-12-10 11:51:42 -06:00
caxpy_power8.S Update caxpy_power8.S 2020-02-13 22:44:09 +01:00
caxpy_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
caxpy_power10.c Optimize caxpy for POWER10 2020-10-29 14:57:51 -05:00
ccopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ccopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ccopy_microk_power10.c POWER10: Improve copy performance 2020-12-13 10:41:45 -06:00
ccopy_power10.c POWER10: Improve copy performance 2020-12-13 10:41:45 -06:00
cdot.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
cdot_microk_power10.c Optimize cdot function for POWER10 2021-01-15 13:40:34 -06:00
cdot_power9.S Fix SEGV in cdot_power9 2019-11-26 21:55:04 -07:00
cgemm_kernel_8x4_power8.S Update cgemm_kernel_8x4_power8.S 2020-06-30 15:16:39 +02:00
cgemm_kernel_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_kernel_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_logic_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_logic_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemm_macros_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
cgemm_macros_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
cgemm_tcopy_logic_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cgemv_t.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
cnrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_microk_power10.c POWER10: Improve copy performance 2020-12-13 10:41:45 -06:00
crot.c Add ifdefs around call to altivec microkernel 2020-07-23 18:30:42 +00:00
cscal_microk_power10.c Optimize cscal function for POWER10 2021-01-29 13:51:43 -06:00
cswap.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
cswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
cswap_microk_power10.c Optimize swap function for POWER10 2021-01-08 08:01:36 -06:00
ctrmm_kernel_8x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:37:04 +02:00
ctrmm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum.c POWER10: Improving dasum performance 2021-08-10 22:06:04 -05:00
dasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum_microk_power10.c POWER10: Improving dasum performance 2021-08-10 22:06:04 -05:00
daxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
daxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
daxpy_microk_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
daxpy_power10.c POWER10: Improve axpy performance 2020-12-10 11:51:42 -06:00
dcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dcopy_power10.c POWER10: Improve copy performance 2020-12-13 10:41:45 -06:00
ddot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ddot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ddot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
ddot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
def_vsx.h added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
dgemm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:41:43 +02:00
dgemm_kernel_power9.S sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 2019-06-04 07:11:30 +00:00
dgemm_kernel_power10.c POWER10: Improve dgemm performance 2021-04-13 22:30:06 -05:00
dgemm_logic_16x4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_logic_power9.S power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 2019-03-29 15:49:40 +00:00
dgemm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_macros_power9.S power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 2019-03-29 15:49:40 +00:00
dgemm_ncopy_4_power8.S dgemm_ncopy_4_ save/restore 2018-02-18 01:30:17 +00:00
dgemm_ncopy_8_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
dgemm_ncopy_logic_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_macros_4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_tcopy_16_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
dgemm_tcopy_logic_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_macros_16_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dgemv_n_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n_microk_power10.c POWER10: Optimize dgemv_n 2020-11-29 15:28:28 -06:00
dgemv_n_power10.c POWER10: Optimize dgemv_n 2020-11-29 15:28:28 -06:00
dgemv_t.c Fix compilation issues with clang on POWER 2020-07-27 14:11:07 -05:00
dgemv_t_power10.c dgemv optimization for POWER10 2020-07-29 18:59:32 -05:00
dnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
drot.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
drot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
drot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
dscal.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
dscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dscal_microk_power10.c Optimize s/dscal function for POWER10 2021-01-24 07:48:28 -06:00
dswap.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
dswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrmm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:29:45 +02:00
dtrmm_logic_16x4_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
dtrmm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrsm_kernel_LT_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:31:01 +02:00
dtrsm_logic_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
dtrsm_macros_LT_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
exfunc.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power6.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_ncopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_hummer_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_n.S Add prefetch values for power3 2021-06-10 11:17:33 +02:00
gemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemv_t.S Add prefetch values for power3 2021-06-10 11:18:22 +02:00
gemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
icamax.c conflict resolve 2019-05-01 19:36:22 +00:00
icamax_power8.S Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 2019-09-22 22:35:22 +02:00
icamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
icamin.c conflict resolve 2019-05-01 19:36:22 +00:00
icamin_power8.S Update icamin_power8.S 2020-02-13 18:38:43 +01:00
icamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
idamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
idamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
imax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
isamax.c Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin}, 2019-01-16 15:16:21 +00:00
isamax_power8.S Update isamax_power8.S 2020-02-12 23:59:50 +01:00
isamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
isamin.c Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin}, 2019-01-16 15:16:21 +00:00
isamin_power8.S Update isamin_power8.S 2020-02-13 00:00:32 +01:00
isamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lock.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
max.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sasum.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
sasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sasum_microk_power10.c Optimize s/dasum function for POWER10 2021-03-05 16:22:36 -06:00
saxpy.c Merge pull request #2721 from martin-frbg/p8align 2020-07-24 11:06:20 +02:00
saxpy_microk_power10.c Optimize saxpy for POWER10 2020-10-26 13:24:59 -05:00
saxpy_power10.c POWER10: Improve axpy performance 2020-12-10 11:51:42 -06:00
sbgemm_kernel_power10.c Power10: Fix for SBGEMM 2021-06-15 13:07:47 -05:00
sbgemm_ncopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_ncopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
scal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
scal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
scopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
scopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
scopy_power10.c POWER10: Improve copy performance 2020-12-13 10:41:45 -06:00
sdot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sdot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sdot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sgemm_kernel_16x8_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
sgemm_kernel_power9.S sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 2019-06-04 07:11:30 +00:00
sgemm_kernel_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
sgemm_logic_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_logic_power9.S trmm fix 2019-09-29 02:28:34 +00:00
sgemm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_macros_power9.S new sgemm 8x16 2019-06-17 15:33:38 +00:00
sgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_16_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_logic_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_logic_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_tcopy_macros_16_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sgemv_n_8.c aligned 2019-08-29 23:22:23 +00:00
sgemv_t.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sgemv_t_8.c aligned 2019-08-29 23:22:23 +00:00
snrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
srot.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
srot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
srot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
sscal.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
sscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sscal_microk_power10.c Optimize s/dscal function for POWER10 2021-01-24 07:48:28 -06:00
sswap.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
sswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strmm_kernel_16x8_power8.S Fix spurious trailing whitespace in comment 2020-10-19 09:12:12 +02:00
strmm_logic_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
swap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
swap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
swap_microk_power10.c Optimize swap function for POWER10 2021-01-08 08:01:36 -06:00
symv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
symv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LT_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_RN_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_RT_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zasum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zasum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zaxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zaxpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zaxpy_microk_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zcopy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zcopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c Typo fix 2020-07-23 17:34:56 +00:00
zdot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zdot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power6.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power9.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_logic_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_logic_power9.S cgemm/ctrmm power9 2019-07-01 14:07:54 +00:00
zgemm_logic_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemm_macros_power9.S power9 zgemm ztrmm optimized 2019-06-05 20:07:16 +00:00
zgemm_macros_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_ncopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
zgemm_tcopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_logic_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemv_n.S Add prefetch values for power3 2021-06-10 11:20:29 +02:00
zgemv_n_4.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zgemv_n_power10.c POWER10: Optimized zgemv 2021-04-10 19:00:24 -05:00
zgemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_t.S Add prefetch values for power3 2021-06-10 11:19:40 +02:00
zgemv_t_4.c POWER10: Optimized zgemv 2021-04-10 19:00:24 -05:00
zgemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
znrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zrot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zscal.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
zscal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zscal_microk_power10.c Optimize zscal function for POWER10 2021-03-10 17:15:33 -06:00
zscal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
zswap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zswap.c POWER10: Adding check for little endian 2021-03-31 21:32:42 -05:00
zswap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zsymv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsymv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_logic_8x2_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
ztrmm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
ztrsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00