OpenBLAS/kernel/power
Hong Bo Peng db98f8753f Try to fix LAPACK testing failures on P7.
1. Remove the FADD insn from the GEMV Transpose code.
  2. Remove the FADD insn from GEMM and ZGEMM code.
  3. Reorder the compution of the Imaginary part in ZGEMM code.
2024-07-19 02:08:19 -04:00
..
KERNEL Add proper defaults for IxMIN/IxMAX kernels 2020-02-21 11:55:52 +01:00
KERNEL.CELL Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER4 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER5 fix c/zrot and sgemv for POWER5 2021-06-10 11:11:56 +02:00
KERNEL.POWER6 Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) 2018-04-23 19:05:49 +02:00
KERNEL.POWER8 Only vectorize 64-bit version for Power8. 2024-02-01 08:11:43 -06:00
KERNEL.POWER9 Vectorize in-copy packing/copying for SGEMM - 4X faster. 2024-01-30 09:13:16 -06:00
KERNEL.POWER10 Revert "Merge pull request #4532 from austinpagan/cgemm_zgemm_c_code" 2024-03-01 07:57:39 -06:00
KERNEL.PPC440 Fix inverted conditional for caxpy/zaxpy 2021-06-10 11:14:03 +02:00
KERNEL.PPC440FP2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PPC970 KERNEL.PPC970, KERNEL.PPCG4: unbreak CMake parsing 2023-12-14 12:00:11 +08:00
KERNEL.PPCG4 KERNEL.PPC970, KERNEL.PPCG4: unbreak CMake parsing 2023-12-14 12:00:11 +08:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
axpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
casum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
casum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
caxpy.c caxpy and cdot are using vec_vsx_ld 2019-08-30 04:09:15 +00:00
caxpy_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
caxpy_power8.S Update caxpy_power8.S 2020-02-13 22:44:09 +01:00
caxpy_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
caxpy_power10.c Optimize caxpy for POWER10 2020-10-29 14:57:51 -05:00
ccopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ccopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ccopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
cdot.c POWER: Fix complex dot function failures 2022-07-18 14:48:43 -05:00
cdot_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cdot_power9.S Fix SEGV in cdot_power9 2019-11-26 21:55:04 -07:00
cgemm_kernel_8x4_power8.S Update cgemm_kernel_8x4_power8.S 2020-06-30 15:16:39 +02:00
cgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_kernel_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cgemm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_logic_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_macros_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
cgemm_tcopy_logic_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemv_n.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemv_t.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
cnrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
cnrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
crot.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
cswap.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
cswap_microk_power10.c POWER: Fixing endianness issue in cswap/zswap kernel for AIX 2024-05-15 19:36:46 -05:00
ctrmm_kernel_8x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:37:04 +02:00
ctrmm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
dasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum_microk_power10.c POWER10: Improving dasum performance 2021-08-10 22:06:04 -05:00
daxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
daxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
daxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
daxpy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dcopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
ddot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ddot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ddot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
ddot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
def_vsx.h added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
dgemm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:41:43 +02:00
dgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_kernel_power10.c POWER10: Improve dgemm performance 2021-04-13 22:30:06 -05:00
dgemm_logic_16x4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_ncopy_4_power8.S dgemm_ncopy_4_ save/restore 2018-02-18 01:30:17 +00:00
dgemm_ncopy_8_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
dgemm_ncopy_logic_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_macros_4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_small_kernel_nn_power10.c Fix power10 gcc intrinsic check 2023-08-17 15:05:29 +01:00
dgemm_small_kernel_nt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
dgemm_small_kernel_tn_power10.c Fix power10 gcc intrinsic check 2023-08-17 15:05:29 +01:00
dgemm_small_kernel_tt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
dgemm_tcopy_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemm_tcopy_logic_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_macros_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemm_tcopy_macros_16_power10.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dgemv_n_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
dgemv_n_power10.c POWER10: Optimize dgemv_n 2020-11-29 15:28:28 -06:00
dgemv_t.c Fix compilation issues with clang on POWER 2020-07-27 14:11:07 -05:00
dgemv_t_power10.c POWER10: dgemv builtin rename 2022-07-18 09:48:01 -05:00
dnrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
dnrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
dot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
drot.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
drot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
drot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
dscal.c handle INF and NAN in input 2024-06-22 15:55:29 +02:00
dscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dswap.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrmm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:29:45 +02:00
dtrmm_logic_16x4_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
dtrmm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrsm_kernel_LT_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:31:01 +02:00
dtrsm_logic_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
dtrsm_macros_LT_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
exfunc.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power6.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
gemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_ncopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_small_kernel_permit_power10.c POWER10: Fix multithreading check when USE_THREAD=0 2022-06-25 03:46:46 -03:00
gemm_tcopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_hummer_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_n.S Add prefetch values for power3 2021-06-10 11:17:33 +02:00
gemv_n_ppc440.S fix undefined prefetchsize 2022-04-16 10:00:10 +02:00
gemv_t.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
gemv_t_ppc440.S fix undefined prefetchsizes 2022-04-16 10:04:27 +02:00
ger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
icamax.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
icamax_power8.S Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 2019-09-22 22:35:22 +02:00
icamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
icamin.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
icamin_power8.S Update icamin_power8.S 2020-02-13 18:38:43 +01:00
icamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
idamax.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
idamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
imax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
isamax.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
isamax_power8.S Update isamax_power8.S 2020-02-12 23:59:50 +01:00
isamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
isamin.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
isamin_power8.S Update isamin_power8.S 2020-02-13 00:00:32 +01:00
isamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lock.c Modernize obsolete inline order 2023-08-16 00:48:40 +02:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
max.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sasum.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
sasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sasum_microk_power10.c Optimize s/dasum function for POWER10 2021-03-05 16:22:36 -06:00
saxpy.c Merge pull request #2721 from martin-frbg/p8align 2020-07-24 11:06:20 +02:00
saxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
saxpy_power10.c Fix regression SAXPY when compiler with OpenXL compiler. 2024-05-12 23:27:55 -05:00
sbgemm_kernel_power10.c POWER10: Fix compiler warnings 2023-06-12 01:08:59 -04:00
sbgemm_ncopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_ncopy_16_power10.c Vectorize SBGEMM incopy - 4x faster. 2024-07-09 13:10:03 -05:00
sbgemm_tcopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
scal.S temporarily(?) disable the alpha=0 branch as it does not handle INF,NAN 2024-06-27 16:21:57 +02:00
scal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
scopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
scopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
scopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sdot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sdot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sdot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sgemm_kernel_16x8_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
sgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_kernel_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
sgemm_logic_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_ncopy_16_power.c Vectorize in-copy packing/copying for SGEMM - 4X faster. 2024-01-30 09:13:16 -06:00
sgemm_small_kernel_nn_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_nt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_tn_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_tt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemm_tcopy_logic_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_logic_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_tcopy_macros_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemm_tcopy_macros_16_power10.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemv_n.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemv_n_8.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemv_t.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemv_t_8.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
snrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
snrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
snrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
srot.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
srot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
srot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
sscal.c handle INF and NAN in input 2024-06-22 16:03:30 +02:00
sscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sswap.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strmm_kernel_16x8_power8.S Fix spurious trailing whitespace in comment 2020-10-19 09:12:12 +02:00
strmm_logic_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
swap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
swap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
swap_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
symv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
symv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
trsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LT_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
trsm_kernel_RN_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_RT_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zasum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zasum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zaxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zaxpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zaxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
zaxpy_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zcopy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zcopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c POWER: Fix complex dot function failures 2022-07-18 14:48:43 -05:00
zdot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zdot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power6.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
zgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_kernel_power10.S POWER10: Fix ZGEMM testcase failures 2022-06-17 08:18:08 -05:00
zgemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_logic_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_logic_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_macros_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zgemm_ncopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
zgemm_tcopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_logic_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemv_n.S Add prefetch values for power3 2021-06-10 11:20:29 +02:00
zgemv_n_4.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
zgemv_n_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
zgemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_t.S Add prefetch values for power3 2021-06-10 11:19:40 +02:00
zgemv_t_4.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zgemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
znrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
znrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
znrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zrot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zscal.c POWER: Fix issues in zscal to address lapack failures 2024-05-22 08:00:06 -05:00
zscal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zscal_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zscal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
zswap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zswap.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zswap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zsymv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsymv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_logic_8x2_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
ztrmm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
ztrsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00