OpenBLAS/kernel/power
Ayappan Perumal 020cce1068 Fix build issues with gcc compiler as well 2024-10-23 04:24:06 -05:00
..
KERNEL Add proper defaults for IxMIN/IxMAX kernels 2020-02-21 11:55:52 +01:00
KERNEL.CELL Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER4 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER5 fix c/zrot and sgemv for POWER5 2021-06-10 11:11:56 +02:00
KERNEL.POWER6 Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) 2018-04-23 19:05:49 +02:00
KERNEL.POWER8 Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
KERNEL.POWER9 Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
KERNEL.POWER10 Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
KERNEL.PPC440 Fix inverted conditional for caxpy/zaxpy 2021-06-10 11:14:03 +02:00
KERNEL.PPC440FP2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PPC970 KERNEL.PPC970, KERNEL.PPCG4: unbreak CMake parsing 2023-12-14 12:00:11 +08:00
KERNEL.PPCG4 change sgemm kernel to 4x4 as the 16x4 altivec goes out of bounds 2024-10-03 22:04:20 +02:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
axpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
casum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
casum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
caxpy.c caxpy and cdot are using vec_vsx_ld 2019-08-30 04:09:15 +00:00
caxpy_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
caxpy_power8.S Update caxpy_power8.S 2020-02-13 22:44:09 +01:00
caxpy_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
caxpy_power10.c Optimize caxpy for POWER10 2020-10-29 14:57:51 -05:00
ccopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ccopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ccopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
cdot.c POWER: Fix complex dot function failures 2022-07-18 14:48:43 -05:00
cdot_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cdot_power9.S Fix SEGV in cdot_power9 2019-11-26 21:55:04 -07:00
cgemm_kernel_8x4_power8.S Update cgemm_kernel_8x4_power8.S 2020-06-30 15:16:39 +02:00
cgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_kernel_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cgemm_kernel_power10.c CGEMM & ZGEMM using C code. 2024-10-09 09:42:23 -05:00
cgemm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_logic_power10.S powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 2020-06-24 14:48:15 -05:00
cgemm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemm_macros_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
cgemm_tcopy_logic_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
cgemv_n.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cgemv_t.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
cnrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
cnrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
crot.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
cscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
cswap.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
cswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
cswap_microk_power10.c POWER: Fixing endianness issue in cswap/zswap kernel for AIX 2024-05-15 19:36:46 -05:00
ctrmm_kernel_8x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:37:04 +02:00
ctrmm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_macros_8x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
dasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dasum_microk_power10.c POWER10: Improving dasum performance 2021-08-10 22:06:04 -05:00
daxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
daxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
daxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
daxpy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dcopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
ddot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
ddot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
ddot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
ddot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
def_vsx.h added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
dgemm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:41:43 +02:00
dgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_kernel_power10.c POWER10: Improve dgemm performance 2021-04-13 22:30:06 -05:00
dgemm_logic_16x4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
dgemm_ncopy_4_power8.S dgemm_ncopy_4_ save/restore 2018-02-18 01:30:17 +00:00
dgemm_ncopy_8_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
dgemm_ncopy_logic_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_macros_4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemm_small_kernel_nn_power10.c Fix power10 gcc intrinsic check 2023-08-17 15:05:29 +01:00
dgemm_small_kernel_nt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
dgemm_small_kernel_tn_power10.c Fix power10 gcc intrinsic check 2023-08-17 15:05:29 +01:00
dgemm_small_kernel_tt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
dgemm_tcopy_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemm_tcopy_logic_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_macros_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemm_tcopy_macros_16_power10.S Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
dgemv_n.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
dgemv_n_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dgemv_n_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
dgemv_n_power10.c POWER10: Optimize dgemv_n 2020-11-29 15:28:28 -06:00
dgemv_t.c Fix compilation issues with clang on POWER 2020-07-27 14:11:07 -05:00
dgemv_t_power10.c POWER10: dgemv builtin rename 2022-07-18 09:48:01 -05:00
dnrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
dnrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
dot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
drot.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
drot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
drot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
dscal.c make NAN handling depend on DUMMY2 parameter 2024-07-21 13:42:47 +02:00
dscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dswap.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
dswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrmm_kernel_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:29:45 +02:00
dtrmm_logic_16x4_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
dtrmm_macros_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
dtrsm_kernel_LT_16x4_power8.S Clean up STACKSIZE redefinition 2020-10-18 19:31:01 +02:00
dtrsm_logic_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
dtrsm_macros_LT_16x4_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
exfunc.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_common.c Fix build issues with gcc compiler as well 2024-10-23 04:24:06 -05:00
gemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_kernel_power6.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
gemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
gemm_ncopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_small_kernel_permit_power10.c POWER10: Fix multithreading check when USE_THREAD=0 2022-06-25 03:46:46 -03:00
gemm_tcopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_hummer_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_n.S Add prefetch values for power3 2021-06-10 11:17:33 +02:00
gemv_n_ppc440.S fix undefined prefetchsize 2022-04-16 10:00:10 +02:00
gemv_t.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
gemv_t_ppc440.S fix undefined prefetchsizes 2022-04-16 10:04:27 +02:00
ger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
icamax.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
icamax_power8.S Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 2019-09-22 22:35:22 +02:00
icamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
icamin.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
icamin_power8.S Update icamin_power8.S 2020-02-13 18:38:43 +01:00
icamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
idamax.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
idamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
imax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
isamax.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
isamax_power8.S Update isamax_power8.S 2020-02-12 23:59:50 +01:00
isamax_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
isamin.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
isamin_power8.S Update isamin_power8.S 2020-02-13 00:00:32 +01:00
isamin_power9.S Fix DYNAMIC_ARCH build for POWER9 2020-03-03 12:35:10 -06:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
izamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lock.c Modernize obsolete inline order 2023-08-16 00:48:40 +02:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
max.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sasum.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
sasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sasum_microk_power10.c Optimize s/dasum function for POWER10 2021-03-05 16:22:36 -06:00
saxpy.c Merge pull request #2721 from martin-frbg/p8align 2020-07-24 11:06:20 +02:00
saxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
saxpy_power10.c Fix regression SAXPY when compiler with OpenXL compiler. 2024-05-12 23:27:55 -05:00
sbgemm_kernel_power10.c POWER10: Fix compiler warnings 2023-06-12 01:08:59 -04:00
sbgemm_ncopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_ncopy_16_power10.c Vectorize SBGEMM incopy - 4x faster. 2024-07-09 13:10:03 -05:00
sbgemm_tcopy_8_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemm_tcopy_16_power10.c POWER10: Change the packing format for bfloat16 2020-10-13 16:05:10 -05:00
sbgemv_common.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_common_power10.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_n.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_n_power10.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_n_vsx.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_t.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_t_power10.c Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 2024-10-13 13:46:11 -05:00
sbgemv_t_vsx.c Better VSX. 2024-10-17 08:25:02 -05:00
scal.S make NAN handling depend on dummy2 parameter 2024-07-17 23:29:50 +02:00
scal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_ppc440.S Handle inf/nan if dummy2 flag is set 2024-10-06 19:57:17 +02:00
scopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
scopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
scopy_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sdot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
sdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sdot_microk_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sdot_power10.c Optimize sdot/ddot for POWER10 2020-11-07 15:21:58 -06:00
sgemm_kernel_16x8_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
sgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_kernel_power10.c POWER10: Change dgemm unroll factors 2020-10-31 18:28:57 -05:00
sgemm_logic_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemm_ncopy_16_power.c Vectorize in-copy packing/copying for SGEMM - 4X faster. 2024-01-30 09:13:16 -06:00
sgemm_small_kernel_nn_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_nt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_tn_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_small_kernel_tt_power10.c [POWER] Add support for SMALL_MATRIX_OPT 2021-11-28 12:41:16 -06:00
sgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
sgemm_tcopy_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemm_tcopy_logic_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_logic_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sgemm_tcopy_macros_16_power8.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemm_tcopy_macros_16_power10.S Replace two vector loads with one vector pair load and fix endianess of stores. 2023-12-08 12:36:08 -06:00
sgemv_n.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemv_n_8.c change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
sgemv_t.c Vectorize reduction stage of sgemv_t. 2024-07-16 15:57:24 -05:00
sgemv_t_8.c Vectorize reduction stage of sgemv_t. 2024-07-16 15:57:24 -05:00
snrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
snrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
snrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
srot.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
srot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
srot_microk_power10.c Optimize s/drot function for POWER10 2021-01-21 13:24:45 -06:00
sscal.c make NAN handling depend on DUMMY2 parameter 2024-07-21 13:42:47 +02:00
sscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
sscal_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sswap.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
sswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strmm_kernel_16x8_power8.S Fix spurious trailing whitespace in comment 2020-10-19 09:12:12 +02:00
strmm_logic_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_macros_16x8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
sum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
swap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
swap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
swap_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
symv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
symv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LN_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
trsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_LT_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
trsm_kernel_RN_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_RT_power10.c Added special unrolled vectorized versions of "Solve" for specific sizes, 2020-12-04 17:07:06 -06:00
trsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
trsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zasum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zasum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zaxpy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zaxpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zaxpy_microk_power10.c POWER10: Changing store instructions for Level1 functions 2022-05-12 11:17:33 -05:00
zaxpy_power10.c Optimize daxpy/zaxpy for POWER10 2020-09-17 12:56:28 -05:00
zaxpy_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zcopy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zcopy_power10.c Optimize scopy/ccopy for POWER10 2020-10-21 09:53:45 -05:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c POWER: Fix complex dot function failures 2022-07-18 14:48:43 -05:00
zdot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zdot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_altivec_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_cell.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_g4.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_hummer.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power3.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_kernel_power6.S Try to fix LAPACK testing failures on P7. 2024-07-19 02:08:19 -04:00
zgemm_kernel_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_kernel_power10.S POWER10: Fix ZGEMM testcase failures 2022-06-17 08:18:08 -05:00
zgemm_kernel_power10.c CGEMM & ZGEMM using C code. 2024-10-09 09:42:23 -05:00
zgemm_kernel_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemm_logic_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_logic_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_logic_power10.S powerpc: Optimized ZGEMM kernel for POWER10 2020-06-24 14:50:12 -05:00
zgemm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemm_macros_power9.S change line endings from CRLF to LF 2022-11-16 22:24:01 +01:00
zgemm_macros_power10.S AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zgemm_ncopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_8_power8.S power8 ?gemm_tcopy save/restore 2018-02-16 23:36:46 +00:00
zgemm_tcopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_logic_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_macros_8_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
zgemv_n.S Add prefetch values for power3 2021-06-10 11:20:29 +02:00
zgemv_n_4.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
zgemv_n_power10.c Fix unused variable warnings on Power 2021-09-15 13:36:07 -05:00
zgemv_n_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zgemv_t.S Add prefetch values for power3 2021-06-10 11:19:40 +02:00
zgemv_t_4.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zgemv_t_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zger.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
znrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
znrm2_hummer.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
znrm2_ppc440.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2023-08-10 16:55:17 +02:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.c Exclude altivec code paths if the compiler does not support them 2020-07-23 17:08:20 +02:00
zrot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zscal.c POWER: Fix issues in zscal to address lapack failures 2024-05-22 08:00:06 -05:00
zscal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zscal_microk_power10.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zscal_ppc440.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsum.S Add POWER implementation of ?sum 2019-03-30 22:23:42 +01:00
zswap.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zswap.c AIX changes for P10 with GNU Compiler 2021-09-30 06:06:27 -05:00
zswap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_microk_power8.c AIX changes for Power8 2019-08-20 06:51:35 -05:00
zsymv_L.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
zsymv_U.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_kernel_8x2_power8.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrmm_logic_8x2_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
ztrmm_macros_8x2_power8.S AIX changes for Power8 2019-08-20 06:51:35 -05:00
ztrsm_kernel_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_cell_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_hummer_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_power6_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LN.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_LT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00
ztrsm_kernel_ppc440_RT.S Fix build on FreeBSD/powerpc64. 2019-06-25 10:58:56 +02:00