OpenBLAS/kernel/power
Martin Kroeker 9e2f316ede Power8 inline assembly fixes
Quoting patch author amodra from #1078:
Lots of issues here.
- The vsx regs weren't listed as clobbered.
- Poor choice of vsx regs, which along with the lack of clobbers led to
  trashing v0..v21 and fr14..fr23.  Ideally you'd let gcc choose all
  temp vsx regs, but asms currently have a limit of 30 i/o parms.
- Other regs were clobbered unnecessarily, seemingly in an attempt to
  clobber inputs, with gcc-7 complaining about the clobber of r2.
  (Changed inputs should be also listed as outputs or as an i/o.)
- "r" constraint used instead of "b" for gprs used in insns where the
  r0 encoding means zero rather than r0.
- There were unused asm inputs too.
- All memory was clobbered rather than hooking up memory outputs with
  proper memory constraints, and that and the lack of proper memory
  input constraints meant the asms needed to be volatile and their
  containing function noinline.
- Some parameters were being passed unnecessarily via memory.
- When a copy of a
2017-02-13 23:38:50 +01:00
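Several points in the commit message above (listing the VSX registers as clobbers, using "b" rather than "r" where an r0 encoding means zero, and describing memory through "m" constraints instead of a blanket "memory" clobber) can be illustrated with a minimal sketch. It assumes GCC-style extended asm. The helper name add4 and its fixed length of four doubles are hypothetical and are not taken from the OpenBLAS sources. Builds that do not target 64-bit POWER with VSX take the plain C path.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an OpenBLAS function): y[i] += x[i] for i = 0..3. */
static void add4(const double *x, double *y)
{
    assert(x != NULL && y != NULL);
#if defined(__powerpc64__) && defined(__VSX__)
    __asm__ (
        "lxvd2x   32, 0, %[xp]       \n\t"  /* RA field 0 is a literal zero   */
        "lxvd2x   33, %[off], %[xp]  \n\t"  /* RA = off, so off must not be r0 */
        "lxvd2x   34, 0, %[yp]       \n\t"
        "lxvd2x   35, %[off], %[yp]  \n\t"
        "xvadddp  34, 34, 32         \n\t"  /* y[0..1] += x[0..1] */
        "xvadddp  35, 35, 33         \n\t"  /* y[2..3] += x[2..3] */
        "stxvd2x  34, 0, %[yp]       \n\t"
        "stxvd2x  35, %[off], %[yp]  \n\t"
        /* The four doubles at y are read and written: an in/out memory operand. */
        : "+m" (*(double (*)[4]) y)
        /* The four doubles at x are only read: a memory input. */
        : "m" (*(const double (*)[4]) x),
          [xp] "r" (x),       /* RB field: any GPR is fine              */
          [yp] "r" (y),
          [off] "b" (16L)     /* RA field: "b" excludes r0              */
        /* Every VSX register the asm overwrites is listed as a clobber. */
        : "vs32", "vs33", "vs34", "vs35"
    );
#else
    /* Portable path with the same result on other targets. */
    for (size_t i = 0; i < 4; i++)
        y[i] += x[i];
#endif
}
```

Because the memory operands tell the compiler exactly which bytes are read and written, a sketch like this should not need volatile, a "memory" clobber, or a noinline wrapper. Calling add4(x, y) with x = {1, 2, 3, 4} and y = {10, 20, 30, 40} leaves y holding {11, 22, 33, 44}.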
KERNEL modified KERNEL for power, to use the generic DSDOT-KERNEL 2016-03-06 09:07:24 +01:00
KERNEL.CELL Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER4 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.POWER5 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER6 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.POWER8 Refs #946. Use nrm2 reference implementation for Power8. 2016-08-18 18:59:43 -07:00
KERNEL.PPC440 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.PPC440FP2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PPC970 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.PPCG4 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
axpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
casum.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
casum_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
ccopy.c added optimized ccopy kernel for POWER8 2016-03-25 16:54:25 +01:00
ccopy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
cgemm_kernel_8x4_power8.S Updated cgemm- and sgemm-kernel for POWER8 SMP 2016-04-07 15:08:15 +02:00
cgemm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_macros_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
cgemm_tcopy_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_logic_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cgemm_tcopy_macros_8_power8.S added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
cnrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cswap.c added optimized cswap and zswap kernels for POWER8 2016-03-27 18:31:37 +02:00
cswap_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
ctrmm_kernel_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_logic_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
ctrmm_macros_8x4_power8.S updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
dasum.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dasum_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
daxpy.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
daxpy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dcopy.c added optimized dcopy kernel for POWER8 2016-03-25 13:03:02 +01:00
dcopy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
ddot.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
ddot_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
def_vsx.h added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
dgemm_kernel_16x4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_logic_16x4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_macros_16x4_power8.S optimized dgemm for 20 threads 2016-05-16 14:14:25 +02:00
dgemm_ncopy_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_logic_4_power8.S optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
dgemm_ncopy_macros_4_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_logic_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemm_tcopy_macros_16_power8.S optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
dgemv_n.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dgemv_n_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dnrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dnrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
drot.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
drot_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dscal.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dscal_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dswap.c added optimized dswap kernel for POWER8 2016-03-25 14:35:43 +01:00
dswap_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
dtrmm_kernel_16x4_power8.S optimized dgemm for POWER8 2016-04-27 14:01:08 +02:00
dtrmm_logic_16x4_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
dtrmm_macros_16x4_power8.S optimized dgemm for POWER8 2016-04-27 14:01:08 +02:00
dtrsm_kernel_LT_16x4_power8.S optimized dtrsm_kernel_LT for POWER8 2016-05-22 15:20:04 +02:00
dtrsm_logic_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
dtrsm_macros_LT_16x4_power8.S optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
exfunc.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_beta.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_altivec.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_altivec_cell.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_altivec_g4.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_cell.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_g4.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_power3.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_power6.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_kernel_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemm_ncopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_4.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
gemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_hummer_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_hummer_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemv_n.S enabled gemv assembly on power8 2016-03-04 12:53:31 +01:00
gemv_n_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
gemv_t.S enabled gemv assembly on power8 2016-03-04 12:53:31 +01:00
gemv_t_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ger.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
imin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lock.c Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
max.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
max_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
min_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sasum.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sasum_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
scal.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scopy.c added optimized scopy kernel for POWER8 2016-03-25 16:06:56 +01:00
scopy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sdot.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sdot_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sgemm_kernel_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_logic_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_macros_16x8_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_logic_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_logic_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
sgemm_tcopy_macros_8_power8.S added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
sgemm_tcopy_macros_16_power8.S Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
snrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
snrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
srot.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
srot_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sscal.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sscal_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
sswap.c added optimized sswap kernel for POWER8 2016-03-25 17:34:55 +01:00
sswap_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strmm_kernel_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_logic_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
strmm_macros_16x8_power8.S updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
swap.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
swap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_L.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
symv_U.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
trsm_kernel_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_cell_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_cell_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_cell_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_hummer_LN.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_hummer_LT.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_hummer_RT.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_power6_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_power6_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_power6_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_ppc440_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_ppc440_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
trsm_kernel_ppc440_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamin_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zasum_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zasum_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zaxpy.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zaxpy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zaxpy_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.c added optimized zswap kernel for POWER8 2016-03-25 15:27:34 +01:00
zcopy_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zdot_cell.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zdot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_kernel_altivec.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_altivec_cell.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_altivec_g4.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_cell.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_g4.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_power3.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_power6.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_kernel_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemm_logic_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_macros_8x2_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_ncopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_hummer_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_hummer_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_logic_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemm_tcopy_macros_8_power8.S Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
zgemv_n.S enabled gemv assembly on power8 2016-03-04 12:53:31 +01:00
zgemv_n_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zgemv_t.S enabled gemv assembly on power8 2016-03-04 12:53:31 +01:00
zgemv_t_ppc440.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zger.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
znrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zscal_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zscal_ppc440.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
zswap.c added optimized cswap and zswap kernels for POWER8 2016-03-27 18:31:37 +02:00
zswap_hummer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_microk_power8.c Power8 inline assembly fixes 2017-02-13 23:38:50 +01:00
zsymv_L.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
zsymv_U.S added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 2016-03-01 07:33:56 +01:00
ztrmm_kernel_8x2_power8.S updated zgemm- and ztrmm-kernel for POWER8 2016-04-08 09:05:37 +02:00
ztrmm_logic_8x2_power8.S Modified assembly label name, so that they are hidden. 2016-03-05 10:27:27 +01:00
ztrmm_macros_8x2_power8.S updated zgemm- and ztrmm-kernel for POWER8 2016-04-08 09:05:37 +02:00
ztrsm_kernel_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_cell_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_cell_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_cell_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_hummer_LN.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_hummer_LT.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_hummer_RT.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_power6_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_power6_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_power6_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_ppc440_LN.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_ppc440_LT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00
ztrsm_kernel_ppc440_RT.S ppc64le platform support (ELF ABI v2) 2015-07-21 22:20:19 -05:00