OpenBLAS/kernel/x86_64
Werner Saar c8f2c5d636 added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
..
KERNEL Merge branch 'develop' into cmake 2015-08-03 23:59:01 -05:00
KERNEL.ATOM Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.BARCELONA segment violation in sgemv kernels 2014-07-13 10:46:14 +02:00
KERNEL.BOBCAT Fixed #395. Enable optimized cgemm for Sandybridge. Added optimized sdot kernel. 2014-06-29 10:34:51 +08:00
KERNEL.BULLDOZER added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
KERNEL.CORE2 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.DUNNINGTON Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.EXCAVATOR Add AMD Excavator target. 2015-05-13 16:16:30 -05:00
KERNEL.HASWELL added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
KERNEL.NANO Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.NEHALEM optimized dgemv_n kernel for small sizes 2014-09-08 15:22:35 +02:00
KERNEL.OPTERON Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.OPTERON_SSE3 Fixed #395. Enable optimized cgemm for Sandybridge. Added optimized sdot kernel. 2014-06-29 10:34:51 +08:00
KERNEL.PENRYN Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
KERNEL.PILEDRIVER added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
KERNEL.PRESCOTT fallback to zgemm_kernel_4x2_sse.S 2014-07-06 11:05:28 +02:00
KERNEL.SANDYBRIDGE added optimized cscal kernel for sandybridge 2015-05-18 08:46:06 +02:00
KERNEL.STEAMROLLER added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
KERNEL.generic Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit. 2015-08-10 14:10:44 -05:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
amax_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
asum_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
builtin_stinit.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cabs.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
caxpy.c bugfixes: replaced int with BLASLONG 2015-04-24 14:30:44 +02:00
caxpy_microk_bulldozer-2.c updated caxpy_microk_bulldozer-2.c and caxpy.c 2015-04-15 11:59:38 +02:00
caxpy_microk_haswell-2.c added optimized caxpy-kernel for haswell 2015-04-15 15:16:31 +02:00
caxpy_microk_sandy-2.c added optimized caxpy-kernel for sandybridge 2015-04-15 16:29:25 +02:00
caxpy_microk_steamroller-2.c added optimized caxpy-kernel for steamroller 2015-04-15 13:49:23 +02:00
cdot.c bugfixes: replaced int with BLASLONG 2015-04-24 14:30:44 +02:00
cdot_microk_bulldozer-2.c added optimized cdot- and zdot-kernels for bulldozer 2015-04-08 16:29:55 +02:00
cdot_microk_haswell-2.c add optimized cdot- and zdot-kernel for sandybridge 2015-04-10 09:37:26 +02:00
cdot_microk_sandy-2.c add optimized cdot- and zdot-kernel for sandybridge 2015-04-10 09:37:26 +02:00
cdot_microk_steamroller-2.c added optimized cdot- and zdot-kernel for steamroller 2015-04-09 09:45:23 +02:00
cgemm_kernel_4x2_bulldozer.S bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel 2014-06-28 12:16:20 +02:00
cgemm_kernel_4x2_piledriver.S bugfix for piledriver cgemm-, zgemm- and zgemv-kernel 2014-06-28 11:46:58 +02:00
cgemm_kernel_4x8_sandy.S Update organization info. 2014-11-25 15:28:58 +08:00
cgemm_kernel_8x2_haswell.S modification for clang compiler 2014-08-27 09:00:20 +02:00
cgemm_kernel_8x2_sandy.S optimization of sandybridge cgemm-kernel 2014-07-29 19:07:21 +02:00
cgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cgemv_n_4.c more optimizations 2014-09-11 10:25:48 +02:00
cgemv_n_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
cgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
cgemv_t_4.c added optimized cgemv_t kernel for haswell 2014-09-13 15:14:12 +02:00
cgemv_t_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
copy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
copy_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cscal.c added optimized cscal and zscal kernels for steamroller 2015-05-18 12:40:07 +02:00
cscal_microk_bulldozer-2.c added optimized cscal kernel for bulldozer 2015-05-18 07:33:52 +02:00
cscal_microk_haswell-2.c added optimized cscal kernel for haswell 2015-05-17 13:44:09 +02:00
cscal_microk_steamroller-2.c added optimized cscal and zscal kernels for steamroller 2015-05-18 12:40:07 +02:00
ctrsm_kernel_LN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ctrsm_kernel_LT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ctrsm_kernel_RN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ctrsm_kernel_RT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
daxpy.c optimized saxpy and daxpy for sandybridge 2015-04-28 10:18:32 +02:00
daxpy_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
daxpy_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
daxpy_microk_haswell-2.c added optimized saxpy- and daxpy-kernel for haswell 2015-04-06 12:33:16 +02:00
daxpy_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
daxpy_microk_piledriver-2.c add optimized daxpy-kernel for piledriver 2015-04-14 14:23:29 +02:00
daxpy_microk_sandy-2.c optimized saxpy and daxpy for sandybridge 2015-04-28 10:18:32 +02:00
daxpy_microk_steamroller-2.c add optimized daxpy-kernel for steamroller 2015-04-13 12:22:43 +02:00
dcopy_bulldozer.S added dcopy_bulldozer.S 2013-06-21 16:06:51 +02:00
ddot.c bugfixes: replaced int with BLASLONG 2015-04-24 14:30:44 +02:00
ddot_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ddot_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ddot_microk_haswell-2.c added optimized sdot- and ddot-kernel for HASWELL 2015-04-05 17:57:53 +02:00
ddot_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ddot_microk_piledriver-2.c add optimized ddot-kernel for piledriver 2015-04-14 15:09:13 +02:00
ddot_microk_sandy-2.c added optimized ddot-kernel for sandybridge 2015-04-05 20:19:38 +02:00
ddot_microk_steamroller-2.c added optimized ddot kernel for steamroller 2015-04-10 16:18:03 +02:00
dgemm_kernel_4x4_haswell.S small optimization on dgemm_kernel for N=1 2014-12-18 20:35:51 +01:00
dgemm_kernel_4x8_haswell.S modified haswell parameter dgemm_unroll_n 2015-06-13 10:28:27 +02:00
dgemm_kernel_4x8_sandy.S Update organization info. 2014-11-25 15:28:58 +08:00
dgemm_kernel_6x4_piledriver.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_kernel_8x2_bulldozer.S Ref #380: lowered stack usage for piledriver and bulldozer kernels 2014-06-19 14:02:14 +02:00
dgemm_kernel_8x2_piledriver.S Ref #380: lowered stack usage for piledriver and bulldozer kernels 2014-06-19 14:02:14 +02:00
dgemm_kernel_16x2_haswell.S Refs #330. Fixed the compatible issue with clang on Mac OSX. 2013-12-16 20:31:17 +08:00
dgemm_ncopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_ncopy_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_ncopy_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_ncopy_8_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_tcopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_tcopy_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_tcopy_8.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemm_tcopy_8_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_n_4.c optimized dgemv_n kernel for haswell 2015-04-30 12:11:39 +02:00
dgemv_n_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_n_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_n_microk_haswell-4.c optimized dgemv_n kernel for haswell 2015-04-30 12:11:39 +02:00
dgemv_n_microk_nehalem-4.c updated dgemv_n kernel for nehalem 2015-04-30 14:38:06 +02:00
dgemv_n_microk_piledriver-4.c added optimizations for steamroller 2014-12-30 20:14:45 +08:00
dgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_t_4.c Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t 2015-04-15 11:46:48 +02:00
dgemv_t_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_t_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dgemv_t_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
dger.c optimized dger kernel for sandybridge 2015-04-28 16:58:11 +02:00
dger_microk_sandy-2.c optimized dger kernel for sandybridge 2015-04-28 16:58:11 +02:00
dot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dot_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dscal.c bugfix: added static to functions 2015-05-13 13:31:26 +02:00
dscal_microk_bulldozer-2.c added optimized dscal kernel for bulldozer 2015-05-12 12:28:44 +02:00
dscal_microk_haswell-2.c added optimized dscal kernel for haswell 2015-05-12 17:19:58 +02:00
dscal_microk_sandy-2.c added optimized dscal kernel for sandybridge 2015-05-12 16:27:43 +02:00
dsymv_L.c added optimized dsymv kernels for sandybridge 2015-04-22 12:09:43 +02:00
dsymv_L_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
dsymv_L_microk_haswell-2.c added optimized dsymv kernels for haswell 2015-04-22 10:42:50 +02:00
dsymv_L_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
dsymv_L_microk_sandy-2.c added optimized dsymv kernels for sandybridge 2015-04-22 12:09:43 +02:00
dsymv_U.c added optimized dsymv kernels for sandybridge 2015-04-22 12:09:43 +02:00
dsymv_U_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
dsymv_U_microk_haswell-2.c added optimized dsymv kernels for haswell 2015-04-22 10:42:50 +02:00
dsymv_U_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
dsymv_U_microk_sandy-2.c added optimized dsymv kernels for sandybridge 2015-04-22 12:09:43 +02:00
dtrmm_kernel_4x8_haswell.c Fix DYNAMIC_ARCH=1 bug. 2015-10-27 05:10:40 +08:00
dtrsm_kernel_LN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
dtrsm_kernel_LT_8x2_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dtrsm_kernel_RN_8x2_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dtrsm_kernel_RN_haswell.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
dtrsm_kernel_RT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
gemm_beta.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_2x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x2_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x4_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x8_nano.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_4x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_8x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_8x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_8x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_8x4_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_kernel_8x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_2_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_ncopy_4_opteron.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_2_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_4.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gemm_tcopy_4_opteron.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
iamax_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
izamax_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
mcount.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
nrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qconjg.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qgemm_kernel_2x2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qtrsm_kernel_LN_2x2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qtrsm_kernel_LT_2x2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
qtrsm_kernel_RT_2x2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rot_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
saxpy.c optimized saxpy and daxpy for sandybridge 2015-04-28 10:18:32 +02:00
saxpy_microk_haswell-2.c added optimized saxpy- and daxpy-kernel for haswell 2015-04-06 12:33:16 +02:00
saxpy_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
saxpy_microk_piledriver-2.c optimized saxpy for piledriver 2015-04-14 08:34:11 +02:00
saxpy_microk_sandy-2.c optimized saxpy and daxpy for sandybridge 2015-04-28 10:18:32 +02:00
scal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
scal_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
scal_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sdot.c optimized sdot.c for increments != 1 2015-04-24 13:13:20 +02:00
sdot_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sdot_microk_haswell-2.c added optimized sdot- and ddot-kernel for HASWELL 2015-04-05 17:57:53 +02:00
sdot_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sdot_microk_sandy-2.c add optimized sdot-kernel for sandybridge 2015-04-05 19:47:05 +02:00
sdot_microk_steamroller-2.c added optimized sdot-kernel for steamroller 2015-04-11 08:48:18 +02:00
sgemm_kernel_8x4_bulldozer.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sgemm_kernel_8x8_sandy.S Update organization info. 2014-11-25 15:28:58 +08:00
sgemm_kernel_16x2_bulldozer.S Ref #380: lowered stack usage for piledriver and bulldozer kernels 2014-06-19 14:02:14 +02:00
sgemm_kernel_16x2_piledriver.S Ref #380: lowered stack usage for piledriver and bulldozer kernels 2014-06-19 14:02:14 +02:00
sgemm_kernel_16x4_haswell.S modification for clang compiler 2014-08-27 09:00:20 +02:00
sgemm_kernel_16x4_sandy.S Refs #535. Fix the wrong vector instruction in sgemm sandy bridge kernel. 2015-04-08 03:55:49 +08:00
sgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sgemv_n.c removed obsolete gemv kernel files 2014-09-14 11:00:53 +02:00
sgemv_n_4.c optimized dgemv_n kernel for haswell 2015-04-30 12:11:39 +02:00
sgemv_n_microk_bulldozer-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_n_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_n_microk_nehalem-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_n_microk_sandy-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
sgemv_t.c removed obsolete gemv kernel files 2014-09-14 11:00:53 +02:00
sgemv_t_4.c Merge pull request #482 from jeromerobert/develop 2015-01-02 02:26:17 +08:00
sgemv_t_microk_bulldozer-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_t_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_t_microk_nehalem-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sgemv_t_microk_sandy-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
sger.c added optimized sger kernel for sandybridge 2015-04-28 15:33:38 +02:00
sger_microk_sandy-2.c added optimized sger kernel for sandybridge 2015-04-28 15:33:38 +02:00
ssymv_L.c added optimized ssymv kernels for sandybridge 2015-04-23 12:19:24 +02:00
ssymv_L_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ssymv_L_microk_haswell-2.c added optimized ssymv kernels for haswell 2015-04-23 10:23:13 +02:00
ssymv_L_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ssymv_L_microk_sandy-2.c added optimized ssymv kernels for sandybridge 2015-04-23 12:19:24 +02:00
ssymv_U.c added optimized ssymv kernels for sandybridge 2015-04-23 12:19:24 +02:00
ssymv_U_microk_bulldozer-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ssymv_U_microk_haswell-2.c added optimized ssymv kernels for haswell 2015-04-23 10:23:13 +02:00
ssymv_U_microk_nehalem-2.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
ssymv_U_microk_sandy-2.c added optimized ssymv kernels for sandybridge 2015-04-23 12:19:24 +02:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
strsm_kernel_LN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
strsm_kernel_LT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
strsm_kernel_RN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
strsm_kernel_RT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
swap.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
swap_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
swap_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_L_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_L_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_U_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
symv_U_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_2x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x2_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x4_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_4x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LN_8x4_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_2x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x2_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x4_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_4x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_LT_8x4_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_2x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x2_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x4_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_4x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trsm_kernel_RT_8x4_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xgemm3m_kernel_2x2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xgemm_kernel_1x1.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xtrsm_kernel_LT_1x1.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zamax_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zasum_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy.c bugfixes: replaced int with BLASLONG 2015-04-24 14:30:44 +02:00
zaxpy_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_microk_bulldozer-2.c added optimized zaxpy-kernels 2015-04-16 11:19:37 +02:00
zaxpy_microk_haswell-2.c added optimized zaxpy-kernels 2015-04-16 11:19:37 +02:00
zaxpy_microk_sandy-2.c added optimized zaxpy-kernels 2015-04-16 11:19:37 +02:00
zaxpy_microk_steamroller-2.c added optimized zaxpy-kernels 2015-04-16 11:19:37 +02:00
zaxpy_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zaxpy_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zcopy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot.c bugfixes: replaced int with BLASLONG 2015-04-24 14:30:44 +02:00
zdot_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot_microk_bulldozer-2.c updated cdot and zdot 2015-04-10 11:10:31 +02:00
zdot_microk_haswell-2.c updated cdot and zdot 2015-04-10 11:10:31 +02:00
zdot_microk_sandy-2.c add optimized cdot- and zdot-kernel for sandybridge 2015-04-10 09:37:26 +02:00
zdot_microk_steamroller-2.c added optimized cdot- and zdot-kernel for steamroller 2015-04-09 09:45:23 +02:00
zdot_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zdot_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_2x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x2_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x4_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_4x8_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_8x4_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_8x4_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_8x4_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_8x4_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm3m_kernel_8x4_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_beta.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_1x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x1_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x2_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x2_bulldozer.S bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel 2014-06-28 12:16:20 +02:00
zgemm_kernel_2x2_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x2_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x2_piledriver.S bugfix for piledriver cgemm-, zgemm- and zgemv-kernel 2014-06-28 11:46:58 +02:00
zgemm_kernel_2x2_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x2_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_2x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x2_barcelona.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x2_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x2_haswell.S modification for clang compiler 2014-08-27 09:00:20 +02:00
zgemm_kernel_4x2_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x2_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_kernel_4x4_sandy.S Update organization info. 2014-11-25 15:28:58 +08:00
zgemm_ncopy_1.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_ncopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_1.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemm_tcopy_2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_n.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_n_4.c added optimized zgemv_n kernel for sandybridge 2014-09-14 09:02:05 +02:00
zgemv_n_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_n_dup.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_n_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
zgemv_n_microk_sandy-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
zgemv_t.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_t_4.c added target processor STEAMROLLER 2014-12-28 20:16:46 +08:00
zgemv_t_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_t_dup.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zgemv_t_microk_bulldozer-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
zgemv_t_microk_haswell-4.c changed inline assembler labels to short form 2014-12-07 12:38:54 +01:00
znrm2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
znrm2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zrot_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal.c added optimized cscal and zscal kernels for steamroller 2015-05-18 12:40:07 +02:00
zscal_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_microk_bulldozer-2.c added optimized zscal kernel for bulldozer 2015-05-17 11:45:19 +02:00
zscal_microk_haswell-2.c added optimized zscal kernel for haswell 2015-05-16 16:41:45 +02:00
zscal_microk_steamroller-2.c added optimized cscal and zscal kernels for steamroller 2015-05-18 12:40:07 +02:00
zscal_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zscal_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zsymv_L_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zsymv_L_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zsymv_U_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zsymv_U_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x1_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x2_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x2_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x2_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x2_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_2x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_4x2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ztrsm_kernel_LT_1x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x1_atom.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x2_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x2_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x2_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x2_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_2x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_4x2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_LT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ztrsm_kernel_RN_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
ztrsm_kernel_RT_1x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_2x2_core2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_2x2_penryn.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_2x2_sse2.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_2x2_sse3.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_2x4_nehalem.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_4x2_sse.S Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztrsm_kernel_RT_bulldozer.c added optimized trsm_kernels 2016-01-05 13:05:05 +01:00