OpenBLAS/kernel/x86_64
wangqian 1b3b9e841d Fixed a computational error in zgemm_kernel_4x4_sandy.S file. 2013-07-18 20:23:21 +08:00
..
KERNEL Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.ATOM Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.BARCELONA Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.BOBCAT Refs #113. Fixed the typo BOBCATE -> BOBCAT 2012-05-31 22:40:15 +08:00
KERNEL.BULLDOZER added dcopy_bulldozer.S 2013-06-21 16:06:51 +02:00
KERNEL.CORE2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.DUNNINGTON Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.NANO Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.NEHALEM Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.OPTERON Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.OPTERON_SSE3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PENRYN Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PILEDRIVER Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
KERNEL.PRESCOTT Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.SANDYBRIDGE Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions. 2012-06-19 16:37:12 +08:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy_sse.S Fixed the Windows x86_64 ABI bug in s/daxpy kernels. 2013-03-08 22:28:34 +08:00
axpy_sse2.S Fixed the Windows x86_64 ABI bug in s/daxpy kernels. 2013-03-08 22:28:34 +08:00
builtin_stinit.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cgemm_kernel_4x2_bulldozer.S added cgemm_kernel_4x2_bulldozer.S 2013-06-12 15:55:27 +02:00
cgemm_kernel_4x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
cgemv_n.S Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 2013-05-29 19:48:31 +08:00
cgemv_t.S Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 2013-05-29 19:48:31 +08:00
copy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
daxpy_bulldozer.S added daxpy_bulldozer.S 2013-06-20 14:07:54 +02:00
dcopy_bulldozer.S added dcopy_bulldozer.S 2013-06-21 16:06:51 +02:00
ddot_bulldozer.S added ddot_bulldozer.S 2013-06-20 16:15:09 +02:00
dgemm_kernel_4x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
dgemm_kernel_8x2_bulldozer.S changed stack touching 2013-06-08 10:43:08 +02:00
dgemm_ncopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_ncopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_ncopy_8.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_ncopy_8_bulldozer.S cleanup of dgemm_ncopy_8_bulldozer.S 2013-06-19 19:31:38 +02:00
dgemm_tcopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_8.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_8_bulldozer.S added gemm_tcopy_2_bulldozer.S 2013-06-18 11:01:33 +02:00
dgemv_n.S Refs #223. Fixed s/dgemv bug on windows. 2013-06-04 16:01:05 +08:00
dgemv_n_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dgemv_n_bulldozer.S Use ALIGN_5 instead of .algin 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX. 2013-07-01 16:09:05 +08:00
dgemv_t.S Refs #154. Fixed the build bug of dgemv_t on MinW64. 2012-11-27 07:24:04 +08:00
dgemv_t_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dgemv_t_bulldozer.S added dgemv_t_bulldozer.S 2013-06-19 17:32:42 +02:00
dot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot_sse.S Refs #189. Fixed the bug of s/cdot about invalid reading NAN on x86_64. 2013-01-25 20:56:14 +08:00
dot_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_beta.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x8_nano.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_barcelona.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
gemm_kernel_8x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_ncopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_ncopy_2_bulldozer.S added gemm_ncopy_2_bulldozer.S 2013-06-17 12:55:12 +02:00
gemm_ncopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_ncopy_4_opteron.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
gemm_tcopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_tcopy_2_bulldozer.S added gemm_tcopy_2_bulldozer.S 2013-06-18 11:01:33 +02:00
gemm_tcopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_tcopy_4_opteron.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
iamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
iamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
iamax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
mcount.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
nrm2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
nrm2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qconjg.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemm_kernel_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_LN_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_LT_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_RT_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot_sse2.S fixed a bug in drot whe incx or incy equals to zero. 2011-02-16 23:35:41 +08:00
scal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
scal_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
scal_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
scal_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
sgemm_kernel_8x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
sgemm_kernel_16x2_bulldozer.S performance optimizations in sgemm_kernel_16x2_bulldozer.S 2013-06-13 11:35:15 +02:00
sgemv_n.S Refs #223. Fixed s/dgemv bug on windows. 2013-06-04 16:01:05 +08:00
sgemv_t.S Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 2013-05-29 19:48:31 +08:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
symv_L_sse.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
symv_L_sse2.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
symv_U_sse.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
symv_U_sse2.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
trsm_kernel_LN_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
trsm_kernel_LT_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
trsm_kernel_RT_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
xdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemm3m_kernel_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemm_kernel_1x1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xtrsm_kernel_LT_1x1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy_sse.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
zaxpy_sse2.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
zcopy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zcopy_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zcopy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot_sse.S Refs #189. Fixed the bug of s/cdot about invalid reading NAN on x86_64. 2013-01-25 20:56:14 +08:00
zdot_sse2.S Ref #65. Fixed 64-bit Windows calling convention bug in cdot and zdot. 2011-10-18 10:23:17 +08:00
zgemm3m_kernel_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_beta.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_1x4_nehalem.S Fixed #33 ztrmm bug on Nehalem. 2011-06-07 12:53:25 +08:00
zgemm_kernel_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_bulldozer.S added zgemm_kernel_2x2_bulldozer.S 2013-06-11 12:00:49 +02:00
zgemm_kernel_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x4_sandy.S Fixed a computational error in zgemm_kernel_4x4_sandy.S file. 2013-07-18 20:23:21 +08:00
zgemm_ncopy_1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_ncopy_2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
zgemm_tcopy_1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_tcopy_2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_n.S Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 2013-05-29 19:48:31 +08:00
zgemv_n_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_n_dup.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_t.S Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64. 2013-05-29 19:48:31 +08:00
zgemv_t_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_t_dup.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
znrm2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
znrm2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zrot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zrot_sse.S fixed #4 csrot returned the wrong result when incx==incy==0. 2011-02-16 23:39:43 +08:00
zrot_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal_sse.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
zscal_sse2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
zswap.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zswap_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zswap_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zsymv_L_sse.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
zsymv_L_sse2.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
zsymv_U_sse.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
zsymv_U_sse2.S Support AMD Piledriver by bulldozer kernels. 2013-07-06 12:06:43 -03:00
ztrsm_kernel_LN_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LN_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LT_1x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LT_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_RT_1x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_RT_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00