OpenBLAS/kernel/x86_64
Zhang Xianyi f19af5ecc0 Refs #54. Added AMD Bulldozer x86_64 dgemm kernel developed by Werner Saar <wernsaar at googlemail.com>
Based on the dgemm kernel for AMD Barcelona, he used AVX and FMA4 instructions.
Thank Werner Saar!
2012-12-07 01:05:11 +08:00
..
KERNEL Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.ATOM Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.BARCELONA Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.BOBCAT Refs #113. Fixed the typo BOBCATE -> BOBCAT 2012-05-31 22:40:15 +08:00
KERNEL.BULLDOZER Refs #54. Added AMD Bulldozer x86_64 dgemm kernel developed by Werner Saar <wernsaar at googlemail.com> 2012-12-07 01:05:11 +08:00
KERNEL.CORE2 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.DUNNINGTON Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.NANO Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.NEHALEM Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.OPTERON Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.OPTERON_SSE3 Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PENRYN Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.PRESCOTT Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
KERNEL.SANDYBRIDGE Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions. 2012-06-19 16:37:12 +08:00
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
amax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
asum_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
axpy_sse.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
axpy_sse2.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
builtin_stinit.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cabs.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cgemm_kernel_4x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
cgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
cgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
copy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dgemm_kernel_4x4_bulldozer.S Refs #54. Added AMD Bulldozer x86_64 dgemm kernel developed by Werner Saar <wernsaar at googlemail.com> 2012-12-07 01:05:11 +08:00
dgemm_kernel_4x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
dgemm_ncopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_ncopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_ncopy_8.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemm_tcopy_8.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
dgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dgemv_n_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dgemv_t.S Refs #154. Fixed the build bug of dgemv_t on MinW64. 2012-11-27 07:24:04 +08:00
dgemv_t_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
dot_sse.S Fixed #28. Convert the result to double precision in the end of dsdot kernel. 2011-05-13 02:34:30 +08:00
dot_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_beta.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x8_nano.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_barcelona.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
gemm_kernel_8x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_kernel_8x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
gemm_ncopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_ncopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_ncopy_4_opteron.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
gemm_tcopy_2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_tcopy_4.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
gemm_tcopy_4_opteron.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
iamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
iamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
iamax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
izamax_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
lsame.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
mcount.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
nrm2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
nrm2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qconjg.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemm_kernel_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_LN_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_LT_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
qtrsm_kernel_RT_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
rot_sse2.S fixed a bug in drot whe incx or incy equals to zero. 2011-02-16 23:35:41 +08:00
scal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
scal_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
scal_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
scal_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
sgemm_kernel_8x8_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
sgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
sgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
staticbuffer.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
swap_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
symv_L_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
symv_L_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
symv_U_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
symv_U_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
trsm_kernel_LN_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LN_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
trsm_kernel_LT_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_LT_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
trsm_kernel_RT_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
trsm_kernel_RT_8x4_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
xdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemm3m_kernel_2x2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemm_kernel_1x1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xgemv_t.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
xtrsm_kernel_LT_1x1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zamax_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zasum_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zaxpy_sse.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
zaxpy_sse2.S Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0. 2011-02-21 00:24:21 +08:00
zcopy.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zcopy_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zcopy_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zdot_sse.S Ref #65. Fixed 64-bit Windows calling convention bug in cdot and zdot. 2011-10-18 10:23:17 +08:00
zdot_sse2.S Ref #65. Fixed 64-bit Windows calling convention bug in cdot and zdot. 2011-10-18 10:23:17 +08:00
zgemm3m_kernel_2x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x2_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_4x8_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm3m_kernel_8x4_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_beta.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_1x4_nehalem.S Fixed #33 ztrmm bug on Nehalem. 2011-06-07 12:53:25 +08:00
zgemm_kernel_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_barcelona.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_kernel_4x4_sandy.S Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM. 2012-06-25 19:00:37 +08:00
zgemm_ncopy_1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_ncopy_2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
zgemm_tcopy_1.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemm_tcopy_2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_n.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_n_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_n_dup.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_t.S Fixed #66 the bug in zgemv kernel with transpose matrix on 64-bit MingW (Windows). 2011-10-18 18:44:23 +08:00
zgemv_t_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zgemv_t_dup.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
znrm2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
znrm2_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zrot.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zrot_sse.S fixed #4 csrot returned the wrong result when incx==incy==0. 2011-02-16 23:39:43 +08:00
zrot_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zscal_sse.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
zscal_sse2.S Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
zswap.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zswap_sse.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zswap_sse2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
zsymv_L_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
zsymv_L_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
zsymv_U_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
zsymv_U_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LN_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LN_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LN_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LT_1x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x1_atom.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_LT_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_LT_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_RT_1x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_core2.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_penryn.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x2_sse2.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00
ztrsm_kernel_RT_2x2_sse3.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_2x4_nehalem.S Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
ztrsm_kernel_RT_4x2_sse.S Added BULLDOZER target. So far it uses barcelona kernels. 2012-12-07 00:53:31 +08:00