.. |
KERNEL
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.ATOM
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.BARCELONA
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.BOBCAT
|
Refs #113. Fixed the typo BOBCATE -> BOBCAT
|
2012-05-31 22:40:15 +08:00 |
KERNEL.BULLDOZER
|
added dcopy_bulldozer.S
|
2013-06-21 16:06:51 +02:00 |
KERNEL.CORE2
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.DUNNINGTON
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.NANO
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.NEHALEM
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.OPTERON
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.OPTERON_SSE3
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.PENRYN
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.PILEDRIVER
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
KERNEL.PRESCOTT
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
KERNEL.SANDYBRIDGE
|
Refs #83 #53. Adding Intel Sandy Bridge (AVX supported) kernel codes for BLAS level 3 functions.
|
2012-06-19 16:37:12 +08:00 |
Makefile
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
amax.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
amax_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
amax_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
amax_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
asum.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
asum_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
asum_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
asum_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
axpy.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
axpy_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
axpy_sse.S
|
Fixed the Windows x86_64 ABI bug in s/daxpy kernels.
|
2013-03-08 22:28:34 +08:00 |
axpy_sse2.S
|
Fixed the Windows x86_64 ABI bug in s/daxpy kernels.
|
2013-03-08 22:28:34 +08:00 |
builtin_stinit.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
cabs.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
cgemm_kernel_4x2_bulldozer.S
|
added cgemm_kernel_4x2_bulldozer.S
|
2013-06-12 15:55:27 +02:00 |
cgemm_kernel_4x8_sandy.S
|
Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM.
|
2012-06-25 19:00:37 +08:00 |
cgemv_n.S
|
Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64.
|
2013-05-29 19:48:31 +08:00 |
cgemv_t.S
|
Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64.
|
2013-05-29 19:48:31 +08:00 |
copy.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
copy_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
copy_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
daxpy_bulldozer.S
|
added daxpy_bulldozer.S
|
2013-06-20 14:07:54 +02:00 |
dcopy_bulldozer.S
|
added dcopy_bulldozer.S
|
2013-06-21 16:06:51 +02:00 |
ddot_bulldozer.S
|
added ddot_bulldozer.S
|
2013-06-20 16:15:09 +02:00 |
dgemm_kernel_4x8_sandy.S
|
Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM.
|
2012-06-25 19:00:37 +08:00 |
dgemm_kernel_8x2_bulldozer.S
|
changed stack touching
|
2013-06-08 10:43:08 +02:00 |
dgemm_ncopy_2.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_ncopy_4.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_ncopy_8.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_ncopy_8_bulldozer.S
|
cleanup of dgemm_ncopy_8_bulldozer.S
|
2013-06-19 19:31:38 +02:00 |
dgemm_tcopy_2.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_tcopy_4.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_tcopy_8.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
dgemm_tcopy_8_bulldozer.S
|
added gemm_tcopy_2_bulldozer.S
|
2013-06-18 11:01:33 +02:00 |
dgemv_n.S
|
Refs #223. Fixed s/dgemv bug on windows.
|
2013-06-04 16:01:05 +08:00 |
dgemv_n_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
dgemv_n_bulldozer.S
|
Use ALIGN_5 instead of .algin 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX.
|
2013-07-01 16:09:05 +08:00 |
dgemv_t.S
|
Refs #154. Fixed the build bug of dgemv_t on MinW64.
|
2012-11-27 07:24:04 +08:00 |
dgemv_t_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
dgemv_t_bulldozer.S
|
added dgemv_t_bulldozer.S
|
2013-06-19 17:32:42 +02:00 |
dot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
dot_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
dot_sse.S
|
Refs #189. Fixed the bug of s/cdot about invalid reading NAN on x86_64.
|
2013-01-25 20:56:14 +08:00 |
dot_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_beta.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_2x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x2_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x4_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x8_nano.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_4x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_8x4_barcelona.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
gemm_kernel_8x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_8x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_8x4_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_kernel_8x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
gemm_ncopy_2.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
gemm_ncopy_2_bulldozer.S
|
added gemm_ncopy_2_bulldozer.S
|
2013-06-17 12:55:12 +02:00 |
gemm_ncopy_4.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
gemm_ncopy_4_opteron.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
gemm_tcopy_2.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
gemm_tcopy_2_bulldozer.S
|
added gemm_tcopy_2_bulldozer.S
|
2013-06-18 11:01:33 +02:00 |
gemm_tcopy_4.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
gemm_tcopy_4_opteron.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
iamax.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
iamax_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
iamax_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
izamax.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
izamax_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
izamax_sse2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
lsame.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
mcount.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
nrm2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
nrm2_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qconjg.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qdot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qgemm_kernel_2x2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qgemv_n.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qgemv_t.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qtrsm_kernel_LN_2x2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qtrsm_kernel_LT_2x2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
qtrsm_kernel_RT_2x2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
rot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
rot_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
rot_sse2.S
|
fixed a bug in drot whe incx or incy equals to zero.
|
2011-02-16 23:35:41 +08:00 |
scal.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
scal_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
scal_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
scal_sse2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
sgemm_kernel_8x8_sandy.S
|
Fixed the issue of mixing AVX and SSE codes in S/D/C/ZGEMM.
|
2012-06-25 19:00:37 +08:00 |
sgemm_kernel_16x2_bulldozer.S
|
performance optimizations in sgemm_kernel_16x2_bulldozer.S
|
2013-06-13 11:35:15 +02:00 |
sgemv_n.S
|
Refs #223. Fixed s/dgemv bug on windows.
|
2013-06-04 16:01:05 +08:00 |
sgemv_t.S
|
Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64.
|
2013-05-29 19:48:31 +08:00 |
staticbuffer.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
swap.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
swap_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
swap_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
symv_L_sse.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
symv_L_sse2.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
symv_U_sse.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
symv_U_sse2.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
trsm_kernel_LN_2x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x2_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x4_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_4x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LN_8x4_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
trsm_kernel_LT_2x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x2_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x4_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_4x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_LT_8x4_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
trsm_kernel_RT_2x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x2_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x4_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_4x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
trsm_kernel_RT_8x4_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
xdot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
xgemm3m_kernel_2x2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
xgemm_kernel_1x1.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
xgemv_n.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
xgemv_t.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
xtrsm_kernel_LT_1x1.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zamax.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zamax_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zamax_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zamax_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zasum.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zasum_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zasum_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zasum_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zaxpy.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zaxpy_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zaxpy_sse.S
|
Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0.
|
2011-02-21 00:24:21 +08:00 |
zaxpy_sse2.S
|
Fixed #7. 1)Disable the multi-thread and 2) Modified kernel codes to avoid unloop in axpy function when incx==0 or incy==0.
|
2011-02-21 00:24:21 +08:00 |
zcopy.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zcopy_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zcopy_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zdot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zdot_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zdot_sse.S
|
Refs #189. Fixed the bug of s/cdot about invalid reading NAN on x86_64.
|
2013-01-25 20:56:14 +08:00 |
zdot_sse2.S
|
Ref #65. Fixed 64-bit Windows calling convention bug in cdot and zdot.
|
2011-10-18 10:23:17 +08:00 |
zgemm3m_kernel_2x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x2_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x4_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_4x8_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_8x4_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_8x4_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_8x4_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_8x4_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm3m_kernel_8x4_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_beta.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_1x4_nehalem.S
|
Fixed #33 ztrmm bug on Nehalem.
|
2011-06-07 12:53:25 +08:00 |
zgemm_kernel_2x1_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x2_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x2_bulldozer.S
|
added zgemm_kernel_2x2_bulldozer.S
|
2013-06-11 12:00:49 +02:00 |
zgemm_kernel_2x2_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x2_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x2_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x2_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_2x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x2_barcelona.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x2_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x2_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x2_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x2_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_kernel_4x4_sandy.S
|
Fixed a computational error in zgemm_kernel_4x4_sandy.S file.
|
2013-07-18 20:23:21 +08:00 |
zgemm_ncopy_1.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_ncopy_2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
zgemm_tcopy_1.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemm_tcopy_2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemv_n.S
|
Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64.
|
2013-05-29 19:48:31 +08:00 |
zgemv_n_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemv_n_dup.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemv_t.S
|
Fixed overflow internal buffer bug of (s/d/c/z)gemv on x86_64.
|
2013-05-29 19:48:31 +08:00 |
zgemv_t_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zgemv_t_dup.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
znrm2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
znrm2_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zrot.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zrot_sse.S
|
fixed #4 csrot returned the wrong result when incx==incy==0.
|
2011-02-16 23:39:43 +08:00 |
zrot_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zscal.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zscal_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zscal_sse.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
zscal_sse2.S
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
zswap.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zswap_sse.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zswap_sse2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
zsymv_L_sse.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
zsymv_L_sse2.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
zsymv_U_sse.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
zsymv_U_sse2.S
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
ztrsm_kernel_LN_2x1_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LN_2x2_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LN_2x2_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LN_2x2_sse2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
ztrsm_kernel_LN_2x2_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LN_2x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LN_4x2_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
ztrsm_kernel_LT_1x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_2x1_atom.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_2x2_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_2x2_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_2x2_sse2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
ztrsm_kernel_LT_2x2_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_2x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_LT_4x2_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
ztrsm_kernel_RT_1x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_RT_2x2_core2.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_RT_2x2_penryn.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_RT_2x2_sse2.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |
ztrsm_kernel_RT_2x2_sse3.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_RT_2x4_nehalem.S
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |
ztrsm_kernel_RT_4x2_sse.S
|
Added BULLDOZER target. So far it uses barcelona kernels.
|
2012-12-07 00:53:31 +08:00 |