Zhang Xianyi
|
6e7be06e07
|
Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OS X.
On Mac OS X, the directive should be .align 4 (equivalent to .align 16 on Linux).
I saw no performance benefit from .align, so I deleted it.
|
2016-02-19 17:56:07 -05:00 |
Werner Saar
|
30f52d53df
|
optimized dgemv_n kernel for haswell
|
2015-04-30 12:11:39 +02:00 |
Werner Saar
|
ddf983d643
|
added optimizations for steamroller
|
2014-12-30 20:14:45 +08:00 |
Werner Saar
|
bc5fff7085
|
changed inline assembler labels to short form
|
2014-12-07 12:38:54 +01:00 |
wernsaar
|
baa46e4fba
|
added and tested optimized dgemv_n kernel for haswell
|
2014-09-09 16:17:45 +02:00 |
wernsaar
|
658939faaa
|
optimized dgemv_n kernel for small sizes
|
2014-09-08 15:22:35 +02:00 |