Martin Kroeker
|
723f396a20
|
Tag %1 and %2 as both input and output
The inline assembly modifies its input operands, so mark them as output to avoid surprises with optimization. Fixes #1292
|
2017-12-29 23:56:41 +01:00 |
Zhang Xianyi
|
6e7be06e07
|
Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OSX.
On Mac OS X, it should use .align 4 (equal to .align 16 on Linux).
I didn't get the performance benefit from .align. Thus, I deleted it.
|
2016-02-19 17:56:07 -05:00 |
Werner Saar
|
bc5fff7085
|
changed inline assembler labels to short form
|
2014-12-07 12:38:54 +01:00 |
wernsaar
|
bb8d698335
|
optimized zgemv_n_microk_haswell-4.c for small size
|
2014-09-11 13:44:55 +02:00 |
wernsaar
|
bced4594bb
|
added optimized zgemv_n kernel
|
2014-09-11 12:34:57 +02:00 |