Martin Kroeker
|
723f396a20
|
Tag %1 and %2 as both input and output
The inline assembly modifies its input operands, so mark them as output to avoid surprises with optimization. Fixes #1292
|
2017-12-29 23:56:41 +01:00 |
Zhang Xianyi
|
6e7be06e07
|
Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OSX.
On Mac OS X, it should use .align 4 (equal to .align 16 on Linux).
I didn't get the performance benefit from .align. Thus, I deleted it.
|
2016-02-19 17:56:07 -05:00 |
Werner Saar
|
bc5fff7085
|
changed inline assembler labels to short form
|
2014-12-07 12:38:54 +01:00 |
wernsaar
|
cafba99b6b
|
bufix in cgemv_n_microk_haswell-4.c
|
2014-09-11 11:12:44 +02:00 |
wernsaar
|
ac8f232b2a
|
more optimizations
|
2014-09-11 10:25:48 +02:00 |
wernsaar
|
be95700b30
|
added optimized cgemv_kernel for haswell
|
2014-09-10 14:11:24 +02:00 |