Martin Kroeker | e976557d29 | Fix inline assembly constraints; rework indices to allow marking argument lda as input and output. | 2019-02-16 18:36:39 +01:00 |
Martin Kroeker | 7c1925acec | Use .p2align instead of .align for compatibility on Sandybridge as well | 2018-02-24 19:43:15 +01:00 |
Martin Kroeker | b973990df2 | Tag %1 and %2 as both input and output operands; fix from #1292 extended to the other gemv microkernels | 2017-12-31 18:03:36 +01:00 |
Werner Saar | bc5fff7085 | changed inline assembler labels to short form | 2014-12-07 12:38:54 +01:00 |
wernsaar | 2be5c7a640 | bugfix for windows | 2014-09-07 21:48:42 +02:00 |
wernsaar | 553e275407 | optimized sgemv_n kernel for sandybridge | 2014-09-07 20:53:30 +02:00 |
wernsaar | a64fe9bcc9 | added optimized sgemv_n kernel for sandybridge | 2014-09-06 08:41:53 +02:00 |