Martin Kroeker
|
675cd551da
|
fix improper function prototypes (empty parentheses)
|
2023-09-30 12:56:38 +02:00 |
Martin Kroeker
|
d1ee6ff73f
|
fix function typecasts
|
2021-12-21 18:45:28 +01:00 |
Martin Kroeker
|
950c047b49
|
Use Haswell optimizations for Zen as well
|
2021-02-11 09:24:51 +01:00 |
Martin Kroeker
|
114eb159a4
|
Disable FMA intrinsics in the srot kernel when the compiler is PGI/NVIDIA
|
2020-12-19 22:15:58 +01:00 |
Qiyu8
|
a87e537b8c
|
modify macro
|
2020-11-11 15:53:48 +08:00 |
Qiyu8
|
5bc0a7583f
|
only FMA3 and vector larger than 128 have positive effects.
|
2020-11-11 15:18:01 +08:00 |
Qiyu8
|
8c0b206d4c
|
Optimize the performance of rot by using universal intrinsics
|
2020-11-11 14:33:12 +08:00 |
Gengxin Xie
|
d9ba49165a
|
Improve the performance of rot by using AVX512 and AVX2 intrinsic
|
2020-11-05 15:12:36 +08:00 |