Deeksha Goplani
|
4894c54055
|
Improve TN case with further unrolling
|
2024-09-02 22:22:49 +05:30 |
Chris Sidebottom
|
ea4ab3b310
|
Better header guard around bridge
|
2024-07-20 14:39:57 +01:00 |
Chris Sidebottom
|
9984c5ce9d
|
Clean up k2 removal more and unroll SGEMM more
|
2024-07-18 18:35:43 +01:00 |
Chris Sidebottom
|
b1c9fafabb
|
Remove k2 loop from DGEMM TN and use a more conservative heuristic for SGEMM
|
2024-07-18 17:37:18 +01:00 |
Chris Sidebottom
|
8c472ef7e3
|
Further tweak small GEMM for AArch64
|
2024-06-24 10:47:47 +01:00 |
Chris Sidebottom
|
7a6fa699f2
|
Small GEMM for AArch64
This is a fairly conservative addition of small matrix kernels using
SVE.
|
2024-03-04 15:48:47 +00:00 |