Wangyang Guo | ca7682e3a3 | Small Matrix: skylakex: sgemm nn: fix n6 conflicts with n4 | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 9967e61abb | Small Matrix: skylakex: sgemm nn: fix error when beta not zero | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | a87736346f | Small Matrix: skylakex: sgemm nn: add n6 to improve performance | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 4c9d9940fd | Small Matrix: skylakex: sgemm nn: reduce store 4 N at a time | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 13b32f69b7 | Small Matrix: skylakex: sgemm nn: reduce store 4 M at a time | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 3d8c6d9607 | Small Matrix: skylakex: sgemm nn: clean up unused code | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 49b61a3f30 | Small Matrix: skylakex: sgemm_nn: optimize for M <= 8 | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | f88470323b | Optimize M < 16 using AVX512 mask | 2021-08-02 07:06:54 +00:00 |
Wangyang Guo | 9186456a12 | small matrix: SkylakeX: add SGEMM NN kernel | 2021-08-02 07:06:54 +00:00 |