Martin Kroeker
|
b716c0ef01
|
Add workaround for NVIDIA HPC
|
2021-01-12 16:51:35 +01:00 |
shengyang
|
80db5f11e1
|
update
|
2020-01-02 11:01:57 +08:00 |
shengyang
|
8d84403205
|
Use arm neon instructions to optimize ncopy operation
modified: KERNEL.ARMV8
modified: KERNEL.TSV110
new file: sgemm_ncopy_4.S
|
2019-12-31 17:06:35 +08:00 |
zq
|
50f7fc1401
|
[WIP] Use arm neon instructions to optimize tcopy operation
|
2019-12-31 10:21:23 +08:00 |
maomao194313
|
783ba8058f
|
HiSilicon tsv110 CPUs optimization branch
add HiSilicon tsv110 CPUs optimization branch
|
2019-03-04 16:30:50 +08:00 |