Logo
Explore Help
Register Sign In
floraachy/OpenBLAS
1
0
Fork 0
You've already forked OpenBLAS
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
173 Commits 47 Branches 64 Tags
e08cfaf9ca9a65e28c4e0f790421aa03e7041c94
Commit Graph

10 Commits

Author SHA1 Message Date
traz
d238a768ab Use ps instructions in cgemm. 2011-09-14 15:32:25 +00:00
traz
cb0214787b Modify compile options. 2011-08-30 20:57:00 +00:00
traz
c8360e3ae5 Complete all the plura single precision functions of level3 on Loongson3a, the performance is 2.3GFlops. 2011-07-18 17:03:38 +00:00
traz
e72113f06a Add ztrmm and ztrsm part on loongson3a. The average performance is 2.2G. 2011-06-23 21:11:00 +00:00
traz
1c96d345e2 Improve zgemm performance from 1G to 1.8G, change block size in param.h. 2011-06-21 22:16:23 +00:00
traz
fc84909115 Modify single precision compiler conditions, increasing single precision kernel code on Loongson3a. 2011-05-27 09:47:17 +00:00
traz
d2f351d819 Modify dtrsm compiler options 2011-05-09 17:31:58 +00:00
traz
782205a693 Add dgemm compiler Options in KERNEL.LOONGSON3A. 2011-04-06 10:38:34 +00:00
Xianyi Zhang
1e671b49f3 Did the experiment with Loongson 3A 128bit load & store instruction. 2011-01-29 03:05:27 +08:00
Xianyi Zhang
c0b5992fab added axpy kernel with prefetch for Loongson3A. To-Do: tuning prefetch distance & instruction order. 2011-01-26 22:34:33 +08:00
Powered by Gitea Version: v3.0.2, by v1.25.1, 20260427 Page: 83ms Template: 3ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API