OpenBLAS/interface
Martin Kroeker c3a2d407a0
Merge pull request #4048 from imzhuhl/spr_sbgemm_fix
Sapphire Rapids sbgemm fix
2023-06-17 20:47:09 +02:00
..
lapack Add lower threshold for multithreading 2021-07-01 17:41:05 +02:00
netlib ref #80. On P4 CPU with 32-bit Windows XP, Octave crashed with OpenBLAS. Walkaroud: Use netlib reference gemv instead of own funtions. 2012-03-16 20:29:39 +08:00
CMakeLists.txt Merge pull request #3796 from martin-frbg/gemmt 2022-11-12 19:06:05 +01:00
Makefile redo lost edit 2023-03-28 18:31:04 +02:00
asum.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
axpby.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
axpy.c fix function typecasts 2021-12-21 18:47:35 +01:00
bf16dot.c Add bfloat16 based dot and conversion with single/double 2020-09-04 02:31:25 +08:00
bf16to.c Add bfloat16 based dot and conversion with single/double 2020-09-04 02:31:25 +08:00
copy.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
create rewrite perl scripts in universal shell 2022-05-18 19:00:15 -05:00
create.pl Add back original PERL-based script under new name 2022-05-22 18:29:01 +02:00
dot.c updated some level1 funcions, that are not thread save 2017-01-10 14:05:07 +01:00
dsdot.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
gbmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
geadd.c Add ATLAS-style ?geadd function 2015-02-16 13:46:20 +01:00
gemm.c Fix syscall error on non-x86 platform 2023-05-22 21:59:59 +08:00
gemmt.c Improve input argument checks of gemmt 2023-05-26 08:51:27 +02:00
gemv.c Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64 2021-07-14 20:32:57 +02:00
ger.c Clean up some warnings 2021-07-11 16:00:29 +02:00
imatcopy.c More detailed error message in [z]imatcopy.c. 2023-05-12 09:41:52 -07:00
imax.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
max.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
nrm2.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
omatcopy.c Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them. 2014-09-08 17:57:44 +02:00
rot.c updated some level1 funcions, that are not thread save 2017-01-10 14:05:07 +01:00
rotg.c fabs -> fabsl 2018-08-03 13:00:10 -04:00
rotm.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
rotmg.c Remove dead assignment to dflag in rotmg functions 2021-01-14 19:40:32 -07:00
sbgemv.c Implementation of BF16 based gemv 2020-10-29 02:08:23 +08:00
sbmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
scal.c fix function typecasts 2021-12-21 18:47:35 +01:00
sdsdot.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
spmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
spr.c Add shortcuts for (small) cases that do not need expensive buffer allocation 2021-05-27 22:39:18 +02:00
spr2.c Add shortcuts for (small) cases that do not need expensive buffer allocation 2021-05-27 22:39:18 +02:00
sum.c Add interface for ?sum (derived from ?asum) 2019-03-30 21:59:18 +01:00
swap.c ARM64: Add THUNDERX3T110 Target 2020-07-26 23:32:24 -07:00
symm.c Add multithreading threshold 2023-03-26 00:25:28 +01:00
symv.c Do not use multithreading for small workloads 2023-03-23 23:13:02 +01:00
syr.c Add fast path for small xSYR with INCX==1 2021-05-22 20:41:18 +02:00
syr2.c Add shortcuts for (small) cases that do not need expensive buffer allocation 2021-05-27 22:39:18 +02:00
syr2k.c Do not use multithreading for small workloads 2023-03-23 23:13:02 +01:00
syrk.c Rework multithreading threshold 2023-03-26 00:23:57 +01:00
tbmv.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
tbsv.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
tobf16.c Add bfloat16 based dot and conversion with single/double 2020-09-04 02:31:25 +08:00
tpmv.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
tpsv.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
trmv.c Allow multithreading TRMV again 2019-02-19 21:03:30 +01:00
trsm.c Correct length of name string in xerbla call 2019-04-27 22:49:04 +02:00
trsv.c Revert #3252 2021-10-24 23:57:06 +02:00
xerbla.c Update xerbla.c 2017-04-26 20:29:30 +02:00
zaxpby.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zaxpy.c fix function typecasts 2021-12-21 18:47:35 +01:00
zdot.c Make return parameter of cblas_Xdotc_sub, cblas_Xdotu_sub a void pointer as well 2017-11-18 20:28:02 +01:00
zgbmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
zgeadd.c Add ATLAS-style ?geadd function 2015-02-16 13:46:20 +01:00
zgemv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
zger.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zhbmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
zhemv.c re-arrange new code for readability 2018-10-20 21:37:53 +03:00
zher.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zher2.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zhpmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
zhpr.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zhpr2.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
zimatcopy.c More detailed error message in [z]imatcopy.c. 2023-05-12 09:41:52 -07:00
zomatcopy.c Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them. 2014-09-08 17:57:44 +02:00
zrot.c Add cblas_{c/z}srot cblas_{c/z}rotg support 2021-11-01 20:19:13 +08:00
zrotg.c Add cblas_{c/z}srot cblas_{c/z}rotg support 2021-11-01 20:19:13 +08:00
zsbmv.c Use blasabs to switch between abs and labs as needed for INTERFACE64 2018-08-04 20:06:49 +02:00
zscal.c fix function typecasts 2021-12-21 18:47:35 +01:00
zspmv.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zspr.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zspr2.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zswap.c ARM64: Add THUNDERX3T110 Target 2020-07-26 23:32:24 -07:00
zsymv.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
zsyr.c Fix stride in shortcut path for small N 2022-12-08 21:02:01 +01:00
zsyr2.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
ztbmv.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
ztbsv.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
ztpmv.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
ztpsv.c Modify complex CBLAS functions to take void pointers 2017-11-05 15:53:14 +01:00
ztrmv.c Allow multithreading TRMV again 2019-02-19 21:03:30 +01:00
ztrsv.c Remove dangerous optimization from previous #3252 - buffer is never unused here 2021-10-20 10:50:02 +02:00