Commit Graph

8188 Commits

Author SHA1 Message Date
Martin Kroeker
03698f4c1d fix typedef of logical to support INTERFACE64 2024-04-04 01:46:46 +02:00
Martin Kroeker
353b1180c1 fix typedef of logical to support INTERFACE64 2024-04-04 01:38:10 +02:00
Martin Kroeker
8bed05a936 fix typedef of logical to support INTERFACE64 2024-04-04 01:32:37 +02:00
Martin Kroeker
466fb61c95 fix typedef of logical to support INTERFACE64 2024-04-04 01:27:57 +02:00
Martin Kroeker
7e470400c9 fix typedef of logical to support INTERFACE64 2024-04-04 01:24:27 +02:00
Martin Kroeker
373e36377c fix typedef of logical to support INTERFACE64 2024-04-04 01:19:55 +02:00
Martin Kroeker
abea53b467 fix typedef of logical to support INTERFACE64 2024-04-04 01:11:37 +02:00
Martin Kroeker
b6440160ce fix typedef of logical to support INTERFACE64 2024-04-04 01:05:35 +02:00
Martin Kroeker
a16adcddd9 fix typedef of logical to support INTERFACE64 2024-04-04 00:59:57 +02:00
Martin Kroeker
1d4d36e835 fix typedef of logical to support INTERFACE64 2024-04-04 00:51:36 +02:00
Martin Kroeker
c0d7477260 fix typedef of logical to support INTERFACE64 2024-04-04 00:41:14 +02:00
Martin Kroeker
b0973a98e2 fix typedef of logical to support INTERFACE64 2024-04-04 00:34:32 +02:00
Martin Kroeker
0d33422b8d fix typedef of logical to support INTERFACE64 2024-04-04 00:24:54 +02:00
Martin Kroeker
cda91fdc48 fix typedef of logical to support INTERFACE64 2024-04-04 00:20:19 +02:00
Martin Kroeker
f7dd80ba73 fix typedef of logical to support INTERFACE64 2024-04-04 00:14:57 +02:00
Martin Kroeker
34be8fae29 fix typedef of logical to support INTERFACE64 2024-04-04 00:10:01 +02:00
Martin Kroeker
770246363f fix typedef of logical to support INTERFACE64 2024-04-04 00:04:50 +02:00
Martin Kroeker
04aae0e12b fix typedef of logical to support INTERFACE64 2024-04-03 23:57:12 +02:00
Martin Kroeker
7a267e4c50 fix typedef for logical to support INTERFACE64 2024-04-03 23:51:25 +02:00
Martin Kroeker
837a173115 fix prototype of logical to support INTERFACE64 2024-04-03 23:43:55 +02:00
Martin Kroeker
c7d0a0b009 fix typedef of logical to support INTERFACE64 2024-04-03 23:39:43 +02:00
Martin Kroeker
b1da12356d fix typedef of logical to support INTERFACE64 2024-04-03 23:00:54 +02:00
Martin Kroeker
592905a72c fix typedef of logical to support INTERFACE64 2024-04-03 22:53:22 +02:00
Martin Kroeker
f0f9e26090 fix typedef of logical to support INTERFACE64 2024-04-03 22:47:34 +02:00
Martin Kroeker
547533fe9a fix typedef of logical to support INTERFACE64 2024-04-03 22:35:10 +02:00
Martin Kroeker
7bc0ff77bd fix typedef of logical to support INTERFACE64 2024-04-03 22:23:18 +02:00
Martin Kroeker
16a597927f fix typedef of logical to support INTERFACE64 2024-04-03 22:09:28 +02:00
Martin Kroeker
d277c6d15b Merge pull request #4585 from martin-frbg/issue1881
Cap the number of parallel threads for GEMM;GETRF and POTRF to ensure sensible workloads on big systems
2024-04-03 18:35:16 +02:00
Martin Kroeker
e1b618c6cf Merge pull request #4598 from jip/fix
fix dtrtrs_ and ztrtrs_ to accept case-insensitive uplo and diag parameters
2024-04-03 14:37:45 +02:00
Igor Zhuravlov
22d305e2df fix dtrtrs_ and ztrtrs_ to accept case-insensitive parameters uplo and diag
Changes to be committed:
	modified:   interface/lapack/trtrs.c
	modified:   interface/lapack/ztrtrs.c
2024-04-03 19:01:38 +10:00
Martin Kroeker
a87713f14b Merge pull request #4597 from martin-frbg/issue4581
Add support for the ARM Cortex-A76 cpu
2024-04-03 08:46:29 +02:00
Martin Kroeker
584e87661d set SWITCH_RATIO for Cortex-A76 2024-04-02 23:10:45 +02:00
Martin Kroeker
4718d97eb9 Mention support for Cortex-A76 2024-04-02 19:45:38 +02:00
Martin Kroeker
b925f61fb0 Add support for Cortex-A76 2024-04-02 19:44:17 +02:00
Martin Kroeker
3af736fb9d Add support for Cortex-A76 2024-04-02 19:42:23 +02:00
Martin Kroeker
441c81026e Add support for Cortex-A76 2024-04-02 19:41:44 +02:00
Martin Kroeker
58659730a4 Merge pull request #4595 from martin-frbg/issue4583-2
Temporarily revert S/DNRM2 on NeoverseN1 and Apple M to the older NEON kernel
2024-04-02 17:06:03 +02:00
Martin Kroeker
9ead81bd39 Revert S/DNRM2 to the base NEON kernel to fix precision loss 2024-04-02 15:59:20 +02:00
Martin Kroeker
29995b2969 Merge pull request #4594 from mattip/openblas.pc.il
fix typo
2024-04-02 08:27:43 +02:00
Matti Picus
5b33e648b4 fix typo 2024-04-02 08:54:56 +11:00
Martin Kroeker
b1e8ba5017 Merge pull request #4587 from mseminatore/win_queue_fix
Address Windows thread server re-entrant queue bug #4582
2024-03-29 14:36:21 +01:00
Martin Kroeker
8267fcfda8 Merge pull request #4588 from XiWeiGu/loongarch_fixed_dzamax
loongarch: Fixed dzamax
2024-03-29 13:48:27 +01:00
Mark Seminatore
b0ad8a78ff code to fix lost work in case of re-entrant calls to exec_blas_async() 2024-03-28 15:24:52 -07:00
Martin Kroeker
e1638ea43a Merge pull request #4586 from martin-frbg/potrf-para
use atomic acq/rel operations in potrf_parallel as in the corresponding getrf_parallel
2024-03-28 14:51:20 +01:00
Martin Kroeker
2dda40d280 use atomic operations as in the corresponding getrf 2024-03-28 11:33:31 +01:00
Martin Kroeker
68ab5185d0 Update potrf.c 2024-03-27 22:10:01 +01:00
Martin Kroeker
19b29b3448 Update getrf.c 2024-03-27 22:09:30 +01:00
Martin Kroeker
a3354a7630 Cap the number of parallel threads 2024-03-27 22:00:30 +01:00
Martin Kroeker
5da4c93ef2 Cap the number of parallel threads 2024-03-27 20:34:55 +01:00
Martin Kroeker
496106642f Cap the number of parallel threads 2024-03-27 20:32:11 +01:00