Commit Graph

8459 Commits

Author SHA1 Message Date
Martin Kroeker
a5c04e326a Update scal.c 2024-07-04 22:28:01 +02:00
Martin Kroeker
536200bc9e fix handling of INF or NAN 2024-07-04 17:47:19 +02:00
Martin Kroeker
b422742899 collect error output from ctest, if any 2024-07-04 15:42:34 +02:00
Martin Kroeker
bdb6069051 Merge pull request #4775 from martin-frbg/issue4770
Guard against invalid thread_status.queue
2024-07-01 00:35:30 +02:00
Martin Kroeker
4052b312b2 Merge pull request #4763 from ev-br/sync-codspeed
BENCH: sync codspeed-benchmarks with BLAS-benchmarks
2024-07-01 00:18:08 +02:00
Martin Kroeker
3677b3886c Merge pull request #4702 from bashimao/detect-nv-grace
Correctly detect ARM Neoverse V2 CPUs.
2024-06-30 22:48:48 +02:00
Martin Kroeker
d0b9948b23 Guard against invalid thread_status.queue 2024-06-30 19:31:15 +02:00
Martin Kroeker
c33bc849aa Merge pull request #4729 from martin-frbg/issue4728
Fix handling of INF or NAN arguments in S/D/C SCAL
2024-06-29 18:22:03 +02:00
Martin Kroeker
8cbb797199 Merge pull request #4773 from rgommers/update-docs-yml
Update GitHub Actions used in docs.yml
2024-06-29 17:47:17 +02:00
Martin Kroeker
30770c57b9 Merge pull request #4772 from isuruf/rename
fix regex for renaming callbacks
2024-06-29 12:45:18 +02:00
Ralf Gommers
017a26863a Update GitHub Actions used in docs.yml
- Updates to latest versions for all actions
- Pin actions not maintained by GitHub under `github.com/action` to exact
  commit hashes. This is good practice for security, since tags can be
  moved and then arbitrary new code may execute. Since updating commit
  hashes is slightly more hassle, the trade-off made here is to trust
  GitHub, but not third-party actions.

This should also solve the warnings about Node.js 16 actions being
deprecated that are currently visible on the log page when docs.yml
is executed.
2024-06-28 20:12:24 +02:00
Isuru Fernando
e8da541fb4 fix regex for renaming callbacks 2024-06-28 09:48:00 -05:00
Martin Kroeker
f3c364c2cc temporarily(?) disable the alpha=0 branch as it fails to handle INF,NAN 2024-06-27 22:18:27 +02:00
Martin Kroeker
2a5fe97e3b temporarily(?) disable the alpha=0 branch as it does not handle INF,NAN 2024-06-27 16:21:57 +02:00
Evgeni Burovski
28fb95d0be BENCH: actually add gemv/gbmv f2py wrappers 2024-06-27 12:38:47 +03:00
Martin Kroeker
c1019d5832 Handle INF and NAN in inputs 2024-06-27 10:58:59 +02:00
Evgeni Burovski
11a0c56166 BENCH: add BLAS level 2 gemv and gbmv 2024-06-27 11:14:22 +03:00
Evgeni Burovski
400cf9f63d restore the problem sizes for codspeed benchmarks 2024-06-24 16:47:20 +03:00
Evgeni Burovski
37a854718b BENCH: sync codspeed-benchmarks with BLAS-benchmarks 2024-06-24 14:33:06 +03:00
Martin Kroeker
9e24121e7e temporarily(?) disable da=0 shortcut to handle x=Inf or NAN 2024-06-23 17:48:18 +02:00
Martin Kroeker
a11f086c17 Update sscal_msa.c 2024-06-23 12:55:19 +02:00
Martin Kroeker
541e1b6959 disable the fast path for inc=1, alpha=0 as it does not handle x=NaN or Inf 2024-06-23 10:37:55 +02:00
Martin Kroeker
c08113c279 fix special cases of x= NAN or INF 2024-06-23 01:12:33 +02:00
Martin Kroeker
bd47630bcf exclude the alpha=0 branch as it does not handle NaN or Inf in x 2024-06-23 00:54:39 +02:00
Martin Kroeker
68f2501958 temporarily(?) disable the alpha=0 branch to handle Inf/NaN in x 2024-06-22 21:08:57 +02:00
Martin Kroeker
0a744a939a temporarily(?) disable the alpha=0 branch to handle NaN/Inf in x 2024-06-22 21:07:43 +02:00
Martin Kroeker
7f8f037a36 handle INF and NAN in input 2024-06-22 16:03:30 +02:00
Martin Kroeker
f1248b849d handle INF and NAN in input 2024-06-22 15:55:29 +02:00
Martin Kroeker
a2ee4b1966 Merge branch 'OpenMathLib:develop' into issue4728 2024-06-21 09:35:56 +02:00
Martin Kroeker
1ba1b9c357 Merge pull request #4761 from martin-frbg/m1zdot
Add a clobber list to the non-SVE ARM64 ZDOT kernel
2024-06-20 23:31:25 +02:00
Martin Kroeker
3ec59922b6 Add a clobber list to fix utest errors seen with gcc13 on Apple M 2024-06-20 16:19:32 +02:00
Martin Kroeker
7e9a4ba427 Merge pull request #4741 from shivammonaka/Pthread_Scalability_Improvement
Enhancing Core Utilization in BLAS Calls: A Scalable Architecture
2024-06-20 13:36:23 +02:00
Martin Kroeker
0773695a5c Merge pull request #4760 from martin-frbg/zhaoxin7k
Add cpuid for Zhaoxin KX-7000
2024-06-20 11:08:30 +02:00
Martin Kroeker
9b2a0c79cb Add Zhaoxin KX7000 2024-06-20 09:23:08 +02:00
Martin Kroeker
758279605f Add support for Zhaoxin KX7000 2024-06-20 09:21:06 +02:00
Martin Kroeker
18063b1ccd Merge pull request #4757 from martin-frbg/lapack1024
Fix possible infinite loop on error in the LAPACK testsuite (Reference-LAPACK PR 1024)
2024-06-19 10:07:38 +02:00
Martin Kroeker
215279662e fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:21:33 +02:00
Martin Kroeker
a9817b4212 fix reference in format (Reference-LAPACK PR 1024) 2024-06-18 11:20:22 +02:00
Martin Kroeker
bf521a2ced fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:18:04 +02:00
Martin Kroeker
cf2962bdb5 fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:15:44 +02:00
Martin Kroeker
33bb4b98a4 Improve error message output from the fork() utest (#4753)
* Add perror to report the reason for a fork failure
* reword the malloc failure message
2024-06-15 14:16:48 +02:00
Martin Kroeker
f13403b6b6 Merge pull request #4755 from martin-frbg/issue4739
Fix Intel oneAPI compiler support in the CMAKE build
2024-06-15 12:26:18 +02:00
Martin Kroeker
8bc37f9384 Merge pull request #4754 from martin-frbg/issue4750-2
Add a clobber list to the arm64 SVE DOT kernel
2024-06-15 10:29:03 +02:00
Martin Kroeker
d25ee4d0f5 Fix detection of Intel ifx and apply -fp-model option to it 2024-06-14 23:58:45 +02:00
Martin Kroeker
21c0f769ef ensure that cpu-specific -march options are always applied to icx 2024-06-14 23:54:27 +02:00
Martin Kroeker
3d8054fb16 add clobber list 2024-06-14 22:07:44 +02:00
Martin Kroeker
fdb88e010f Merge pull request #4749 from XiWeiGu/loongarch64-qemu-update
LoongArch64: Update QEMU
2024-06-14 17:19:14 +02:00
Martin Kroeker
dd7efcf9ef Avoid exceeding the configured thread count in x86_64 TOBF16 (#4748)
* avoid setting nthreads higher than available
2024-06-14 14:21:13 +02:00
guxiwei
ed5db5b122 LoongArch64: Update the address for obtaining the Clang cross-toolchain
Improve the stability and speed of testing
2024-06-13 11:25:01 +08:00
guxiwei
1ca1bb829d LoongArch64: Update QEMU
Compile the community version of QEMU to support LSX/LASX extension instructions
2024-06-13 11:24:32 +08:00