Martin Kroeker
bdb6069051
Merge pull request #4775 from martin-frbg/issue4770
...
Guard against invalid thread_status.queue
2024-07-01 00:35:30 +02:00
Martin Kroeker
4052b312b2
Merge pull request #4763 from ev-br/sync-codspeed
...
BENCH: sync codspeed-benchmarks with BLAS-benchmarks
2024-07-01 00:18:08 +02:00
Martin Kroeker
3677b3886c
Merge pull request #4702 from bashimao/detect-nv-grace
...
Correctly detect ARM Neoverse V2 CPUs.
2024-06-30 22:48:48 +02:00
Martin Kroeker
d0b9948b23
Guard against invalid thread_status.queue
2024-06-30 19:31:15 +02:00
Ralf Gommers
ca9a0c28e8
docs: improve extensions page
2024-06-30 18:04:12 +02:00
Ralf Gommers
3eba16c583
docs: improve the Developer manual
2024-06-30 18:04:08 +02:00
Ralf Gommers
a8e1ff84ce
docs: improve the "Build system" page
2024-06-30 16:27:54 +02:00
Ralf Gommers
c1b9bb8519
docs: improvements to the User Manual
2024-06-30 16:18:24 +02:00
Ralf Gommers
237c2c4130
docs: fix footnote rendering on "Redistributing OpenBLAS" page
2024-06-30 16:09:00 +02:00
Ralf Gommers
3a8e72c0e6
docs: improve the "About" documentation page
2024-06-30 16:08:56 +02:00
Martin Kroeker
c33bc849aa
Merge pull request #4729 from martin-frbg/issue4728
...
Fix handling of INF or NAN arguments in S/D/C SCAL
2024-06-29 18:22:03 +02:00
Martin Kroeker
8cbb797199
Merge pull request #4773 from rgommers/update-docs-yml
...
Update GitHub Actions used in docs.yml
2024-06-29 17:47:17 +02:00
Martin Kroeker
30770c57b9
Merge pull request #4772 from isuruf/rename
...
fix regex for renaming callbacks
2024-06-29 12:45:18 +02:00
Ralf Gommers
97df47644d
Improve html theme: dark mode, nicer color scheme, icons for github/license, etc.
2024-06-29 09:22:54 +02:00
Ralf Gommers
017a26863a
Update GitHub Actions used in docs.yml
...
- Updates to latest versions for all actions
- Pin actions not maintained by GitHub under `github.com/action` to exact
commit hashes. This is good practice for security, since tags can be
moved and then arbitray new code may execute. Since updating commit
hashes is slightly more hassle, the trade-off made here is to trust
GitHub, but not third-party actions.
This should also solve the warnings about Node.js 16 actions being
deprecated that is currently visible on the log page when docs.yml
is executed.
2024-06-28 20:12:24 +02:00
Isuru Fernando
e8da541fb4
fix regex for renaming callbacks
2024-06-28 09:48:00 -05:00
Pol Dellaiera
1b52e3dcfa
openblas: fix `BUFFERSIZE` value
2024-06-28 08:33:38 +02:00
Martin Kroeker
f3c364c2cc
temporarily(?) disable the alpha=0 branch as it fails to handle INF,NAN
2024-06-27 22:18:27 +02:00
Martin Kroeker
2a5fe97e3b
temporarily(?) disable the alpha=0 branch as it does not handle INF,NAN
2024-06-27 16:21:57 +02:00
Evgeni Burovski
28fb95d0be
BENCH: actually add gemv/gbmv f2py wrappers
2024-06-27 12:38:47 +03:00
Martin Kroeker
c1019d5832
Handle INF and NAN in inputs
2024-06-27 10:58:59 +02:00
Evgeni Burovski
11a0c56166
BENCH: add BLAS level 2 gemv and gbmv
2024-06-27 11:14:22 +03:00
Evgeni Burovski
400cf9f63d
restore the problem sizes for codspeed benchmarks
2024-06-24 16:47:20 +03:00
Evgeni Burovski
37a854718b
BENCH: sync codspeed-benchmarks with BLAS-benchmarks
2024-06-24 14:33:06 +03:00
Chris Sidebottom
8c472ef7e3
Further tweak small GEMM for AArch64
2024-06-24 10:47:47 +01:00
Martin Kroeker
9e24121e7e
temporarily(?) disable da=0 shortcut to handle x=Inf or NAN
2024-06-23 17:48:18 +02:00
Martin Kroeker
a11f086c17
Update sscal_msa.c
2024-06-23 12:55:19 +02:00
Martin Kroeker
541e1b6959
disable the fast path for inc=1, alpha=0 as it does not handle x=NaN or Inf
2024-06-23 10:37:55 +02:00
Martin Kroeker
c08113c279
fix special cases of x= NAN or INF
2024-06-23 01:12:33 +02:00
Martin Kroeker
bd47630bcf
exclude the alpha=0 branch as it does not handle NaN or Inf in x
2024-06-23 00:54:39 +02:00
Martin Kroeker
68f2501958
temporarily(?) disable the alpha=0 branch to handle Inf/NaN in x
2024-06-22 21:08:57 +02:00
Martin Kroeker
0a744a939a
temporarily(?) disable the alpha=0 branch to handle NaN/Inf in x
2024-06-22 21:07:43 +02:00
Martin Kroeker
7f8f037a36
handle INF and NAN in input
2024-06-22 16:03:30 +02:00
Martin Kroeker
f1248b849d
handle INF and NAN in input
2024-06-22 15:55:29 +02:00
Martin Kroeker
a2ee4b1966
Merge branch 'OpenMathLib:develop' into issue4728
2024-06-21 09:35:56 +02:00
Martin Kroeker
1ba1b9c357
Merge pull request #4761 from martin-frbg/m1zdot
...
Add a clobber list to the non-SVE ARM64 ZDOT kernel
2024-06-20 23:31:25 +02:00
Martin Kroeker
3ec59922b6
Add a clobber list to fix utest errors seen with gcc13 on Apple M
2024-06-20 16:19:32 +02:00
Martin Kroeker
7e9a4ba427
Merge pull request #4741 from shivammonaka/Pthread_Scalability_Improvement
...
Enhancing Core Utilization in BLAS Calls: A Scalable Architecture
2024-06-20 13:36:23 +02:00
Martin Kroeker
0773695a5c
Merge pull request #4760 from martin-frbg/zhaoxin7k
...
Add cpuid for Zhaoxin KX-7000
2024-06-20 11:08:30 +02:00
Martin Kroeker
9b2a0c79cb
Add Zhaoxin KX7000
2024-06-20 09:23:08 +02:00
Martin Kroeker
758279605f
Add support forZhaoxin KX7000
2024-06-20 09:21:06 +02:00
Martin Kroeker
18063b1ccd
Merge pull request #4757 from martin-frbg/lapack1024
...
Fix possible infinite loop on error in the LAPACK testsuite (Reference-LAPACK PR 1024)
2024-06-19 10:07:38 +02:00
Martin Kroeker
215279662e
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:21:33 +02:00
Martin Kroeker
a9817b4212
fix reference in format (Reference-LAPACK PR 1024)
2024-06-18 11:20:22 +02:00
Martin Kroeker
bf521a2ced
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:18:04 +02:00
Martin Kroeker
cf2962bdb5
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:15:44 +02:00
Martin Kroeker
33bb4b98a4
Improve error message output from the fork() utest ( #4753 )
...
* Add perror to report the reason for a fork failure
* reword the malloc failure message
2024-06-15 14:16:48 +02:00
Martin Kroeker
f13403b6b6
Merge pull request #4755 from martin-frbg/issue4739
...
Fix Intel oneAPI compiler support in the CMAKE build
2024-06-15 12:26:18 +02:00
Martin Kroeker
8bc37f9384
Merge pull request #4754 from martin-frbg/issue4750-2
...
Add a clobber list to the arm64 SVE DOT kernel
2024-06-15 10:29:03 +02:00
Martin Kroeker
d25ee4d0f5
Fix detection of Intel ifx and apply -fp-model option to it
2024-06-14 23:58:45 +02:00