Commit Graph

8475 Commits

Author SHA1 Message Date
Martin Kroeker
e1eef56e05 Merge pull request #4783 from martin-frbg/cpuid_meteor
Add another CPUID for Intel Meteor Lake
2024-07-04 18:09:27 +02:00
Martin Kroeker
3063d03021 Add another CPUID for Meteor Lake 2024-07-04 16:05:05 +02:00
Martin Kroeker
f729013d2e Merge pull request #4781 from rgommers/fix-docs-deployment
fix CI job to deploy docs, and make it run on pull requests too
2024-07-03 21:00:18 +02:00
Ralf Gommers
6ede8b14c6 ci: fix CI job to deploy docs, and make it run on pull requests too 2024-07-03 20:14:02 +02:00
Martin Kroeker
9836883ee9 Merge pull request #4780 from martin-frbg/azureosx12
AzureCI: Update OSX jobs to use the macos-12 image
2024-07-03 19:53:05 +02:00
Martin Kroeker
df81b159e8 Merge pull request #4774 from rgommers/improve-docs
Improve documentation content, formatting, and html theme
2024-07-03 17:10:44 +02:00
Martin Kroeker
2df4007425 Update compiler and sdk versions for osx 2024-07-03 16:48:43 +02:00
Martin Kroeker
acf0c3ccaf Merge pull request #4777 from ev-br/sgesdd_ci_err
ignore the gesdd failure on codspeed
2024-07-03 15:21:33 +02:00
Martin Kroeker
74f059a3ce Update OSX jobs to use the macos-12 image 2024-07-03 13:24:02 +02:00
Evgeni Burovski
cd3c167c28 ignore sgesdd failure on codspeed
In https://github.com/OpenMathLib/OpenBLAS/issues/4776
we're hitting
** On entry to SLASCL parameter number  4 had an illegal value

on codspeed, but not outside (either locally or on github runners)
2024-07-03 12:35:26 +03:00
Evgeni Burovski
5b385fd453 WIP: fish out the gesdd failure? 2024-07-02 20:10:26 +03:00
Ralf Gommers
c1c0dbfd60 docs: address review comments on PR 4774 2024-07-02 14:05:47 +02:00
Martin Kroeker
bdb6069051 Merge pull request #4775 from martin-frbg/issue4770
Guard against invalid thread_status.queue
2024-07-01 00:35:30 +02:00
Martin Kroeker
4052b312b2 Merge pull request #4763 from ev-br/sync-codspeed
BENCH: sync codspeed-benchmarks with BLAS-benchmarks
2024-07-01 00:18:08 +02:00
Martin Kroeker
3677b3886c Merge pull request #4702 from bashimao/detect-nv-grace
Correctly detect ARM Neoverse V2 CPUs.
2024-06-30 22:48:48 +02:00
Martin Kroeker
d0b9948b23 Guard against invalid thread_status.queue 2024-06-30 19:31:15 +02:00
Ralf Gommers
ca9a0c28e8 docs: improve extensions page 2024-06-30 18:04:12 +02:00
Ralf Gommers
3eba16c583 docs: improve the Developer manual 2024-06-30 18:04:08 +02:00
Ralf Gommers
a8e1ff84ce docs: improve the "Build system" page 2024-06-30 16:27:54 +02:00
Ralf Gommers
c1b9bb8519 docs: improvements to the User Manual 2024-06-30 16:18:24 +02:00
Ralf Gommers
237c2c4130 docs: fix footnote rendering on "Redistributing OpenBLAS" page 2024-06-30 16:09:00 +02:00
Ralf Gommers
3a8e72c0e6 docs: improve the "About" documentation page 2024-06-30 16:08:56 +02:00
Martin Kroeker
c33bc849aa Merge pull request #4729 from martin-frbg/issue4728
Fix handling of INF or NAN arguments in S/D/C SCAL
2024-06-29 18:22:03 +02:00
Martin Kroeker
8cbb797199 Merge pull request #4773 from rgommers/update-docs-yml
Update GitHub Actions used in docs.yml
2024-06-29 17:47:17 +02:00
Martin Kroeker
30770c57b9 Merge pull request #4772 from isuruf/rename
fix regex for renaming callbacks
2024-06-29 12:45:18 +02:00
Ralf Gommers
97df47644d Improve html theme: dark mode, nicer color scheme, icons for github/license, etc. 2024-06-29 09:22:54 +02:00
Ralf Gommers
017a26863a Update GitHub Actions used in docs.yml
- Updates to latest versions for all actions
- Pin actions not maintained by GitHub under `github.com/action` to exact
  commit hashes. This is good practice for security, since tags can be
  moved and then arbitrary new code may execute. Since updating commit
  hashes is slightly more hassle, the trade-off made here is to trust
  GitHub, but not third-party actions.

This should also solve the warnings about Node.js 16 actions being
deprecated that are currently visible on the log page when docs.yml
is executed.
2024-06-28 20:12:24 +02:00
Isuru Fernando
e8da541fb4 fix regex for renaming callbacks 2024-06-28 09:48:00 -05:00
Martin Kroeker
f3c364c2cc temporarily(?) disable the alpha=0 branch as it fails to handle INF,NAN 2024-06-27 22:18:27 +02:00
Martin Kroeker
2a5fe97e3b temporarily(?) disable the alpha=0 branch as it does not handle INF,NAN 2024-06-27 16:21:57 +02:00
Evgeni Burovski
28fb95d0be BENCH: actually add gemv/gbmv f2py wrappers 2024-06-27 12:38:47 +03:00
Martin Kroeker
c1019d5832 Handle INF and NAN in inputs 2024-06-27 10:58:59 +02:00
Evgeni Burovski
11a0c56166 BENCH: add BLAS level 2 gemv and gbmv 2024-06-27 11:14:22 +03:00
Evgeni Burovski
400cf9f63d restore the problem sizes for codspeed benchmarks 2024-06-24 16:47:20 +03:00
Evgeni Burovski
37a854718b BENCH: sync codspeed-benchmarks with BLAS-benchmarks 2024-06-24 14:33:06 +03:00
Martin Kroeker
9e24121e7e temporarily(?) disable da=0 shortcut to handle x=Inf or NAN 2024-06-23 17:48:18 +02:00
Martin Kroeker
a11f086c17 Update sscal_msa.c 2024-06-23 12:55:19 +02:00
Martin Kroeker
541e1b6959 disable the fast path for inc=1, alpha=0 as it does not handle x=NaN or Inf 2024-06-23 10:37:55 +02:00
Martin Kroeker
c08113c279 fix special cases of x= NAN or INF 2024-06-23 01:12:33 +02:00
Martin Kroeker
bd47630bcf exclude the alpha=0 branch as it does not handle NaN or Inf in x 2024-06-23 00:54:39 +02:00
Martin Kroeker
68f2501958 temporarily(?) disable the alpha=0 branch to handle Inf/NaN in x 2024-06-22 21:08:57 +02:00
Martin Kroeker
0a744a939a temporarily(?) disable the alpha=0 branch to handle NaN/Inf in x 2024-06-22 21:07:43 +02:00
Martin Kroeker
7f8f037a36 handle INF and NAN in input 2024-06-22 16:03:30 +02:00
Martin Kroeker
f1248b849d handle INF and NAN in input 2024-06-22 15:55:29 +02:00
Martin Kroeker
a2ee4b1966 Merge branch 'OpenMathLib:develop' into issue4728 2024-06-21 09:35:56 +02:00
Martin Kroeker
1ba1b9c357 Merge pull request #4761 from martin-frbg/m1zdot
Add a clobber list to the non-SVE ARM64 ZDOT kernel
2024-06-20 23:31:25 +02:00
Martin Kroeker
3ec59922b6 Add a clobber list to fix utest errors seen with gcc13 on Apple M 2024-06-20 16:19:32 +02:00
Martin Kroeker
7e9a4ba427 Merge pull request #4741 from shivammonaka/Pthread_Scalability_Improvement
Enhancing Core Utilization in BLAS Calls: A Scalable Architecture
2024-06-20 13:36:23 +02:00
Martin Kroeker
0773695a5c Merge pull request #4760 from martin-frbg/zhaoxin7k
Add cpuid for Zhaoxin KX-7000
2024-06-20 11:08:30 +02:00
Martin Kroeker
9b2a0c79cb Add Zhaoxin KX7000 2024-06-20 09:23:08 +02:00