Ralf Gommers
017a26863a
Update GitHub Actions used in docs.yml
...
- Updates to latest versions for all actions
- Pin actions not maintained by GitHub under `github.com/actions` to exact
commit hashes. This is good practice for security, since tags can be
moved and then arbitrary new code may execute. Since updating commit
hashes is slightly more hassle, the trade-off made here is to trust
GitHub, but not third-party actions.
This should also resolve the warnings about deprecated Node.js 16
actions that are currently visible on the log page when docs.yml
is executed.
2024-06-28 20:12:24 +02:00
Martin Kroeker
1ba1b9c357
Merge pull request #4761 from martin-frbg/m1zdot
...
Add a clobber list to the non-SVE ARM64 ZDOT kernel
2024-06-20 23:31:25 +02:00
Martin Kroeker
3ec59922b6
Add a clobber list to fix utest errors seen with gcc13 on Apple M
2024-06-20 16:19:32 +02:00
Martin Kroeker
7e9a4ba427
Merge pull request #4741 from shivammonaka/Pthread_Scalability_Improvement
...
Enhancing Core Utilization in BLAS Calls: A Scalable Architecture
2024-06-20 13:36:23 +02:00
Martin Kroeker
0773695a5c
Merge pull request #4760 from martin-frbg/zhaoxin7k
...
Add cpuid for Zhaoxin KX-7000
2024-06-20 11:08:30 +02:00
Martin Kroeker
9b2a0c79cb
Add Zhaoxin KX7000
2024-06-20 09:23:08 +02:00
Martin Kroeker
758279605f
Add support for Zhaoxin KX7000
2024-06-20 09:21:06 +02:00
Martin Kroeker
18063b1ccd
Merge pull request #4757 from martin-frbg/lapack1024
...
Fix possible infinite loop on error in the LAPACK testsuite (Reference-LAPACK PR 1024)
2024-06-19 10:07:38 +02:00
Martin Kroeker
215279662e
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:21:33 +02:00
Martin Kroeker
a9817b4212
fix reference in format (Reference-LAPACK PR 1024)
2024-06-18 11:20:22 +02:00
Martin Kroeker
bf521a2ced
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:18:04 +02:00
Martin Kroeker
cf2962bdb5
fix possible infinite loop on error (Reference-LAPACK PR 1024)
2024-06-18 11:15:44 +02:00
Martin Kroeker
33bb4b98a4
Improve error message output from the fork() utest (#4753)
...
* Add perror to report the reason for a fork failure
* reword the malloc failure message
2024-06-15 14:16:48 +02:00
Martin Kroeker
f13403b6b6
Merge pull request #4755 from martin-frbg/issue4739
...
Fix Intel oneAPI compiler support in the CMAKE build
2024-06-15 12:26:18 +02:00
Martin Kroeker
8bc37f9384
Merge pull request #4754 from martin-frbg/issue4750-2
...
Add a clobber list to the arm64 SVE DOT kernel
2024-06-15 10:29:03 +02:00
Martin Kroeker
d25ee4d0f5
Fix detection of Intel ifx and apply -fp-model option to it
2024-06-14 23:58:45 +02:00
Martin Kroeker
21c0f769ef
ensure that cpu-specific -march options are always applied to icx
2024-06-14 23:54:27 +02:00
Martin Kroeker
3d8054fb16
add clobber list
2024-06-14 22:07:44 +02:00
Martin Kroeker
fdb88e010f
Merge pull request #4749 from XiWeiGu/loongarch64-qemu-update
...
LoongArch64: Update QEMU
2024-06-14 17:19:14 +02:00
Martin Kroeker
dd7efcf9ef
Avoid exceeding the configured thread count in x86_64 TOBF16 (#4748)
...
* avoid setting nthreads higher than available
2024-06-14 14:21:13 +02:00
guxiwei
ed5db5b122
LoongArch64: Update the address for obtaining the Clang cross-toolchain
...
Improve the stability and speed of testing
2024-06-13 11:25:01 +08:00
guxiwei
1ca1bb829d
LoongArch64: Update QEMU
...
Compile the community version of QEMU to support LSX/LASX extension instructions
2024-06-13 11:24:32 +08:00
Martin Kroeker
62c33db37d
Merge pull request #4746 from martin-frbg/issue4743
...
Correct CMAKE build definitions for CAXPYC/ZAXPYC
2024-06-09 22:44:50 +02:00
Martin Kroeker
2f12a47405
fix build options for CAXPYC/ZAXPYC
2024-06-09 20:32:10 +02:00
shivammonaka
9e22d70957
Dynamic locking in Pthread Backend to allow multiple BLAS calls to be executed in parallel
2024-06-07 08:40:17 +05:30
Martin Kroeker
f955616f98
Merge pull request #4740 from martin-frbg/fixlapackmod
...
remove LAPACK .mod files during make clean
2024-06-06 23:22:31 +02:00
Martin Kroeker
f96ee86711
remove .mod files during make clean
2024-06-06 21:17:36 +02:00
Martin Kroeker
442dec28df
Merge pull request #4738 from martin-frbg/issue4737
...
Disable GEMM3M for generic targets (not implemented)
2024-06-06 17:22:38 +02:00
Martin Kroeker
0cf8b98f61
Merge pull request #4736 from XiWeiGu/loongarch_issue4728
...
LoongArch: Fixed issue 4728
2024-06-06 15:28:44 +02:00
Martin Kroeker
2787c9f8e4
Disable GEMM3M for generic targets (not implemented)
2024-06-06 14:39:50 +02:00
gxw
af73ae6208
LoongArch: Fixed issue 4728
2024-06-06 16:43:09 +08:00
Martin Kroeker
4e9144b39f
Update .cirrus.yml (#4735)
...
* Update versions (and paths) of XCode, and update FreeBSD version
2024-06-05 23:43:52 +02:00
Martin Kroeker
0c2ac76a63
Merge pull request #4734 from XiWeiGu/loongarch64_small_matrix
...
LoongArch: DGEMM small matrix opt
2024-06-05 19:13:17 +02:00
Martin Kroeker
913be34bf0
Merge pull request #4733 from martin-frbg/issue4719
...
Drop the -static Fortran flag from generic RISCV builds as it breaks OpenMP
2024-06-05 00:11:09 +02:00
gxw
8ab2e9ec65
LoongArch: DGEMM small matrix opt
2024-06-04 16:52:45 +08:00
Martin Kroeker
df87aeb5a2
Drop the -static Fortran flag from generic builds as it breaks OpenMP
2024-06-04 09:49:18 +02:00
Martin Kroeker
3a3ff1ba5e
Merge pull request #4732 from martin-frbg/issue4731
...
fix conflicting types for cblas_sbgemm_batch
2024-06-03 21:18:28 +02:00
Martin Kroeker
db9f7bc552
fix float array types to include bfloat16
2024-06-03 00:22:16 +02:00
Martin Kroeker
a9fae32a33
Merge pull request #4730 from jake-arkinstall/develop
...
Updated CONTRIBUTORS.md
2024-06-01 13:38:04 +02:00
Jake Arkinstall
44004178aa
Updated CONTRIBUTORS.md
...
As requested on X (https://x.com/KroekerMartin/status/1755218919290278185)
2024-06-01 11:22:26 +01:00
Martin Kroeker
83bc8d5dd8
Merge pull request #4712 from RajalakshmiSR/zscalp10
...
POWER: Fix issues in zscal to address lapack failures
2024-06-01 11:22:08 +02:00
Martin Kroeker
56bd57ca99
Merge pull request #4720 from martin-frbg/issue3039
...
Resurrect and complete cblas_?gemm_batch
2024-06-01 00:34:32 +02:00
Martin Kroeker
6b564d53fd
Merge pull request #4727 from martin-frbg/issue4726
...
Fix another corner case of infinity handling in x86_64 ZSCAL
2024-05-31 19:44:33 +02:00
Martin Kroeker
db070a9223
add gemm_batch drivers
2024-05-31 18:29:27 +02:00
Martin Kroeker
076766df4e
Update CMakeLists.txt
2024-05-31 18:23:18 +02:00
Martin Kroeker
8c05765a5a
fix other corner cases where x=INF
2024-05-31 18:06:36 +02:00
Martin Kroeker
516743f7dc
fix other instances of mishandling INF
2024-05-31 16:02:12 +02:00
Martin Kroeker
9ff4e9714e
additional fixes for handling INF arguments
2024-05-31 15:44:07 +02:00
Martin Kroeker
ce130f11d2
Update zscal.c
2024-05-31 15:09:03 +02:00
Martin Kroeker
ab13cfef93
more fixes for infinite x
2024-05-31 14:34:49 +02:00