Commit Graph

8417 Commits

Author SHA1 Message Date
Martin Kroeker
9b2a0c79cb Add Zhaoxin KX7000 2024-06-20 09:23:08 +02:00
Martin Kroeker
758279605f Add support for Zhaoxin KX7000 2024-06-20 09:21:06 +02:00
Martin Kroeker
18063b1ccd Merge pull request #4757 from martin-frbg/lapack1024
Fix possible infinite loop on error in the LAPACK testsuite (Reference-LAPACK PR 1024)
2024-06-19 10:07:38 +02:00
Martin Kroeker
215279662e fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:21:33 +02:00
Martin Kroeker
a9817b4212 fix reference in format (Reference-LAPACK PR 1024) 2024-06-18 11:20:22 +02:00
Martin Kroeker
bf521a2ced fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:18:04 +02:00
Martin Kroeker
cf2962bdb5 fix possible infinite loop on error (Reference-LAPACK PR 1024) 2024-06-18 11:15:44 +02:00
Martin Kroeker
33bb4b98a4 Improve error message output from the fork() utest (#4753)
* Add perror to report the reason for a fork failure
* reword the malloc failure message
2024-06-15 14:16:48 +02:00
Martin Kroeker
f13403b6b6 Merge pull request #4755 from martin-frbg/issue4739
Fix Intel oneAPI compiler support in the CMAKE build
2024-06-15 12:26:18 +02:00
Martin Kroeker
8bc37f9384 Merge pull request #4754 from martin-frbg/issue4750-2
Add a clobber list to the arm64 SVE DOT kernel
2024-06-15 10:29:03 +02:00
Martin Kroeker
d25ee4d0f5 Fix detection of Intel ifx and apply -fp-model option to it 2024-06-14 23:58:45 +02:00
Martin Kroeker
21c0f769ef ensure that cpu-specific -march options are always applied to icx 2024-06-14 23:54:27 +02:00
Martin Kroeker
3d8054fb16 add clobber list 2024-06-14 22:07:44 +02:00
Martin Kroeker
fdb88e010f Merge pull request #4749 from XiWeiGu/loongarch64-qemu-update
LoongArch64: Update QEMU
2024-06-14 17:19:14 +02:00
Martin Kroeker
dd7efcf9ef Avoid exceeding the configured thread count in x86_64 TOBF16 (#4748)
* avoid setting nthreads higher than available
2024-06-14 14:21:13 +02:00
guxiwei
ed5db5b122 LoongArch64: Update the address for obtaining the Clang cross-toolchain
Improve the stability and speed of testing
2024-06-13 11:25:01 +08:00
guxiwei
1ca1bb829d LoongArch64: Update QEMU
Compile the community version of QEMU to support LSX/LASX extension instructions
2024-06-13 11:24:32 +08:00
Martin Kroeker
62c33db37d Merge pull request #4746 from martin-frbg/issue4743
Correct CMAKE build definitions for CAXPYC/ZAXPYC
2024-06-09 22:44:50 +02:00
Martin Kroeker
2f12a47405 fix build options for CAXPYC/ZAXPYC 2024-06-09 20:32:10 +02:00
Martin Kroeker
f955616f98 Merge pull request #4740 from martin-frbg/fixlapackmod
remove LAPACK .mod files during make clean
2024-06-06 23:22:31 +02:00
Martin Kroeker
f96ee86711 remove .mod files during make clean 2024-06-06 21:17:36 +02:00
Martin Kroeker
442dec28df Merge pull request #4738 from martin-frbg/issue4737
Disable GEMM3M for generic targets (not implemented)
2024-06-06 17:22:38 +02:00
Martin Kroeker
0cf8b98f61 Merge pull request #4736 from XiWeiGu/loongarch_issue4728
LoongArch: Fixed issue 4728
2024-06-06 15:28:44 +02:00
Martin Kroeker
2787c9f8e4 Disable GEMM3M for generic targets (not implemented) 2024-06-06 14:39:50 +02:00
gxw
af73ae6208 LoongArch: Fixed issue 4728 2024-06-06 16:43:09 +08:00
Martin Kroeker
4e9144b39f Update .cirrus.yml (#4735)
* Update versions (and paths) of XCode, and update FreeBSD version
2024-06-05 23:43:52 +02:00
Martin Kroeker
0c2ac76a63 Merge pull request #4734 from XiWeiGu/loongarch64_small_matrix
LoongArch: DGEMM small matrix opt
2024-06-05 19:13:17 +02:00
Martin Kroeker
913be34bf0 Merge pull request #4733 from martin-frbg/issue4719
Drop the -static Fortran flag from generic RISCV builds as it breaks OpenMP
2024-06-05 00:11:09 +02:00
gxw
8ab2e9ec65 LoongArch: DGEMM small matrix opt 2024-06-04 16:52:45 +08:00
Martin Kroeker
df87aeb5a2 Drop the -static Fortran flag from generic builds as it breaks OpenMP 2024-06-04 09:49:18 +02:00
Martin Kroeker
3a3ff1ba5e Merge pull request #4732 from martin-frbg/issue4731
fix conflicting types for cblas_sbgemm_batch
2024-06-03 21:18:28 +02:00
Martin Kroeker
db9f7bc552 fix float array types to include bfloat16 2024-06-03 00:22:16 +02:00
Martin Kroeker
a9fae32a33 Merge pull request #4730 from jake-arkinstall/develop
Updated CONTRIBUTORS.md
2024-06-01 13:38:04 +02:00
Jake Arkinstall
44004178aa Updated CONTRIBUTORS.md
As requested on X (https://x.com/KroekerMartin/status/1755218919290278185)
2024-06-01 11:22:26 +01:00
Martin Kroeker
83bc8d5dd8 Merge pull request #4712 from RajalakshmiSR/zscalp10
POWER: Fix issues in zscal to address lapack failures
2024-06-01 11:22:08 +02:00
Martin Kroeker
56bd57ca99 Merge pull request #4720 from martin-frbg/issue3039
Resurrect and complete cblas_?gemm_batch
2024-06-01 00:34:32 +02:00
Martin Kroeker
6b564d53fd Merge pull request #4727 from martin-frbg/issue4726
Fix another corner case of infinity handling in x86_64 ZSCAL
2024-05-31 19:44:33 +02:00
Martin Kroeker
db070a9223 add gemm_batch drivers 2024-05-31 18:29:27 +02:00
Martin Kroeker
076766df4e Update CMakeLists.txt 2024-05-31 18:23:18 +02:00
Martin Kroeker
8c05765a5a fix other corner cases where x=INF 2024-05-31 18:06:36 +02:00
Martin Kroeker
516743f7dc fix other instances of mishandling INF 2024-05-31 16:02:12 +02:00
Martin Kroeker
9ff4e9714e additional fixes for handling INF arguments 2024-05-31 15:44:07 +02:00
Martin Kroeker
ce130f11d2 Update zscal.c 2024-05-31 15:09:03 +02:00
Martin Kroeker
ab13cfef93 more fixes for infinite x 2024-05-31 14:34:49 +02:00
Martin Kroeker
a16f8249ba add tests with the imaginary part of the array infinite 2024-05-31 01:08:17 +02:00
Martin Kroeker
ad2b5c67c8 fix another corner case involving infinity 2024-05-31 01:06:58 +02:00
Martin Kroeker
0d007adb18 fix clang_cl-flang job to use flang-new after the llvm update 2024-05-30 23:30:16 +02:00
Martin Kroeker
b9a1c9a06c Merge pull request #4725 from Neumann-A/patch-1
Fix CMake warning
2024-05-30 21:32:32 +02:00
Martin Kroeker
ff6670cb83 don't generate non-cblas files for gemm_batch 2024-05-30 18:26:02 +02:00
Alexander Neumann
dd4505c5dd Fix CMake warning 2024-05-30 09:04:23 +02:00