Martin Kroeker
|
21c0f769ef
|
ensure that cpu-specific -march options are always applied to icx
|
2024-06-14 23:54:27 +02:00 |
Martin Kroeker
|
3d8054fb16
|
add clobber list
|
2024-06-14 22:07:44 +02:00 |
Martin Kroeker
|
fdb88e010f
|
Merge pull request #4749 from XiWeiGu/loongarch64-qemu-update
LoongArch64: Update QEMU
|
2024-06-14 17:19:14 +02:00 |
Martin Kroeker
|
dd7efcf9ef
|
Avoid exceeding the configured thread count in x86_64 TOBF16 (#4748)
* avoid setting nthreads higher than available
|
2024-06-14 14:21:13 +02:00 |
guxiwei
|
ed5db5b122
|
LoongArch64: Update the address for obtaining the Clang cross-toolchain
Improve the stability and speed of testing
|
2024-06-13 11:25:01 +08:00 |
guxiwei
|
1ca1bb829d
|
LoongArch64: Update QEMU
Compile the community version of QEMU to support LSX/LASX extension instructions
|
2024-06-13 11:24:32 +08:00 |
Martin Kroeker
|
62c33db37d
|
Merge pull request #4746 from martin-frbg/issue4743
Correct CMAKE build definitions for CAXPYC/ZAXPYC
|
2024-06-09 22:44:50 +02:00 |
Martin Kroeker
|
2f12a47405
|
fix build options for CAXPYC/ZAXPYC
|
2024-06-09 20:32:10 +02:00 |
Martin Kroeker
|
6ffaf99817
|
disable da=0 shortcut to handle NAN and INF correctly
|
2024-06-07 14:46:58 +02:00 |
Martin Kroeker
|
c7cacd9b38
|
disable the shortcut for da=0 to ensure proper handling of INF and NAN
|
2024-06-07 13:48:56 +02:00 |
Martin Kroeker
|
5ed4f24d6e
|
Handle corner cases with INF and NAN arguments
|
2024-06-07 09:39:08 +02:00 |
shivammonaka
|
9e22d70957
|
Dynamic locking in Pthread Backend to allow multiple BLAS calls to be executed parallelly
|
2024-06-07 08:40:17 +05:30 |
Martin Kroeker
|
2bd43ad0eb
|
Merge branch 'OpenMathLib:develop' into issue4728
|
2024-06-07 00:37:25 +02:00 |
Martin Kroeker
|
1abafcd9b2
|
handle corner cases involving NAN and/or INF
|
2024-06-06 23:59:43 +02:00 |
Martin Kroeker
|
ffc1ab3f6e
|
Test corner cases of all SCAL variants
|
2024-06-06 23:58:16 +02:00 |
Martin Kroeker
|
f955616f98
|
Merge pull request #4740 from martin-frbg/fixlapackmod
remove LAPACK .mod files during make clean
|
2024-06-06 23:22:31 +02:00 |
Martin Kroeker
|
f96ee86711
|
remove .mod files during make clean
|
2024-06-06 21:17:36 +02:00 |
Martin Kroeker
|
442dec28df
|
Merge pull request #4738 from martin-frbg/issue4737
Disable GEMM3M for generic targets (not implemented)
|
2024-06-06 17:22:38 +02:00 |
Martin Kroeker
|
0cf8b98f61
|
Merge pull request #4736 from XiWeiGu/loongarch_issue4728
LoongArch: Fixed issue 4728
|
2024-06-06 15:28:44 +02:00 |
Martin Kroeker
|
2787c9f8e4
|
Disable GEMM3M for generic targets (not implemented)
|
2024-06-06 14:39:50 +02:00 |
gxw
|
af73ae6208
|
LoongArch: Fixed issue 4728
|
2024-06-06 16:43:09 +08:00 |
Martin Kroeker
|
4e9144b39f
|
Update .cirrus.yml (#4735)
* Update versions (and paths) of XCode, and update FreeBSD version
|
2024-06-05 23:43:52 +02:00 |
Martin Kroeker
|
0c2ac76a63
|
Merge pull request #4734 from XiWeiGu/loongarch64_small_matrix
LoongArch: DGEMM small matrix opt
|
2024-06-05 19:13:17 +02:00 |
Martin Kroeker
|
913be34bf0
|
Merge pull request #4733 from martin-frbg/issue4719
Drop the -static Fortran flag from generic RISCV builds as it breaks OpenMP
|
2024-06-05 00:11:09 +02:00 |
gxw
|
8ab2e9ec65
|
LoongArch: DGEMM small matrix opt
|
2024-06-04 16:52:45 +08:00 |
Martin Kroeker
|
df87aeb5a2
|
Drop the -static Fortran flag from generic builds as it breaks OpenMP
|
2024-06-04 09:49:18 +02:00 |
Martin Kroeker
|
3a3ff1ba5e
|
Merge pull request #4732 from martin-frbg/issue4731
fix conflicting types for cblas_sbgemm_batch
|
2024-06-03 21:18:28 +02:00 |
Martin Kroeker
|
db9f7bc552
|
fix float array types to include bfloat16
|
2024-06-03 00:22:16 +02:00 |
Martin Kroeker
|
a9fae32a33
|
Merge pull request #4730 from jake-arkinstall/develop
Updated CONTRIBUTORS.md
|
2024-06-01 13:38:04 +02:00 |
Jake Arkinstall
|
44004178aa
|
Updated CONTRIBUTORS.md
As requested on X (https://x.com/KroekerMartin/status/1755218919290278185)
|
2024-06-01 11:22:26 +01:00 |
Martin Kroeker
|
83bc8d5dd8
|
Merge pull request #4712 from RajalakshmiSR/zscalp10
POWER: Fix issues in zscal to address lapack failures
|
2024-06-01 11:22:08 +02:00 |
Martin Kroeker
|
020b3e1682
|
fix handling of INF arguments
|
2024-06-01 00:51:18 +02:00 |
Martin Kroeker
|
56bd57ca99
|
Merge pull request #4720 from martin-frbg/issue3039
Resurrect and complete cblas_?gemm_batch
|
2024-06-01 00:34:32 +02:00 |
Martin Kroeker
|
6b564d53fd
|
Merge pull request #4727 from martin-frbg/issue4726
Fix another corner case of infinity handling in x86_64 ZSCAL
|
2024-05-31 19:44:33 +02:00 |
Martin Kroeker
|
db070a9223
|
add gemm_batch drivers
|
2024-05-31 18:29:27 +02:00 |
Martin Kroeker
|
076766df4e
|
Update CMakeLists.txt
|
2024-05-31 18:23:18 +02:00 |
Martin Kroeker
|
8c05765a5a
|
fix other corner cases where x=INF
|
2024-05-31 18:06:36 +02:00 |
Martin Kroeker
|
516743f7dc
|
fix other instances of mishandling INF
|
2024-05-31 16:02:12 +02:00 |
Martin Kroeker
|
9ff4e9714e
|
additional fixes for handling INF arguments
|
2024-05-31 15:44:07 +02:00 |
Martin Kroeker
|
ce130f11d2
|
Update zscal.c
|
2024-05-31 15:09:03 +02:00 |
Martin Kroeker
|
ab13cfef93
|
more fixes for infinite x
|
2024-05-31 14:34:49 +02:00 |
Martin Kroeker
|
a16f8249ba
|
add tests with the imaginary part of the array infinite
|
2024-05-31 01:08:17 +02:00 |
Martin Kroeker
|
ad2b5c67c8
|
fix another corner case involving infinity
|
2024-05-31 01:06:58 +02:00 |
Martin Kroeker
|
0d007adb18
|
fix clang_cl-flang job to use flang-new after the llvm update
|
2024-05-30 23:30:16 +02:00 |
Martin Kroeker
|
b9a1c9a06c
|
Merge pull request #4725 from Neumann-A/patch-1
Fix CMake warning
|
2024-05-30 21:32:32 +02:00 |
Martin Kroeker
|
ff6670cb83
|
don't generate non-cblas files for gemm_batch
|
2024-05-30 18:26:02 +02:00 |
Alexander Neumann
|
dd4505c5dd
|
Fix CMake warning
|
2024-05-30 09:04:23 +02:00 |
Martin Kroeker
|
362a063396
|
remove return value
|
2024-05-29 23:16:58 +02:00 |
Martin Kroeker
|
d0794f88dc
|
add gemm_batch driver
|
2024-05-29 15:49:20 +02:00 |
Martin Kroeker
|
833a8880c6
|
add cblas_?gemm_batch
|
2024-05-29 15:47:50 +02:00 |