Martin Kroeker
|
79cb121ab9
|
Merge pull request #4563 from XiWeiGu/loongarch_fix_lasx
Loongarch: Fixed LASX opt
|
2024-03-16 10:34:32 +01:00 |
gxw
|
60e251a1f8
|
loongarch: Fixed {sc/dz}amax LASX opt
|
2024-03-16 14:52:17 +08:00 |
gxw
|
a10dde5554
|
loongarch: Fixed {s/d/sc/dz}amin LASX opt
|
2024-03-16 14:52:14 +08:00 |
gxw
|
6534d378b7
|
loongarch: Fixed {s/d/c/z}sum LASX opt
|
2024-03-16 14:52:10 +08:00 |
gxw
|
6159cffc58
|
loongarch: Fixed i{s/c/z}amin LASX opt
|
2024-03-16 14:52:06 +08:00 |
gxw
|
7d755912b9
|
loongarch: Fixed {s/d/c/z}axpby LASX opt
|
2024-03-16 14:51:56 +08:00 |
Martin Kroeker
|
66bde6243e
|
Merge pull request #4503 from shivammonaka/OpenMP-Locks
OpenMP locks instead of busy-waiting with NUM_PARALLEL
|
2024-03-14 20:56:04 +01:00 |
Martin Kroeker
|
dc0338af47
|
Merge pull request #4560 from martin-frbg/issue4551-3
Add support for negative increments to the ?NRM2 kernels for RISC-V RVV targets
|
2024-03-13 14:48:56 +01:00 |
Martin Kroeker
|
cf80bd8500
|
Update nrm2_rvv.c
|
2024-03-13 13:07:26 +01:00 |
Martin Kroeker
|
9baa757905
|
Update nrm2_vector.c
|
2024-03-13 11:40:14 +01:00 |
Martin Kroeker
|
18a6db6862
|
Update nrm2_vector.c
|
2024-03-13 11:10:26 +01:00 |
Martin Kroeker
|
855bbdda4f
|
Merge pull request #4556 from ChipKerchner/updateREADMEAIX
Update README for build instructions on AIX and OpenXL.
|
2024-03-12 23:12:50 +01:00 |
Martin Kroeker
|
3752e73919
|
handle incx < 0
|
2024-03-12 20:44:01 +01:00 |
Martin Kroeker
|
db70c7f7fb
|
handle incx < 0
|
2024-03-12 20:42:11 +01:00 |
Martin Kroeker
|
dee8557d58
|
handle incx < 0
|
2024-03-12 20:40:29 +01:00 |
Martin Kroeker
|
d9dff17aec
|
handle incx < 0
|
2024-03-12 20:38:23 +01:00 |
Martin Kroeker
|
5802e7a62f
|
Merge pull request #4559 from martin-frbg/issue4551-2
Remove another unwanted early exit in the ThunderX2/NeoN1/AppleM ?NRM2 kernels
|
2024-03-12 20:34:20 +01:00 |
Martin Kroeker
|
552c521353
|
remove another early exit for incx < 0
|
2024-03-12 18:49:27 +01:00 |
Martin Kroeker
|
ed532dc75b
|
remove another early exit for incx < 0
|
2024-03-12 18:47:00 +01:00 |
Martin Kroeker
|
161ac17466
|
Merge pull request #4554 from ev-br/patch-1
Add a test for dnrm(..., incx=-1)
|
2024-03-12 18:24:42 +01:00 |
Martin Kroeker
|
42584b4158
|
Merge pull request #4557 from martin-frbg/4552-2
Fix support for a negative INC_X in the "generic" NRM2 kernels
|
2024-03-12 16:13:54 +01:00 |
Martin Kroeker
|
6b89e1f1d7
|
fix loop condition for incx < 0
|
2024-03-12 15:49:41 +01:00 |
Martin Kroeker
|
20016a0096
|
fix loop condition for incx < 0
|
2024-03-12 15:48:55 +01:00 |
Martin Kroeker
|
09e84bd29a
|
fix loop condition for incx < 0
|
2024-03-12 15:48:00 +01:00 |
Martin Kroeker
|
f747aedb52
|
fix loop condition for incx < 0
|
2024-03-12 15:47:17 +01:00 |
Martin Kroeker
|
23796f8d31
|
fix loop condition for incx < 0
|
2024-03-12 15:46:23 +01:00 |
Martin Kroeker
|
bf93459746
|
fix loop condition for incx < 0
|
2024-03-12 15:45:23 +01:00 |
Chip Kerchner
|
6759f2de4e
|
Update README for build instructions on AIX and OpenXL.
|
2024-03-12 09:07:47 -05:00 |
Evgeni Burovski
|
07dddde1fe
|
Update utest/test_dnrm2.c
|
2024-03-12 15:28:50 +03:00 |
Evgeni Burovski
|
a6e16a065d
|
Add a test for dnrm(..., incx=-1)
|
2024-03-12 15:15:56 +03:00 |
Martin Kroeker
|
4a6025af9a
|
Merge pull request #4552 from martin-frbg/issue4551
Allow negative INC_X in arm64 ThunderX2T99 and Apple M ?NRM2
|
2024-03-12 10:08:09 +01:00 |
Martin Kroeker
|
afab848543
|
react to macos vm image updates on cirrus (#4550)
* react to macos vm image updates on cirrus
|
2024-03-11 23:09:05 +01:00 |
Martin Kroeker
|
e41d01bad9
|
remove early exit on negative inc_x
|
2024-03-11 22:53:54 +01:00 |
Martin Kroeker
|
02a025f9c1
|
remove early exit on negative inc_x
|
2024-03-11 22:52:18 +01:00 |
Martin Kroeker
|
4fd8bc1f11
|
Merge pull request #4549 from martin-frbg/issue4496
Stop using sched_yield on non-Windows x86_64
|
2024-03-11 13:30:33 +01:00 |
Martin Kroeker
|
07b1c0bc10
|
Stop using sched_yield on non-Windows x86_64
|
2024-03-11 08:01:49 +01:00 |
Martin Kroeker
|
cc9713aa20
|
Merge pull request #4548 from martin-frbg/issue4547
Fix compiler options for Cortex-X1,X2,A710 in gmake builds
|
2024-03-11 07:15:16 +01:00 |
Martin Kroeker
|
a14176440a
|
Add version macro for GCC12
|
2024-03-10 23:22:05 +01:00 |
Martin Kroeker
|
c1f7a81663
|
fix mtune for CortexX1, add mtune for X2 and A710
|
2024-03-10 23:19:56 +01:00 |
Martin Kroeker
|
bc21464e14
|
Merge pull request #4546 from martin-frbg/uclibc-nommu2
Skip post-fork utest in uclibc builds without fork support
|
2024-03-10 20:58:06 +01:00 |
Martin Kroeker
|
8a665f0d57
|
Skip test when building with a uclibc that does not implement fork
|
2024-03-10 19:08:03 +01:00 |
Martin Kroeker
|
91b1fb7f9b
|
Merge pull request #4544 from martin-frbg/uclibc-nommu
Skip the fork utest in builds with an uClibc that lacks fork
|
2024-03-10 15:57:24 +01:00 |
Martin Kroeker
|
30e8d255de
|
Skip the fork test when building against an uClibc that does not implement fork
|
2024-03-10 00:05:01 +01:00 |
Martin Kroeker
|
b1f2ef5e0b
|
Merge pull request #4542 from ErnstPeng/feature-branch
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-06 14:01:02 +01:00 |
Martin Kroeker
|
56fad407d1
|
Merge pull request #4527 from ChipKerchner/fixAIXBuildIssues
Fix LAPACK unit testing build issues.
|
2024-03-05 17:55:08 +01:00 |
pengxu
|
680a77fafc
|
Optimized ssymv and dsymv kernel LSX for LoongArch
|
2024-03-05 20:36:59 +08:00 |
Martin Kroeker
|
dcfe782f4b
|
Merge pull request #4540 from ErnstPeng/feature-branch
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 14:58:56 +01:00 |
pengxu
|
6546600342
|
Optimized ssymv and dsymv kernel LASX for LoongArch
|
2024-03-04 16:18:39 +08:00 |
Martin Kroeker
|
780497303c
|
Merge pull request #4539 from ChipKerchner/revertCgemmZgemmChanges
Revert cgemm zgemm changes
|
2024-03-01 16:41:07 +01:00 |
Chip-Kerchner
|
99384933ff
|
Revert "Merge pull request #4532 from austinpagan/cgemm_zgemm_c_code"
This reverts commit accea15551, reversing
changes made to b925353006.
|
2024-03-01 07:57:39 -06:00 |