Commit Graph

8393 Commits

Author SHA1 Message Date
Martin Kroeker 20016a0096
fix loop condition for incx < 0 2024-03-12 15:48:55 +01:00
Martin Kroeker 09e84bd29a
fix loop condition for incx < 0 2024-03-12 15:48:00 +01:00
Martin Kroeker f747aedb52
fix loop condition for incx < 0 2024-03-12 15:47:17 +01:00
Martin Kroeker 23796f8d31
fix loop condition for incx < 0 2024-03-12 15:46:23 +01:00
Martin Kroeker bf93459746
fix loop condition for incx < 0 2024-03-12 15:45:23 +01:00
Chip Kerchner 6759f2de4e Update README for build instructions on AIX and OpenXL. 2024-03-12 09:07:47 -05:00
Evgeni Burovski 07dddde1fe
Update utest/test_dnrm2.c 2024-03-12 15:28:50 +03:00
Evgeni Burovski a6e16a065d
Add a test for dnrm(..., incx=-1) 2024-03-12 15:15:56 +03:00
Martin Kroeker 4a6025af9a
Merge pull request #4552 from martin-frbg/issue4551
Allow negative INC_X in arm64 ThunderX2T99 and Apple M ?NRM2
2024-03-12 10:08:09 +01:00
Martin Kroeker afab848543
react to macos vm image updates on cirrus (#4550)
* react to macos vm image updates on cirrus
2024-03-11 23:09:05 +01:00
Martin Kroeker e41d01bad9
remove early exit on negative inc_x 2024-03-11 22:53:54 +01:00
Martin Kroeker 02a025f9c1
remove early exit on negative inc_x 2024-03-11 22:52:18 +01:00
Martin Kroeker 4fd8bc1f11
Merge pull request #4549 from martin-frbg/issue4496
Stop using sched_yield on non-Windows x86_64
2024-03-11 13:30:33 +01:00
Martin Kroeker 07b1c0bc10
Stop using sched_yield on non-Windows x86_64 2024-03-11 08:01:49 +01:00
Martin Kroeker cc9713aa20
Merge pull request #4548 from martin-frbg/issue4547
Fix compiler options for Cortex-X1,X2,A710 in gmake builds
2024-03-11 07:15:16 +01:00
Martin Kroeker a14176440a
Add version macro for GCC12 2024-03-10 23:22:05 +01:00
Martin Kroeker c1f7a81663
fix mtune for CortexX1, add mtune for X2 and A710 2024-03-10 23:19:56 +01:00
Martin Kroeker bc21464e14
Merge pull request #4546 from martin-frbg/uclibc-nommu2
Skip post-fork utest in uclibc builds without fork support
2024-03-10 20:58:06 +01:00
Martin Kroeker 8a665f0d57
Skip test when building with a uclibc that does not implement fork 2024-03-10 19:08:03 +01:00
Martin Kroeker 91b1fb7f9b
Merge pull request #4544 from martin-frbg/uclibc-nommu
Skip the fork utest in builds with an uClibc that lacks fork
2024-03-10 15:57:24 +01:00
Martin Kroeker 30e8d255de
Skip the fork test when building against an uClibc that does not implement fork 2024-03-10 00:05:01 +01:00
Martin Kroeker b1f2ef5e0b
Merge pull request #4542 from ErnstPeng/feature-branch
Optimized ssymv and dsymv kernel LSX for LoongArch
2024-03-06 14:01:02 +01:00
Martin Kroeker 56fad407d1
Merge pull request #4527 from ChipKerchner/fixAIXBuildIssues
Fix LAPACK unit testing build issues.
2024-03-05 17:55:08 +01:00
pengxu 680a77fafc Optimized ssymv and dsymv kernel LSX for LoongArch 2024-03-05 20:36:59 +08:00
Martin Kroeker dcfe782f4b
Merge pull request #4540 from ErnstPeng/feature-branch
Optimized ssymv and dsymv kernel LASX for LoongArch
2024-03-04 14:58:56 +01:00
pengxu 6546600342 Optimized ssymv and dsymv kernel LASX for LoongArch 2024-03-04 16:18:39 +08:00
Martin Kroeker 780497303c
Merge pull request #4539 from ChipKerchner/revertCgemmZgemmChanges
Revert cgemm zgemm changes
2024-03-01 16:41:07 +01:00
Chip-Kerchner 99384933ff Revert "Merge pull request #4532 from austinpagan/cgemm_zgemm_c_code"
This reverts commit accea15551, reversing
changes made to b925353006.
2024-03-01 07:57:39 -06:00
Chip-Kerchner edb7ab5ccf Merge branch 'develop' of https://github.com/openmathlib/openblas into develop 2024-03-01 07:43:41 -06:00
shivammonaka d49ebc54e1 Merge branch 'shivam-develop' into shivam-Locks 2024-02-29 11:58:14 +05:30
shivammonaka bc191015e3 Using OpenMP locks with NUM_PARALLEL 2024-02-29 11:47:05 +05:30
Martin Kroeker 577d480c62
Merge pull request #4529 from ErnstPeng/feature-branch
Optimized sgemv and dgemv kernel LSX for LoongArch
2024-02-28 13:49:54 +01:00
pengxu b2db064285 Optimized sgemv and dgemv kernel LSX for LoongArch 2024-02-28 18:07:27 +08:00
Martin Kroeker cfbb701497
Merge pull request #4536 from XiWeiGu/loongarch64-cgemv-zgemv-opt
Loongarch64 cgemv zgemv opt
2024-02-28 10:15:34 +01:00
gxw 8e05c053be LoongArch64:Fixed the failed test cases test_{c/z}gemv_n in test_extensions 2024-02-27 22:19:26 -05:00
gxw 3f22fc2233 LoongArch64: Add zgemv LSX opt 2024-02-27 22:19:04 -05:00
gxw c508a10cf2 LoongArch64: Add cgemv LSX opt 2024-02-27 22:17:30 -05:00
Martin Kroeker 5fbe25954b
Merge pull request #4534 from martin-frbg/issue4528
Omit redundant prefixes or suffixes in library naming
2024-02-27 22:29:44 +01:00
Martin Kroeker accea15551
Merge pull request #4532 from austinpagan/cgemm_zgemm_c_code
Cgemm zgemm c code
2024-02-27 22:07:07 +01:00
Martin Kroeker d1409407a0
Omit redundant prefixes or suffixes in library naming 2024-02-27 21:05:59 +01:00
Martin Kroeker b925353006
Merge pull request #4533 from martin-frbg/fixup4501
CMake: add option for building benchmarks and handle their dependencies
2024-02-27 19:55:39 +01:00
Martin Kroeker cfc28c586e
Exclude LAPACK testsuite and LAPACK-dependent benchmarks in no-LAPACK builds 2024-02-27 19:55:12 +01:00
Martin Kroeker 4266b39304
Make building the benchmarks optional and handle dependency on other options 2024-02-27 17:52:45 +01:00
Martin Kroeker 96e4218daf
Merge pull request #4501 from dmikushin/add-benchmarks-to-cmake
Adding basic support of benchmarks into CMake
2024-02-27 16:25:04 +01:00
Martin Kroeker 5a57cbf92d
Merge pull request #4531 from martin-frbg/issue4530
Fix erroneous mapping of Z13 SUM kernels to ASUM
2024-02-27 13:10:40 +01:00
Martin Kroeker 8e872a91a9
Fix erroneous mapping of SUM kernels to ASUM 2024-02-27 11:28:50 +01:00
Martin Kroeker 608effee7b
Merge pull request #4524 from martin-frbg/test3m
Enable GEMM3M tests when the architecture supports GEMM3M
2024-02-27 10:39:28 +01:00
Martin Kroeker f81c1d4b59
Fix portability problem 2024-02-27 07:19:52 +01:00
Martin Kroeker f7ffab870b
fix missing malloc 2024-02-26 23:03:10 +01:00
Martin Kroeker 38283f678e
Fix portability problems 2024-02-26 22:22:48 +01:00