Commit Graph

8393 Commits

Author SHA1 Message Date
Martin Kroeker 89e3fd0821
Merge pull request #4666 from martin-frbg/issue4633
Fix spurious errors in the extended utest for INTERFACE64=1 on big-endian systems
2024-04-29 17:23:20 +02:00
Martin Kroeker b1d722fc0c
Fix cast to work with INTERFACE64 (especially on big-endian) 2024-04-29 15:37:26 +02:00
Martin Kroeker 1031d161f6
Merge pull request #4663 from ayappanec/develop
Fix openblas_utest_ext build in AIX
2024-04-25 18:05:33 +02:00
Ayappan P f4ee0a423b Fix openblas_utest_ext build in AIX 2024-04-25 07:32:21 -04:00
Martin Kroeker faf7b3d1bb
Document the two HUGETLB options for buffer allocation 2024-04-24 17:49:40 +02:00
Martin Kroeker ab5882ebf0
Merge pull request #4661 from martin-frbg/issue4660
Fix CMAKE builds for Loongarch64
2024-04-24 09:01:22 +02:00
Martin Kroeker 69aa93e34f
Fix Loongson compiler flag check 2024-04-23 21:57:42 +02:00
Martin Kroeker 015042f7b5
Fix Loongson compiler flag test 2024-04-23 21:55:57 +02:00
Martin Kroeker 992b71fea2
remove stray comma 2024-04-23 21:52:26 +02:00
Martin Kroeker d421dec278
Merge pull request #4656 from zboszor/fix-x86-64-build-v2
Add forgotten conditional uses of PREFETCH
2024-04-23 21:05:08 +02:00
Martin Kroeker ae695d4ca0
Merge pull request #4642 from XiWeiGu/loongarch64_clang
CI: Add clang test for loongarch64
2024-04-23 18:25:49 +02:00
gxw 1cdad09760 CI: Add clang test for loongarch64 2024-04-23 19:30:24 +08:00
gxw 7cd438a5ac loongarch64: Fixed clang compilation issues 2024-04-23 19:19:11 +08:00
Martin Kroeker 35d84ad012
Merge pull request #4658 from mattip/remove-extra-suffix
do not add LIBNAMESUFFIX to dylib
2024-04-23 11:03:33 +02:00
Martin Kroeker f6eadf0971
Merge pull request #4577 from shivammonaka/Threading_Callback
Introduced callback to Pthread, Win32 and OpenMP backend
2024-04-22 19:19:46 +02:00
Martin Kroeker 61214fcef7
Fix utest_ext build on AIX (#4657)
* Add all-in-one version of utest_ext for AIX
2024-04-22 14:24:33 +02:00
Martin Kroeker ddcd7d6fa8
Merge branch 'develop' into Threading_Callback 2024-04-21 22:27:11 +02:00
Matti Picus 94feadf242 do not add LIBNAMESUFFIX to dylib 2024-04-21 13:16:40 +10:00
Zoltán Böszörményi ca64861ce8 Add forgotten conditional uses of PREFETCH
This fixes a (cross-)compilation/linker error for PRESCOTT
on Yocto.

Signed-off-by: Zoltán Böszörményi <zoltan.boszormenyi@xenial.com>
2024-04-19 10:52:28 +02:00
Martin Kroeker de465ffdd5
Merge pull request #4632 from martin-frbg/issue4570
Align Makefile build options for embedded (Cortex M)  systems with the CMAKE code
2024-04-18 19:47:55 +02:00
Martin Kroeker a5ab76c75e
Update .cirrus.yml 2024-04-18 16:44:04 +02:00
Martin Kroeker 0ec0746ae4
Update Makefile.system 2024-04-18 16:11:20 +02:00
Martin Kroeker d6b0badc05
Fix declarations for EMBEDDED 2024-04-18 16:06:21 +02:00
Martin Kroeker 930ebed48b
Add option EMBEDDED 2024-04-18 15:59:24 +02:00
yamazaki-mitsufumi 51ab1903e7 Expanding the scop of 2D thread distribution 2024-04-18 18:20:25 +09:00
Martin Kroeker bece730218
Merge pull request #4649 from ChipKerchner/readme2AIX
Change xlf compiler to xlf_r in README for AIX.
2024-04-17 12:54:50 +02:00
Martin Kroeker c57f9326d6
Add implementation of WhereAmI() to support NO_AFFINITY=0 on ARM64 (#4648)
* Add preliminary implementation of WhereAmI()
2024-04-17 08:25:17 +02:00
Chip-Kerchner 1d86e21cda Change xlf compiler to xlf_r in README for AIX. 2024-04-16 15:18:19 -05:00
Martin Kroeker a0083fdf68
Merge pull request #4647 from martin-frbg/lapack961
Fix uninitialized variables in LAPACK tests for ?QP3RK (Reference-LAPACK PR 961)
2024-04-16 00:02:12 +02:00
Martin Kroeker c20caa4960
Fix uninitialized variables in the LAPACK tests for the new ?QP3RK functions (Reference-LAPACK PR961) 2024-04-15 22:42:53 +02:00
Martin Kroeker 91a24b837d
Merge pull request #4643 from XiWeiGu/mips64_bug_fix
mips64: Fixed MSA optimization bugs for zgemv and cgemv
2024-04-15 14:55:14 +02:00
gxw 9c39e969f5 mips64: Fixed MSA optimization bugs for zgemv and cgemv 2024-04-15 15:17:29 +08:00
Martin Kroeker 4c03ed437f
Fix SICORTEX ASUM/ZASUM and SUM/ZSUM for INCX <=0 (#4640)
* Exit early if INCX <= 0
2024-04-14 15:39:11 +02:00
Martin Kroeker 23d5a8b16e
Merge pull request #4628 from XiWeiGu/CI-c910v-mips64-loongarch64
CI: add openblas_utest_ext for c910v, mips64 and loongarch64
2024-04-13 18:35:02 +02:00
Martin Kroeker b8618fa7f6
Merge pull request #4634 from martin-frbg/utest0
Fix uninitialized variables in the extensions utest
2024-04-13 17:16:54 +02:00
Martin Kroeker b883526a34
Fix uninitialized variables in the extensions utest 2024-04-13 12:26:35 +02:00
Martin Kroeker 83ec1f86ec
Merge pull request #4630 from martin-frbg/issue4178-2
Add another object signature for classic flang
2024-04-12 22:04:34 +02:00
Martin Kroeker a9703c70f3
Merge pull request #4631 from martin-frbg/issue4626
revert the C/Z NRM2 kernels for NEOVERSEN1 and VORTEX to the base NEON kernel as well
2024-04-12 20:21:31 +02:00
Martin Kroeker e6ae4b6f38
Merge pull request #4627 from mattip/soname
remove extraneous suffix from shared object SONAME
2024-04-12 16:44:58 +02:00
Martin Kroeker 00ee5d0367
On ARM, do not assume -marm by default if OS_EMBEDDED=1 2024-04-12 15:59:45 +02:00
Martin Kroeker 7cfd433d0c
revert the C/Z NRM2 kernels to the base NEON kernel as well 2024-04-12 15:34:04 +02:00
Martin Kroeker d26caff60c
Add another object signature for classic flang 2024-04-12 14:29:27 +02:00
Martin Kroeker 45dbf50036
Merge pull request #4629 from tetsuzo-usui/PfSizeTune_forNeoverseV1
Set GEMM_PREFERED_SIZE parameter for Neoverse V1
2024-04-12 14:08:27 +02:00
Usui, Tetsuzo ca673ca774 Add GEMM_PREFERED_SIZE parameter for Neoverse V1 2024-04-12 17:21:14 +09:00
gxw d9e2db3735 CI: add openblas_utest_ext for c910v, mips64 and loongarch64 2024-04-12 11:33:16 +08:00
Martin Kroeker 15b9fc3f78
Merge pull request #4624 from ChipKerchner/removeOMPfromXLF
Remove -openmp flag from XLF (since it doesn't support it).
2024-04-11 08:48:09 +02:00
Matti Picus 4d96e0ce18 remove extraneous suffix from shared object SONAME 2024-04-11 10:36:30 +10:00
Chip Kerchner 1c13cda3fc Remove -openmp flag from XLF (since it doesn't support it). 2024-04-10 15:16:47 -05:00
Martin Kroeker 93d975d8fd
Merge pull request #4593 from XiWeiGu/loongarch_add_buffer_offset
loongarch: Optimizing the performance of the GEMM on servers
2024-04-10 14:23:31 +02:00
gxw d8c4ea8793 loongarch: Optimizing the performance of the GEMM on servers 2024-04-09 09:03:34 -04:00