Commit Graph

8331 Commits

Author SHA1 Message Date
Martin Kroeker 9a2a6a2e52
Merge pull request #4696 from frjohnst/restore_second
Revert PRs 4515 and 4520 (restore second, dsecnd)
2024-05-15 18:35:20 +02:00
frjohnst 87026ac1b1 Revert "fix conlict between PR 4515 and AIX shared obj support"
This reverts commit bdaa6705ca.

It turns out that PRs 4515 and 4520 break the tests under
lapack-netlib/TESTING which require SECOND and DSECND. IBM
has decided this is a bigger problem than the conflict
between lapack second_ and the xlf run time.
2024-05-15 09:45:17 -04:00
frjohnst 56d3d1039c Revert "resolve second_ conflict which breaks xlf timef"
This reverts commit 9b24b31419.

It turns out that PRs 4515 and 4520 break the tests under
lapack-netlib/TESTING which require SECOND and DSECND. IBM
has decided this is a bigger problem than the conflict
between lapack second_ and the xlf run time.
2024-05-15 09:44:29 -04:00
Martin Kroeker 6ca9ffa7f5
Merge pull request #4655 from yamazakimitsufumi/update_2d_thread_distribution
Expanding the scope of 2D thread distribution to improve multi-threaded DGEMM performance
2024-05-14 18:12:43 +02:00
Martin Kroeker b45a78c6e9
fix zdotu argument passing in utest_ext on windows (#4691)
* fix passing of results on windows
2024-05-13 14:50:50 +02:00
Martin Kroeker 1ab9f50561
Merge pull request #4690 from mattip/blasint
use blasint instead of int to quiet warnings
2024-05-13 11:00:33 +02:00
Matti Picus 243640c354 use blasint instead of int to quiet warnings 2024-05-12 10:24:16 +03:00
Martin Kroeker f0560f906f
Merge pull request #4689 from martin-frbg/issue4684
Fix compilation of the BLAS extension utests for NO_CBLAS=1
2024-05-11 14:39:54 +02:00
Martin Kroeker e1e0d9a2ae
Merge pull request #4688 from XiWeiGu/loongarch64_fixed_gcc14_compilation
loongarch64: Fixed GCC14 compilation issue
2024-05-11 13:38:45 +02:00
Martin Kroeker d8baf2f2ea
Support compilation without CBLAS 2024-05-11 13:10:54 +02:00
Martin Kroeker a6c184d150
forward NO_CFLAGS to the CFLAGS, if set 2024-05-11 13:07:30 +02:00
gxw ecf8b588a9 loongarch64: Fixed GCC14 compilation issue 2024-05-11 16:14:18 +08:00
Martin Kroeker 8da6f7e5f2
Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6
Loongarch64: Improving the Performance and Stability of dgemm
2024-05-10 11:29:12 +02:00
gxw f9a26240a7 loongarch64: Fixed icamax_lsx 2024-05-10 14:16:40 +08:00
gxw cb0f707409 loongarch64: Fixed utest fork:safety 2024-05-10 14:16:36 +08:00
gxw 637c650f4f loongarch64: Add buffer offset for target LOONGSON3R5 2024-05-10 11:42:53 +08:00
Martin Kroeker 5d678f1831
Merge pull request #4685 from martin-frbg/issue4660-2
Fix builds for LOONGARCH64 in LSX mode
2024-05-09 13:17:29 +02:00
Martin Kroeker b45d8e1ab2
remove stray comma 2024-05-09 12:33:19 +02:00
Martin Kroeker 5500b4ab26
Merge pull request #4680 from theAeon/develop
Expose whether locking is enabled in get_config
2024-05-08 19:03:57 +02:00
gxw 6017ad7146 loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 2024-05-08 10:10:26 +08:00
Martin Kroeker d66aa63478
Merge pull request #4681 from martin-frbg/fix4662-2
fix HUGETLB allocation for TLS mode as well
2024-05-08 01:44:32 +02:00
Martin Kroeker f0f1ff7820
fix HUGETLB allocation for TLS mode as well 2024-05-08 00:40:36 +02:00
Andrew Robbins edfe1aa471
Expose whether locking is enabled in get_config 2024-05-07 11:12:03 -04:00
Martin Kroeker edeb5259a1
Merge pull request #4679 from martin-frbg/fix4662
Restore Loongson LA64ARCH handling
2024-05-07 15:57:50 +02:00
Martin Kroeker 4376b6f7d2
Restore Loongson LA64ARCH handling 2024-05-07 14:42:01 +02:00
Martin Kroeker 8735b54fa8
Merge pull request #4662 from martin-frbg/hugetlb-doc
Fix and document the two HUGETLB options for buffer allocation in Makefile.rule
2024-05-07 13:32:07 +02:00
Martin Kroeker fc10673fd3
Merge branch 'develop' into hugetlb-doc 2024-05-07 13:31:39 +02:00
Martin Kroeker c20189cc82
Merge pull request #4677 from martin-frbg/issue4676
Add autodetection of Intel Meteor Lake and Emerald Rapids
2024-05-06 17:10:19 +02:00
Martin Kroeker bbd227ce4a
Add Intel Meteor Lake and Emerald Rapids 2024-05-06 00:11:44 +02:00
Martin Kroeker f034745ce6
Merge pull request #4675 from martin-frbg/issue4619
Mention LD_LIBRARY_PATH in user documentation
2024-05-04 15:50:13 +02:00
Martin Kroeker a82ecadc11
mention LD_LIBRARY_PATH 2024-05-04 15:48:48 +02:00
Martin Kroeker b859f6f191
Merge pull request #4617 from cyk2018/patch-1
[Doc]Update user_manual.md for static linker
2024-05-04 15:20:52 +02:00
Martin Kroeker dc99b61380
sort unwanted interdependencies of alloc_shm and alloc_hugetlb 2024-05-04 14:49:00 +02:00
Martin Kroeker 9c4e10fbd1
sort hugetlb and shm alloc options 2024-05-04 14:48:02 +02:00
Martin Kroeker a63d71129c
Merge pull request #4671 from martin-frbg/issue4668
Silence a GCC14 warning/error in the f2c-converted LAPACK
2024-04-30 20:06:42 +02:00
Martin Kroeker 3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK 2024-04-30 19:05:18 +02:00
Martin Kroeker 7c915e64ca
Silence a GCC14 warning/error in the f2c-converted LAPACK 2024-04-30 17:48:14 +02:00
Martin Kroeker edacf9b397
Work around spurious BLAS3 test errors on LOONGSON3R3/4 (#4667)
Force compilation with gfortran to use O0 on older Loongson hardware to avoid spurious test failures
2024-04-30 08:50:47 +02:00
Martin Kroeker 89e3fd0821
Merge pull request #4666 from martin-frbg/issue4633
Fix spurious errors in the extended utest for INTERFACE64=1 on big-endian systems
2024-04-29 17:23:20 +02:00
Martin Kroeker b1d722fc0c
Fix cast to work with INTERFACE64 (especially on big-endian) 2024-04-29 15:37:26 +02:00
Martin Kroeker 1031d161f6
Merge pull request #4663 from ayappanec/develop
Fix openblas_utest_ext build in AIX
2024-04-25 18:05:33 +02:00
Ayappan P f4ee0a423b Fix openblas_utest_ext build in AIX 2024-04-25 07:32:21 -04:00
Martin Kroeker faf7b3d1bb
Document the two HUGETLB options for buffer allocation 2024-04-24 17:49:40 +02:00
Martin Kroeker ab5882ebf0
Merge pull request #4661 from martin-frbg/issue4660
Fix CMAKE builds for Loongarch64
2024-04-24 09:01:22 +02:00
Martin Kroeker 69aa93e34f
Fix Loongson compiler flag check 2024-04-23 21:57:42 +02:00
Martin Kroeker 015042f7b5
Fix Loongson compiler flag test 2024-04-23 21:55:57 +02:00
Martin Kroeker 992b71fea2
remove stray comma 2024-04-23 21:52:26 +02:00
Martin Kroeker d421dec278
Merge pull request #4656 from zboszor/fix-x86-64-build-v2
Add forgotten conditional uses of PREFETCH
2024-04-23 21:05:08 +02:00
Martin Kroeker ae695d4ca0
Merge pull request #4642 from XiWeiGu/loongarch64_clang
CI: Add clang test for loongarch64
2024-04-23 18:25:49 +02:00
gxw 1cdad09760 CI: Add clang test for loongarch64 2024-04-23 19:30:24 +08:00