Martin Kroeker
|
ae695d4ca0
|
Merge pull request #4642 from XiWeiGu/loongarch64_clang
CI: Add clang test for loongarch64
|
2024-04-23 18:25:49 +02:00 |
gxw
|
1cdad09760
|
CI: Add clang test for loongarch64
|
2024-04-23 19:30:24 +08:00 |
gxw
|
7cd438a5ac
|
loongarch64: Fixed clang compilation issues
|
2024-04-23 19:19:11 +08:00 |
Martin Kroeker
|
35d84ad012
|
Merge pull request #4658 from mattip/remove-extra-suffix
do not add LIBNAMESUFFIX to dylib
|
2024-04-23 11:03:33 +02:00 |
Martin Kroeker
|
f6eadf0971
|
Merge pull request #4577 from shivammonaka/Threading_Callback
Introduced callback to Pthread, Win32 and OpenMP backend
|
2024-04-22 19:19:46 +02:00 |
Martin Kroeker
|
61214fcef7
|
Fix utest_ext build on AIX (#4657)
* Add all-in-one version of utest_ext for AIX
|
2024-04-22 14:24:33 +02:00 |
Martin Kroeker
|
ddcd7d6fa8
|
Merge branch 'develop' into Threading_Callback
|
2024-04-21 22:27:11 +02:00 |
Matti Picus
|
94feadf242
|
do not add LIBNAMESUFFIX to dylib
|
2024-04-21 13:16:40 +10:00 |
Zoltán Böszörményi
|
ca64861ce8
|
Add forgotten conditional uses of PREFETCH
This fixes a (cross-)compilation/linker error for PRESCOTT
on Yocto.
Signed-off-by: Zoltán Böszörményi <zoltan.boszormenyi@xenial.com>
|
2024-04-19 10:52:28 +02:00 |
Martin Kroeker
|
de465ffdd5
|
Merge pull request #4632 from martin-frbg/issue4570
Align Makefile build options for embedded (Cortex M) systems with the CMAKE code
|
2024-04-18 19:47:55 +02:00 |
Martin Kroeker
|
a5ab76c75e
|
Update .cirrus.yml
|
2024-04-18 16:44:04 +02:00 |
Martin Kroeker
|
0ec0746ae4
|
Update Makefile.system
|
2024-04-18 16:11:20 +02:00 |
Martin Kroeker
|
d6b0badc05
|
Fix declarations for EMBEDDED
|
2024-04-18 16:06:21 +02:00 |
Martin Kroeker
|
930ebed48b
|
Add option EMBEDDED
|
2024-04-18 15:59:24 +02:00 |
yamazaki-mitsufumi
|
51ab1903e7
|
Expanding the scope of 2D thread distribution
|
2024-04-18 18:20:25 +09:00 |
Martin Kroeker
|
bece730218
|
Merge pull request #4649 from ChipKerchner/readme2AIX
Change xlf compiler to xlf_r in README for AIX.
|
2024-04-17 12:54:50 +02:00 |
Martin Kroeker
|
c57f9326d6
|
Add implementation of WhereAmI() to support NO_AFFINITY=0 on ARM64 (#4648)
* Add preliminary implementation of WhereAmI()
|
2024-04-17 08:25:17 +02:00 |
Chip-Kerchner
|
1d86e21cda
|
Change xlf compiler to xlf_r in README for AIX.
|
2024-04-16 15:18:19 -05:00 |
Martin Kroeker
|
a0083fdf68
|
Merge pull request #4647 from martin-frbg/lapack961
Fix uninitialized variables in LAPACK tests for ?QP3RK (Reference-LAPACK PR 961)
|
2024-04-16 00:02:12 +02:00 |
Martin Kroeker
|
c20caa4960
|
Fix uninitialized variables in the LAPACK tests for the new ?QP3RK functions (Reference-LAPACK PR961)
|
2024-04-15 22:42:53 +02:00 |
Martin Kroeker
|
91a24b837d
|
Merge pull request #4643 from XiWeiGu/mips64_bug_fix
mips64: Fixed MSA optimization bugs for zgemv and cgemv
|
2024-04-15 14:55:14 +02:00 |
gxw
|
9c39e969f5
|
mips64: Fixed MSA optimization bugs for zgemv and cgemv
|
2024-04-15 15:17:29 +08:00 |
Martin Kroeker
|
4c03ed437f
|
Fix SICORTEX ASUM/ZASUM and SUM/ZSUM for INCX <= 0 (#4640)
* Exit early if INCX <= 0
|
2024-04-14 15:39:11 +02:00 |
Martin Kroeker
|
23d5a8b16e
|
Merge pull request #4628 from XiWeiGu/CI-c910v-mips64-loongarch64
CI: add openblas_utest_ext for c910v, mips64 and loongarch64
|
2024-04-13 18:35:02 +02:00 |
Martin Kroeker
|
b8618fa7f6
|
Merge pull request #4634 from martin-frbg/utest0
Fix uninitialized variables in the extensions utest
|
2024-04-13 17:16:54 +02:00 |
Martin Kroeker
|
b883526a34
|
Fix uninitialized variables in the extensions utest
|
2024-04-13 12:26:35 +02:00 |
Martin Kroeker
|
83ec1f86ec
|
Merge pull request #4630 from martin-frbg/issue4178-2
Add another object signature for classic flang
|
2024-04-12 22:04:34 +02:00 |
Martin Kroeker
|
a9703c70f3
|
Merge pull request #4631 from martin-frbg/issue4626
revert the C/Z NRM2 kernels for NEOVERSEN1 and VORTEX to the base NEON kernel as well
|
2024-04-12 20:21:31 +02:00 |
Martin Kroeker
|
e6ae4b6f38
|
Merge pull request #4627 from mattip/soname
remove extraneous suffix from shared object SONAME
|
2024-04-12 16:44:58 +02:00 |
Martin Kroeker
|
00ee5d0367
|
On ARM, do not assume -marm by default if OS_EMBEDDED=1
|
2024-04-12 15:59:45 +02:00 |
Martin Kroeker
|
7cfd433d0c
|
revert the C/Z NRM2 kernels to the base NEON kernel as well
|
2024-04-12 15:34:04 +02:00 |
Martin Kroeker
|
d26caff60c
|
Add another object signature for classic flang
|
2024-04-12 14:29:27 +02:00 |
Martin Kroeker
|
45dbf50036
|
Merge pull request #4629 from tetsuzo-usui/PfSizeTune_forNeoverseV1
Set GEMM_PREFERED_SIZE parameter for Neoverse V1
|
2024-04-12 14:08:27 +02:00 |
Usui, Tetsuzo
|
ca673ca774
|
Add GEMM_PREFERED_SIZE parameter for Neoverse V1
|
2024-04-12 17:21:14 +09:00 |
gxw
|
d9e2db3735
|
CI: add openblas_utest_ext for c910v, mips64 and loongarch64
|
2024-04-12 11:33:16 +08:00 |
Martin Kroeker
|
15b9fc3f78
|
Merge pull request #4624 from ChipKerchner/removeOMPfromXLF
Remove -openmp flag from XLF (since it doesn't support it).
|
2024-04-11 08:48:09 +02:00 |
Matti Picus
|
4d96e0ce18
|
remove extraneous suffix from shared object SONAME
|
2024-04-11 10:36:30 +10:00 |
Chip Kerchner
|
1c13cda3fc
|
Remove -openmp flag from XLF (since it doesn't support it).
|
2024-04-10 15:16:47 -05:00 |
Martin Kroeker
|
93d975d8fd
|
Merge pull request #4593 from XiWeiGu/loongarch_add_buffer_offset
loongarch: Optimizing the performance of the GEMM on servers
|
2024-04-10 14:23:31 +02:00 |
gxw
|
d8c4ea8793
|
loongarch: Optimizing the performance of the GEMM on servers
|
2024-04-09 09:03:34 -04:00 |
Martin Kroeker
|
3cf57a61d5
|
Merge pull request #4609 from yu-chen-surf/develop
Get the l2 cache size via environment variable on confidential VM
|
2024-04-06 19:02:00 +02:00 |
Martin Kroeker
|
fbd42e9e0e
|
Merge pull request #4616 from MehdiChinoune/patch-1
Don't pass `-exhaustive-register-search` directly to clang compiler
|
2024-04-06 14:12:10 +02:00 |
Martin Kroeker
|
03ff65190d
|
Merge pull request #4614 from martin-frbg/issue4449-2
Retain the bf16 in fallback versions of the NeoverseN2 -march flag
|
2024-04-06 12:29:28 +02:00 |
Martin Kroeker
|
12650c912c
|
Merge pull request #4613 from martin-frbg/issue4612
Do not run the CBLAS_?GEMM3M tests when cross-compiling with gmake
|
2024-04-06 11:11:27 +02:00 |
Martin Kroeker
|
4eb4b033e5
|
Merge pull request #4610 from martin-frbg/issue4608
Make the new ZSCAL utest not require CBLAS
|
2024-04-06 08:38:36 +02:00 |
cyk
|
95315f95d2
|
Update user_manual.md for static linker
When linking statically, the build fails with pthread_create undefined, so -lpthread should be added.
|
2024-04-06 13:51:01 +08:00 |
مهدي شينون (Mehdi Chinoune)
|
cda55f2fd2
|
Don't pass `-exhaustive-register-search` directly to clang compiler
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to clang compiler.
|
2024-04-06 05:54:48 +01:00 |
Martin Kroeker
|
14e71c249d
|
retain the bf16 capability in fallback versions of the -march option for NeoverseN2
|
2024-04-05 23:01:34 +02:00 |
Martin Kroeker
|
48e017de09
|
fix position of endif - gemm3m tests should not be run in cross-compiles
|
2024-04-05 22:54:34 +02:00 |
Martin Kroeker
|
9c86838279
|
use blasint for INTERFACE64 compatibility
|
2024-04-05 22:48:08 +02:00 |