Honglin Zhu
55d686d41e
neoverse n2 sbgemm:
...
implement ncopy tcopy kernel_8x4
2022-06-29 10:14:21 +08:00
Honglin Zhu
04593bb27c
neoverse n2 sbgemm: init file
2022-06-29 10:14:21 +08:00
Martin Kroeker
1fb4259077
Merge pull request #3673 from martin-frbg/azuredynmingw
...
AzureCI: drop cpus from the DYNAMIC_LIST for Windows/mingw to save time
2022-06-28 23:13:11 +02:00
Martin Kroeker
47a0e53196
mingw-dynamic arch: drop Haswell too
2022-06-28 21:40:04 +02:00
Martin Kroeker
c7b3ce010e
drop NEHALEM from the DYNLIST for Windows/mingw to save time
2022-06-28 20:12:11 +02:00
Martin Kroeker
be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel
...
POWER: fix issues with the small matrix kernel
2022-06-28 16:09:36 +02:00
Martin Kroeker
92275a7902
Merge pull request #3642 from nursik/develop
...
Add ARM64 support for Windows
2022-06-28 16:05:11 +02:00
Martin Kroeker
914c4d0fe8
Add C versions of the CBLAS test sources (#3656)
...
* Add C conversions of the CBLAS tests for NOFORTRAN=1 builds
* Enable CTEST without Fortran and fix passing of BUILD_vartype options to exports/gensymbol
2022-06-28 11:52:48 +02:00
Martin Kroeker
2857987ff6
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
2022-06-28 11:46:25 +02:00
VFerrari
2062280c6f
Power: Enable SMALL_MATRIX OPT as default for dynamic arch
2022-06-25 03:47:03 -03:00
VFerrari
cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0
...
This patch fixes an issue that occurs when OpenBLAS is compiled for
TARGET=POWER10 with the flag USE_THREAD set to 0.
The function `num_cpu_avail` is only available when USE_THREAD=1,
that is, when SMP is defined.
2022-06-25 03:46:46 -03:00
Martin Kroeker
9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10
...
POWER10: Fix ZGEMM testcase failures
2022-06-18 20:52:26 +02:00
Martin Kroeker
9777c59d98
Merge pull request #3653 from RajalakshmiSR/dgemvp10
...
POWER10: convert dgemv inline assembly
2022-06-18 20:51:59 +02:00
Rajalakshmi Srinivasaraghavan
f191bc652b
POWER10: Fix ZGEMM testcase failures
...
This patch fixes the saving and restoring of non-volatile registers
in the zgemm POWER10 kernel.
2022-06-17 08:18:08 -05:00
Martin Kroeker
7060ca5002
Merge pull request #3647 from martin-frbg/exports_3.10.0
...
Amend gensymbol with some LAPACK 3.10.0 additions
2022-06-10 08:58:00 +02:00
Martin Kroeker
72ea19d187
Amend some LAPACK 3.10.0 additions
2022-06-09 19:31:08 +02:00
Nursultan Zarlyk
1dfc4e6150
Replace with ARM64 intrinsics
2022-06-09 18:49:49 +02:00
Rajalakshmi Srinivasaraghavan
8419d538ff
POWER10: convert dgemv inline assembly
...
This patch uses compiler builtins and matches the performance of the
assembly version. Tested with clang14 and gcc12.
2022-06-09 10:42:57 -05:00
Martin Kroeker
bfd9c1b58c
Merge pull request #3645 from martin-frbg/issue3644
...
Fix quotes around compiler args in C11 check
2022-06-08 19:29:07 +02:00
Martin Kroeker
79d98327e4
Fix quotes around compiler args in C11 check
2022-06-08 11:22:20 +02:00
Martin Kroeker
eb1faada19
Merge pull request #3643 from martin-frbg/fixgensymbol
...
Fix LAPACK path in new gensymbol script
2022-06-08 11:18:46 +02:00
Xianyi Zhang
5e9a912591
Merge branch 'develop' into risc-v
2022-06-06 14:12:09 +08:00
Xianyi Zhang
f9715605ac
Add PLCT to contributors.
2022-06-06 14:11:28 +08:00
Xianyi Zhang
3f88429bcf
Merge branch 'risc-v_fix_intrinsic' into risc-v
2022-06-06 13:56:05 +08:00
Xianyi Zhang
968e1f51d8
Update RISC-V Intrinsic API.
2022-06-06 13:52:21 +08:00
Martin Kroeker
e9c3535208
Fix LAPACK path in new gensymbol script
2022-06-05 23:28:12 +02:00
Martin Kroeker
f150c97ceb
Merge pull request #3641 from RajalakshmiSR/ppc_build
...
power10: Fix build issues due to perl scripts conversion
2022-06-05 11:23:29 +02:00
Nursultan Zarlyk
1bb7993a97
Fix MSVC ARM64 build. Add generic kernel for ARM64
2022-06-02 16:53:54 +02:00
Rajalakshmi Srinivasaraghavan
c98d63b637
power10: Fix build issues due to perl scripts conversion
...
Due to the recent Perl script conversion, there are some build
errors when compiling OpenBLAS with the Advance Toolchain compilers.
2022-06-02 08:11:10 -05:00
Martin Kroeker
28a24a4d4f
Merge pull request #3637 from martin-frbg/issue3636
...
Add fallback value for bogus sc_nprocessors_conf in getarch
2022-05-27 10:23:02 +02:00
Martin Kroeker
14ae22bf7a
Add fallback value for bogus sc_nprocessors_conf
2022-05-27 00:29:17 +02:00
Martin Kroeker
771dc6a8d8
Merge pull request #3635 from martin-frbg/issue3634
...
Support compilation with the Intel ifx compiler
2022-05-26 11:57:53 +02:00
Martin Kroeker
19413624d0
Add Intel ifx compiler
2022-05-26 09:31:49 +02:00
Martin Kroeker
f56e4b620f
Merge pull request #3633 from martin-frbg/perl_fallback
...
Add back original PERL-based build scripts and add option USE_PERL
2022-05-22 21:18:44 +02:00
Martin Kroeker
5cb0d23027
Support USE_PERL fallback for gensymbol
2022-05-22 18:36:24 +02:00
Martin Kroeker
f5a379bf77
Add USE_PERL fallback option for gensymbol script
2022-05-22 18:35:23 +02:00
Martin Kroeker
cfc1a9ed8d
Add back original PERL-based script under new name
2022-05-22 18:33:24 +02:00
Martin Kroeker
a3e02742f2
Add USE_PERL fallback option for create script used with FUNCTION_PROFILE
2022-05-22 18:32:19 +02:00
Martin Kroeker
f1c570a5f1
Add back original PERL-based script under new name
2022-05-22 18:29:01 +02:00
Martin Kroeker
181b96257c
Add back PERL-based scripts under new name
2022-05-22 18:27:45 +02:00
Martin Kroeker
7093a34a34
Add fallback option USE_PERL for original PERL-based build scripts
2022-05-22 18:27:02 +02:00
Martin Kroeker
c4b52ef46e
Merge pull request #3624 from ioraff/no-perl
...
rewrite perl scripts in universal shell
2022-05-22 18:21:17 +02:00
Martin Kroeker
d0c3504255
Merge pull request #3631 from martin-frbg/revertdynskx
...
Revert selection of a different DGEMM kernel for SkylakeX in DYNAMIC_ARCH builds
2022-05-20 13:47:09 +02:00
Martin Kroeker
dac14a5f7d
revert "switch DGEMM parameters for SkylakeX if DYNAMIC_ARCH"
2022-05-20 11:28:23 +02:00
Martin Kroeker
dc49edd4e6
Revert "roll back DGEMM kernel ... for DYNAMIC_ARCH"
2022-05-20 11:23:30 +02:00
Martin Kroeker
faf58d2b3f
Merge pull request #3630 from martin-frbg/fixpr3629
...
Fix compilation of cpuid_riscv
2022-05-20 06:37:37 +02:00
Martin Kroeker
30df29c0b3
Fix compilation
2022-05-19 18:57:46 +02:00
Zhang Xianyi
a720e2ca8a
Merge pull request #3629 from Rabenda/riscv-c910
...
riscv: Fix machine recognition for c910v
2022-05-19 17:57:19 +08:00
Han Gao
8dd4579480
riscv: Fix machine recognition for c910v
...
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-05-19 17:32:48 +08:00
Owen Rafferty
42c7a27e6b
rewrite perl scripts in universal shell
2022-05-18 19:00:15 -05:00