Commit Graph

7452 Commits

Author SHA1 Message Date
Honglin Zhu 55d686d41e neoverse n2 sbgemm:
implement ncopy tcopy kernel_8x4
2022-06-29 10:14:21 +08:00
Honglin Zhu 04593bb27c neoverse n2 sbgemm: init file 2022-06-29 10:14:21 +08:00
Martin Kroeker 1fb4259077
Merge pull request #3673 from martin-frbg/azuredynmingw
AzureCI: drop cpus from the DYNAMIC_LIST for Windows/mingw to save time
2022-06-28 23:13:11 +02:00
Martin Kroeker 47a0e53196
mingw-dynamic arch: drop Haswell too 2022-06-28 21:40:04 +02:00
Martin Kroeker c7b3ce010e
drop NEHALEM from the DYNLIST for Windows/mingw to save time 2022-06-28 20:12:11 +02:00
Martin Kroeker be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel
POWER: fix issues with the small matrix kernel
2022-06-28 16:09:36 +02:00
Martin Kroeker 92275a7902
Merge pull request #3642 from nursik/develop
Add ARM64 support for Windows
2022-06-28 16:05:11 +02:00
Martin Kroeker 914c4d0fe8
Add C versions of the CBLAS test sources (#3656)
* Add C conversions of the CBLAS tests for NOFORTRAN=1 builds

* Enable CTEST without Fortran and fix passing of BUILD_vartype options to exports/gensymbol
2022-06-28 11:52:48 +02:00
Martin Kroeker 2857987ff6
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs 2022-06-28 11:46:25 +02:00
VFerrari 2062280c6f
Power: Enable SMALL_MATRIX OPT as default for dynamic arch 2022-06-25 03:47:03 -03:00
VFerrari cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0
This patch fixes an issue when OpenBLAS is compiled for TARGET=POWER10
and the flag USE_THREAD is set to 0.

The function `num_cpu_avail` is only available when USE_THREAD=1,
so SMP is defined.
2022-06-25 03:46:46 -03:00
Martin Kroeker 9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10
POWER10: Fix ZGEMM testcase failures
2022-06-18 20:52:26 +02:00
Martin Kroeker 9777c59d98
Merge pull request #3653 from RajalakshmiSR/dgemvp10
POWER10: convert dgemv inline assembly
2022-06-18 20:51:59 +02:00
Rajalakshmi Srinivasaraghavan f191bc652b POWER10: Fix ZGEMM testcase failures
This patch fixes storing and restoring non volatile registers
in zgemm POWER10 kernel.
2022-06-17 08:18:08 -05:00
Martin Kroeker 7060ca5002
Merge pull request #3647 from martin-frbg/exports_3.10.0
Amend gensymbol with some LAPACK 3.10.0 additions
2022-06-10 08:58:00 +02:00
Martin Kroeker 72ea19d187
Amend some LAPACK 3.10.0 additions 2022-06-09 19:31:08 +02:00
Nursultan Zarlyk 1dfc4e6150 Replace with ARM64 intrinsics 2022-06-09 18:49:49 +02:00
Rajalakshmi Srinivasaraghavan 8419d538ff POWER10: convert dgemv inline assembly
This patch makes use of compiler builtins and matches with assembly
performance. Tested with clang14 and gcc12.
2022-06-09 10:42:57 -05:00
Martin Kroeker bfd9c1b58c
Merge pull request #3645 from martin-frbg/issue3644
Fix quotes around compiler args in C11 check
2022-06-08 19:29:07 +02:00
Martin Kroeker 79d98327e4
Fix quotes around compiler args in C11 check 2022-06-08 11:22:20 +02:00
Martin Kroeker eb1faada19
Merge pull request #3643 from martin-frbg/fixgensymbol
Fix LAPACK path in new gensymbol script
2022-06-08 11:18:46 +02:00
Xianyi Zhang 5e9a912591 Merge branch 'develop' into risc-v 2022-06-06 14:12:09 +08:00
Xianyi Zhang f9715605ac Add PLCT to contributors. 2022-06-06 14:11:28 +08:00
Xianyi Zhang 3f88429bcf Merge branch 'risc-v_fix_intrinsic' into risc-v 2022-06-06 13:56:05 +08:00
Xianyi Zhang 968e1f51d8 Update RISC-V Intrinsic API. 2022-06-06 13:52:21 +08:00
Martin Kroeker e9c3535208
Fix LAPACK path in new gensymbol script 2022-06-05 23:28:12 +02:00
Martin Kroeker f150c97ceb
Merge pull request #3641 from RajalakshmiSR/ppc_build
power10:  Fix build issues due to perl scripts conversion
2022-06-05 11:23:29 +02:00
Nursultan Zarlyk 1bb7993a97 Fix MSVC ARM64 build. Add generic kernel for ARM64 2022-06-02 16:53:54 +02:00
Rajalakshmi Srinivasaraghavan c98d63b637 power10: Fix build issues due to perl scripts conversion
Due to recent perl script conversion, there are some build
errors when compiling openblas with advance toolchain compilers.
2022-06-02 08:11:10 -05:00
Martin Kroeker 28a24a4d4f
Merge pull request #3637 from martin-frbg/issue3636
Add fallback value for bogus sc_nprocessors_conf in getarch
2022-05-27 10:23:02 +02:00
Martin Kroeker 14ae22bf7a
Add fallback value for bogus sc_nprocessors_conf 2022-05-27 00:29:17 +02:00
Martin Kroeker 771dc6a8d8
Merge pull request #3635 from martin-frbg/issue3634
Support compilation with the Intel ifx compiler
2022-05-26 11:57:53 +02:00
Martin Kroeker 19413624d0
Add Intel ifx compiler 2022-05-26 09:31:49 +02:00
Martin Kroeker f56e4b620f
Merge pull request #3633 from martin-frbg/perl_fallback
Add back original PERL-based build scripts and add option USE_PERL
2022-05-22 21:18:44 +02:00
Martin Kroeker 5cb0d23027
Support USE_PERL fallback for gensymbol 2022-05-22 18:36:24 +02:00
Martin Kroeker f5a379bf77
Add USE_PERL fallback option for gensymbol script 2022-05-22 18:35:23 +02:00
Martin Kroeker cfc1a9ed8d
Add back original PERL-based script under new name 2022-05-22 18:33:24 +02:00
Martin Kroeker a3e02742f2
Add USE_PERL fallback option for create script used with FUNCTION_PROFILE 2022-05-22 18:32:19 +02:00
Martin Kroeker f1c570a5f1
Add back original PERL-based script under new name 2022-05-22 18:29:01 +02:00
Martin Kroeker 181b96257c
Add back PERL-based scripts under new name 2022-05-22 18:27:45 +02:00
Martin Kroeker 7093a34a34
Add fallback option USE_PERL for original PERL-based build scripts 2022-05-22 18:27:02 +02:00
Martin Kroeker c4b52ef46e
Merge pull request #3624 from ioraff/no-perl
rewrite perl scripts in universal shell
2022-05-22 18:21:17 +02:00
Martin Kroeker d0c3504255
Merge pull request #3631 from martin-frbg/revertdynskx
Revert selection of a different DGEMM kernel for SkylakeX in DYNAMIC_ARCH builds
2022-05-20 13:47:09 +02:00
Martin Kroeker dac14a5f7d
revert "switch DGEMM parameters for SkylakeX if DYNAMIC_ARCH" 2022-05-20 11:28:23 +02:00
Martin Kroeker dc49edd4e6
Revert "roll back DGEMM kernel ... for DYNAMIC_ARCH" 2022-05-20 11:23:30 +02:00
Martin Kroeker faf58d2b3f
Merge pull request #3630 from martin-frbg/fixpr3629
Fix compilation of cpuid_riscv
2022-05-20 06:37:37 +02:00
Martin Kroeker 30df29c0b3
Fix compilation 2022-05-19 18:57:46 +02:00
Zhang Xianyi a720e2ca8a
Merge pull request #3629 from Rabenda/riscv-c910
riscv: Fix machine recognition for c910v
2022-05-19 17:57:19 +08:00
Han Gao 8dd4579480 riscv: Fix machine recognition for c910v
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-05-19 17:32:48 +08:00
Owen Rafferty 42c7a27e6b
rewrite perl scripts in universal shell 2022-05-18 19:00:15 -05:00