Commit Graph

8499 Commits

Author SHA1 Message Date
frjohnst 9b24b31419 resolve second_ conflict which breaks xlf timef 2024-02-21 15:52:29 -05:00
Chip-Kerchner bf2310442b Fix get_num_cores for AIX. 2024-02-21 13:26:28 -06:00
Chip-Kerchner a69adbbd11 Merge branch 'develop' of https://github.com/openmathlib/openblas into develop 2024-02-21 12:18:18 -06:00
Martin Kroeker 99ef76f9bb
Merge pull request #4511 from ErnstPeng/feature-branch
Optimized cgemm kernel 16x4 LASX for LoongArch
2024-02-21 14:25:57 +01:00
pengxu 4787a55c64 Optimized cgemm kernel 16x4 LASX for LoongArch 2024-02-21 15:28:47 +08:00
Martin Kroeker ebbf5b3ea0
Merge pull request #4504 from sergei-lewis/dev/slewis/ci
Add builds and unit tests for new RISCV platforms to CI
2024-02-16 22:48:28 +01:00
Sergei Lewis 461ecabb22 add RISCV64_ZVL128B and RISCV64_ZVL256B targets to CI flows and to README.md 2024-02-16 16:26:29 +00:00
Sergei Lewis ba17758c02 fix axpy implementations where y has a stride of 0 2024-02-16 16:00:38 +00:00
Martin Kroeker 5266998b9f
Merge pull request #4498 from mseminatore/win_tidy
blas_server_win32.c pass to clean up code
2024-02-15 14:37:37 +01:00
Martin Kroeker ca6b4961e4
updates to fix option conflicts and config file generation 2024-02-15 14:31:11 +01:00
Martin Kroeker c90979d8ef
allow for more pre- and suffixes in the name of the openblas library 2024-02-15 14:17:11 +01:00
Martin Kroeker 3120f12e76
allow for more pre- and suffixes in the name of the openblas library 2024-02-15 14:16:20 +01:00
Martin Kroeker a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX 2024-02-15 12:17:38 +01:00
Martin Kroeker ffbfc3c692
Add libname prefix and suffix 2024-02-15 12:16:34 +01:00
Martin Kroeker 179527f622
Merge branch 'OpenMathLib:develop' into issue4468 2024-02-15 12:15:39 +01:00
Martin Kroeker a28afac791
Add FIXED_LIBNAME, LIBNAMEPREFIX and LIBNAMESUFFIX 2024-02-15 11:48:33 +01:00
Martin Kroeker 57dd894af0
Merge pull request #4502 from dmikushin/add-missing-use_gemm3m-macro
Add missing USE_GEMM3M macro into CMake
2024-02-15 11:13:36 +01:00
Mark Seminatore b29fd48998
Merge branch 'develop' into win_tidy 2024-02-12 10:23:17 -08:00
Mark Seminatore 0a7ae326d2 Merge branch 'win_tidy' of https://github.com/mseminatore/OpenBLAS into win_tidy 2024-02-12 10:22:26 -08:00
Mark Seminatore 10548a0460 update contributors 2024-02-12 10:22:12 -08:00
Dmitry Mikushin d0f5dc763b Adding USE_GEMM3M macro to kernel targets, so that the *gemm3m functions and parameters can be included into the gotoblas structure. Fixes #4500 2024-02-12 02:29:58 +01:00
Dmitry Mikushin 8698f9e37f Adding basic support of benchmarks into CMake for single, double, single complex and double complex cases. Each benchmarking target has a suffix to identify the data type, for example ./benchmark_gemm3m_COMPLEX_DOUBLE is a gemm3m.c source compiled with COMPLEX and DOUBLE macros defined 2024-02-10 19:12:16 +01:00
kseniyazaytseva 7e9b1c0807 fix uninitialized data usage 2024-02-10 00:49:42 +03:00
kseniyazaytseva c6f30fd414 check for zero inc 2024-02-10 00:48:07 +03:00
kseniyazaytseva 5e9ead09ac fix info return 2024-02-10 00:47:25 +03:00
kseniyazaytseva 4c554bd527 check abs zero inc 2024-02-10 00:46:52 +03:00
kseniyazaytseva 46de7c8a2b Merge remote-tracking branch 'origin/risc-v-new-tests' into new-tests 2024-02-09 23:52:51 +03:00
Martin Kroeker 10ea3fb742
fix duplication of name parts 2024-02-09 17:09:55 +01:00
Martin Kroeker b1ae777afb
Merge pull request #4497 from sergei-lewis/dev/slewis/zaxpy
Fix axpy test hangs when n==0. Reenable zaxpy_vector kernel for C910V.
2024-02-09 16:22:00 +01:00
Martin Kroeker bb96e466ae
Introduce LIBNAMEPREFIX to avoid messing with the internal LIBPREFIX 2024-02-09 15:50:11 +01:00
Chip-Kerchner 32ed6e391a Merge branch 'develop' of https://github.com/openmathlib/openblas into develop 2024-02-09 07:25:04 -06:00
Sergei Lewis ff1523163f Fix axpy test hangs when n==0. Reenable zaxpy_vector kernel for C910V. 2024-02-09 12:59:14 +00:00
Martin Kroeker ba3bfe85ee
Merge pull request #4495 from martin-frbg/update-gensymbol
Update gensymbol with recently added CBLAS interfaces and LAPACK/LAPACKE functions
2024-02-09 08:55:22 +01:00
Martin Kroeker 93872f4681
drop the ?laqz? symbols for now (not translatable by f2c) 2024-02-08 23:02:09 +01:00
Mark Seminatore 98c56a7314 more cleanup 2024-02-08 13:50:15 -08:00
Martin Kroeker 83bec51355
Update with recently added CBLAS interfaces and LAPACK/LAPACKE functions 2024-02-08 21:23:48 +01:00
Martin Kroeker 974f29c4e9
Merge pull request #4494 from ChipKerchner/fixPower10CPUID
Make sure CPU ID works for all POWER_10 conditions
2024-02-08 21:21:32 +01:00
Chip Kerchner d408ecedba Add environment variable to display coretype for dynamic arch. 2024-02-08 12:17:18 -06:00
Martin Kroeker a96a04ee61
Merge pull request #4493 from martin-frbg/issue4475-3
Fix incompatible pointer types in the declarations of C/ZAXPBY
2024-02-08 16:50:06 +01:00
Chip Kerchner ac6b4b7aa4 Make sure CPU ID works for all POWER_10 conditions 2024-02-08 08:56:30 -06:00
Martin Kroeker 500ac4de5e
fix incompatible pointer types 2024-02-08 13:18:34 +01:00
Martin Kroeker b3fa16345d
fix prototype for c/zaxpby 2024-02-08 13:15:34 +01:00
kseniyazaytseva cfabc48190 Update rotg tests 2024-02-08 00:22:15 +03:00
kseniyazaytseva ec5cfe3bc8 Fix invalid tests 2024-02-08 00:21:38 +03:00
kseniyazaytseva ff10e6b6dc Fix zero step tests 2024-02-08 00:19:54 +03:00
Martin Kroeker e9cfb7fd30
Merge pull request #4491 from martin-frbg/fixup-4488
fix sbgemm bfloat16 conversion errors introduced in PR 4488
2024-02-07 21:34:40 +01:00
Chip-Kerchner cb9aa2a587 Merge branch 'develop' of https://github.com/openmathlib/openblas into develop 2024-02-07 13:09:58 -06:00
Martin Kroeker e9f480111e
fix sbgemm bfloat16 conversion errors introduced in PR 4488 2024-02-07 19:57:18 +01:00
Martin Kroeker 22b487b622
Merge pull request #4488 from martin-frbg/issue4475-2
Separate the interface for SBGEMMT from GEMMT
2024-02-07 18:40:35 +01:00
Martin Kroeker 818bf30628
Merge pull request #4490 from ChipKerchner/missingCPUIDsForAIX
Add missing CPU ID definitions for old versions of AIX.
2024-02-07 17:31:26 +01:00