Commit Graph

8033 Commits

Author SHA1 Message Date
Martin Kroeker 2e2e538b7c
Add openblas_set_num_threads_local() and use of blas_omp_threads_local in OMP parallel regions 2024-01-13 20:02:43 +01:00
Martin Kroeker f9b2d7f225
Merge pull request #3253 from wi24rd/patch-1
Fix typo in common.h
2024-01-13 19:55:01 +01:00
Martin Kroeker 5f5b7c4f45
Merge pull request #4423 from martin-frbg/issue4422
Check compiler support for AVX512BF16 and base COL/SPR kernel choice on that
2024-01-12 16:30:50 +01:00
Martin Kroeker f31bea07dd
Merge pull request #4419 from martin-frbg/issue4413
[WIP] Add fixes and utests for ZSCAL with NaN or Inf arguments
2024-01-12 14:27:08 +01:00
Martin Kroeker 20413ee6ec
Update zscal.c 2024-01-12 13:11:13 +01:00
Martin Kroeker b57627c27f
Handle NAN and INF 2024-01-12 12:03:08 +01:00
Martin Kroeker d1ead06bd8
define NAN and INFINITY if needed 2024-01-12 09:29:13 +01:00
Martin Kroeker 995a990e24
Make AVX512 BFLOAT16 kernels conditional on compiler capability 2024-01-12 00:12:46 +01:00
Martin Kroeker 1dada6d65d
Add compiler test and flag for AVX512BF16 capability 2024-01-12 00:10:56 +01:00
Martin Kroeker 7df363e1e2
temporarily disable the MSA C/ZSCAL kernels 2024-01-12 00:08:52 +01:00
Martin Kroeker 3599f2de8b
Merge pull request #4421 from ChipKerchner/power10Copies_DGEMM
Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM PowerPC versions.
2024-01-10 07:49:00 +01:00
Chip-Kerchner 5c5c1a1220 Merge remote-tracking branch 'origin/develop' into power10Copies_DGEMM 2024-01-09 07:18:41 -06:00
Chip-Kerchner 058dd2a4cb Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
Martin Kroeker 1c31f56e5a
Handle NAN 2024-01-08 16:11:25 +01:00
Martin Kroeker 7ee1ee38e2
Handle NaN in input 2024-01-08 14:20:07 +01:00
Martin Kroeker f637e12713
Handle INF and NAN 2024-01-08 09:52:38 +01:00
Martin Kroeker 25b0c48082
Update zscal.c 2024-01-08 09:49:18 +01:00
Martin Kroeker 5e7f714e93
Update zscal.c 2024-01-08 08:17:40 +01:00
Martin Kroeker cf8b03ae8b
Use NAN rather than SNAN for portability 2024-01-07 23:09:57 +01:00
Martin Kroeker 7a6a24647d
Merge pull request #4420 from martin-frbg/revertstuff
Revert accidental direct commits to develop
2024-01-07 23:04:22 +01:00
Martin Kroeker f0808d856b
Handle NAN in input 2024-01-07 20:27:29 +01:00
Martin Kroeker acf17a825d
Handle NAN in input 2024-01-07 20:26:16 +01:00
Martin Kroeker f052bd4705
revert accidental direct commit to develop 2024-01-07 20:21:48 +01:00
Martin Kroeker 91bbde7f64
revert accidental direct commit to develop 2024-01-07 20:21:05 +01:00
Martin Kroeker 2173356d5b
Update zscal_vector.c 2024-01-07 19:45:28 +01:00
Martin Kroeker b08a208365
Update zscal_vector.c 2024-01-07 19:14:41 +01:00
Martin Kroeker 0c33b57f5f
Handle NAN in input 2024-01-07 18:40:19 +01:00
Martin Kroeker 903589f84b
Update zscal.c 2024-01-07 18:37:00 +01:00
Martin Kroeker 711433fcf0
Update zscal.c 2024-01-07 18:01:58 +01:00
Martin Kroeker d3d99c34f2
Fix handling of NAN and INF 2024-01-07 17:56:51 +01:00
Martin Kroeker c9df62e883
Fix handling of NAN 2024-01-07 17:49:40 +01:00
Martin Kroeker def4996170
Fix handling of NAN and INF arguments 2024-01-07 15:29:42 +01:00
Martin Kroeker e48627c999
Add tests for ZSCAL with NaN and Inf arguments 2024-01-06 23:55:52 +01:00
Martin Kroeker 1412d2deeb
Update version to 0.3.26.dev 2024-01-02 22:33:01 +01:00
Martin Kroeker 4f5da84e2f
Update version to 0.3.26.dev 2024-01-02 22:32:27 +01:00
Martin Kroeker 1ad742844b
Merge pull request #4409 from OpenMathLib/release-0.3.0
merge release branch back into develop to copy tag
2024-01-02 22:31:38 +01:00
Martin Kroeker 6c77e5e314
Update Makefile.rule 2024-01-02 22:25:05 +01:00
Martin Kroeker fde8bb9903
Update version to 0.3.26 2024-01-02 22:24:33 +01:00
Martin Kroeker 8fe7f80271
Merge pull request #4408 from OpenMathLib/develop
merge develop for 0.3.26 release
2024-01-02 22:23:31 +01:00
Martin Kroeker cddd35fae1
Merge pull request #4407 from martin-frbg/changelog0326
Update Changelog for 0.3.26
2024-01-02 22:21:16 +01:00
Martin Kroeker 03713bc464
Update Changelog for 0.3.26 2024-01-02 22:08:49 +01:00
Martin Kroeker cdff44e4d3
Merge pull request #4406 from martin-frbg/issue3291
Increase S/D GEMM PQ parameters for Neoverse N1
2024-01-02 22:02:56 +01:00
Martin Kroeker 8278d0d093
Merge pull request #4353 from erikbs/feature/fix-xerbla-linking-on-older-mac-versions
Fix _xerbla_ linking on older Mac versions
2024-01-02 19:55:05 +01:00
Martin Kroeker 504f9b0c5e
Increase S/D GEMM PQ to match typical L2 size as forNeoverseV1 2024-01-02 18:46:21 +01:00
Martin Kroeker 534de14a02
Merge pull request #4402 from martin-frbg/lapack967
Fix an uninitialized read (use of wrong variable) in ?BBCSD (Reference-LAPACK PR 967)
2023-12-31 16:31:28 +01:00
Martin Kroeker 4a15d72420
AzureCI: Update alpine-chroot-install (#4403)
* Update alpine-chroot-install
2023-12-31 16:30:57 +01:00
Martin Kroeker 0c43c6fa99
Merge pull request #4341 from catap/openblas.pc.in
cmake/openblas.pc.in: fixed version and URL
2023-12-31 13:25:06 +01:00
Martin Kroeker 00d7476b4b
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:39:21 +01:00
Martin Kroeker 1b668479de
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:37:52 +01:00
Martin Kroeker bd787c8a1a
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:36:47 +01:00