7817 Commits

Author SHA1 Message Date
Pierrick Bouvier a4992e09bc Fix utest compilation
Introduced recently when adding new test cases for ZSCAL

- include cblas is needed for cblas_zscal
- ASSERT macro does not exist
- missing closing )
2024-01-18 18:21:30 +04:00
Martin Kroeker 6f0e0e4021
Merge pull request #4438 from Dirreke/csky-support
Add CSKY support
2024-01-18 13:04:52 +01:00
Martin Kroeker 43cb266178
Merge pull request #4441 from martin-frbg/gemv-threshold
Increase multithreading threshold for S/DGEMV by a factor of 50
2024-01-17 22:25:01 +01:00
Martin Kroeker d2fc4f3b4d
Increase multithreading threshold by a factor of 50 2024-01-17 20:59:24 +01:00
Martin Kroeker ec46ca7a43
Support Arm Compiler for Linux as classic flang (#4436)
* Support ArmCompilerforLinux as classic flang
2024-01-17 07:33:10 +01:00
Dirreke ec89466e14 Add CSKY support 2024-01-16 23:45:06 +08:00
Martin Kroeker 452741b67f
Merge pull request #4435 from imciner2/im/sapphire
Fix Clang sapphire rapids march flag
2024-01-16 13:57:29 +01:00
Ian McInerney 8f4e325ea8 Fix Clang sapphire rapids march flag 2024-01-15 23:42:03 +00:00
Martin Kroeker 13c764eaaa
Merge pull request #4434 from martin-frbg/issue4433
Only use mtune=native in ARM64 fallback paths when not cross-compiling
2024-01-15 23:36:07 +01:00
Martin Kroeker 025a1b2c7b
Only use mtune=native when not cross-compiling 2024-01-15 22:40:21 +01:00
Martin Kroeker 2527afaaa2
Merge pull request #4429 from martin-frbg/issue4428
Handle NAN and INF in ARM and generic/s390x ZSCAL
2024-01-15 11:26:12 +01:00
Martin Kroeker 0d2e486edf
Handle NAN and INF 2024-01-15 11:18:59 +01:00
Martin Kroeker a782103b9c
Merge pull request #4425 from martin-frbg/issue2392
Add BLAS extension openblas_set_num_threads_local()
2024-01-14 21:57:57 +01:00
Martin Kroeker 152a6c43b6
Add blas_omp_threads_local 2024-01-14 19:59:55 +01:00
Martin Kroeker 8a9d492af7
Add default for blas_omp_threads_local 2024-01-14 19:58:49 +01:00
Martin Kroeker b3341527ad
Merge pull request #4426 from martin-frbg/issue4415
Tweak LAPACK tests for SGS/DGS to avoid spurious errors resulting from FMA-induced inaccuracies
2024-01-13 23:27:13 +01:00
Martin Kroeker 9fab60d32f
Remove matrix dimension 6 from SGS to avoid spurious errors from FMA 2024-01-13 20:39:05 +01:00
Martin Kroeker bf66af3dc0
remove matrix dimension 6 from DGS to avoid spurious errors from FMA 2024-01-13 20:37:36 +01:00
Martin Kroeker 87d31af2ae
Add openblas_set_num_threads_local() 2024-01-13 20:06:24 +01:00
Martin Kroeker 2e2e538b7c
Add openblas_set_num_threads_local() and use of blas_omp_threads_local in OMP parallel regions 2024-01-13 20:02:43 +01:00
Martin Kroeker f9b2d7f225
Merge pull request #3253 from wi24rd/patch-1
Fix typo in common.h
2024-01-13 19:55:01 +01:00
Martin Kroeker 5f5b7c4f45
Merge pull request #4423 from martin-frbg/issue4422
Check compiler support for AVX512BF16 and base COL/SPR kernel choice on that
2024-01-12 16:30:50 +01:00
Martin Kroeker f31bea07dd
Merge pull request #4419 from martin-frbg/issue4413
[WIP] Add fixes and utests for ZSCAL with NaN or Inf arguments
2024-01-12 14:27:08 +01:00
Martin Kroeker 20413ee6ec
Update zscal.c 2024-01-12 13:11:13 +01:00
Martin Kroeker b57627c27f
Handle NAN and INF 2024-01-12 12:03:08 +01:00
Martin Kroeker d1ead06bd8
define NAN and INFINITY if needed 2024-01-12 09:29:13 +01:00
Martin Kroeker 995a990e24
Make AVX512 BFLOAT16 kernels conditional on compiler capability 2024-01-12 00:12:46 +01:00
Martin Kroeker 1dada6d65d
Add compiler test and flag for AVX512BF16 capability 2024-01-12 00:10:56 +01:00
Martin Kroeker 7df363e1e2
temporarily disable the MSA C/ZSCAL kernels 2024-01-12 00:08:52 +01:00
Martin Kroeker 3599f2de8b
Merge pull request #4421 from ChipKerchner/power10Copies_DGEMM
Replace two vector loads with one vector pair load and fix endianness of stores - DGEMM PowerPC versions.
2024-01-10 07:49:00 +01:00
Chip-Kerchner 5c5c1a1220 Merge remote-tracking branch 'origin/develop' into power10Copies_DGEMM 2024-01-09 07:18:41 -06:00
Chip-Kerchner 058dd2a4cb Replace two vector loads with one vector pair load and fix endianness of stores - DGEMM versions. 2024-01-08 14:16:09 -06:00
Martin Kroeker 1c31f56e5a
Handle NAN 2024-01-08 16:11:25 +01:00
Martin Kroeker 7ee1ee38e2
Handle NaN in input 2024-01-08 14:20:07 +01:00
Martin Kroeker f637e12713
Handle INF and NAN 2024-01-08 09:52:38 +01:00
Martin Kroeker 25b0c48082
Update zscal.c 2024-01-08 09:49:18 +01:00
Martin Kroeker 5e7f714e93
Update zscal.c 2024-01-08 08:17:40 +01:00
Martin Kroeker cf8b03ae8b
Use NAN rather than SNAN for portability 2024-01-07 23:09:57 +01:00
Martin Kroeker 7a6a24647d
Merge pull request #4420 from martin-frbg/revertstuff
Revert accidental direct commits to develop
2024-01-07 23:04:22 +01:00
Martin Kroeker f0808d856b
Handle NAN in input 2024-01-07 20:27:29 +01:00
Martin Kroeker acf17a825d
Handle NAN in input 2024-01-07 20:26:16 +01:00
Martin Kroeker f052bd4705
revert accidental direct commit to develop 2024-01-07 20:21:48 +01:00
Martin Kroeker 91bbde7f64
revert accidental direct commit to develop 2024-01-07 20:21:05 +01:00
Martin Kroeker 2173356d5b
Update zscal_vector.c 2024-01-07 19:45:28 +01:00
Martin Kroeker b08a208365
Update zscal_vector.c 2024-01-07 19:14:41 +01:00
Martin Kroeker 0c33b57f5f
Handle NAN in input 2024-01-07 18:40:19 +01:00
Martin Kroeker 903589f84b
Update zscal.c 2024-01-07 18:37:00 +01:00
Martin Kroeker 711433fcf0
Update zscal.c 2024-01-07 18:01:58 +01:00
Martin Kroeker d3d99c34f2
Fix handling of NAN and INF 2024-01-07 17:56:51 +01:00
Martin Kroeker c9df62e883
Fix handling of NAN 2024-01-07 17:49:40 +01:00