Mark Seminatore
b0ad8a78ff
code to fix lost work in case of re-entrant calls to exec_blas_async()
2024-03-28 15:24:52 -07:00
Martin Kroeker
88b5330ae7
Restore outer loop of blas_buffer_inuse setup
2024-03-24 18:33:21 +01:00
shivammonaka
d49ebc54e1
Merge branch 'shivam-develop' into shivam-Locks
2024-02-29 11:58:14 +05:30
shivammonaka
bc191015e3
Using OpenMP locks with NUM_PARALLEL
2024-02-29 11:47:05 +05:30
Mark Seminatore
b29fd48998
Merge branch 'develop' into win_tidy
2024-02-12 10:23:17 -08:00
Mark Seminatore
98c56a7314
more cleanup
2024-02-08 13:50:15 -08:00
Chip Kerchner
d408ecedba
Add environment variable to display coretype for dynamic arch.
2024-02-08 12:17:18 -06:00
Chip Kerchner
ac6b4b7aa4
Make sure CPU ID works for all POWER_10 conditions
2024-02-08 08:56:30 -06:00
Chip Kerchner
08ce6b1c1c
Add missing CPU ID definitions for old versions of AIX.
2024-02-07 07:54:06 -06:00
Martin Kroeker
a4fde2c5ac
Merge pull request #4451 from martin-frbg/overflow_reset
...
Reset "buffer management structure overflowed" state and free auxiliary struct on blas_shutdown
2024-02-05 07:27:04 +01:00
Martin Kroeker
e61d96303d
Fix missing NO_AVX2 fallback for SapphireRapids
2024-02-04 10:05:20 +01:00
Mark Seminatore
42cb567f0f
more cleanup
2024-01-31 13:24:28 -08:00
Mark Seminatore
0d7fe5ea61
clean up whitespace
2024-01-29 22:33:47 -08:00
Martin Kroeker
d938aed7fe
reset "mem structure overflowed" state on shutdown
2024-01-23 17:15:53 +01:00
Chris Sidebottom
aaf65210cc
Add dynamic support for Arm(R) Neoverse(TM) V2 processor
...
Whilst I figure out how best to map the L2 parameters without
duplicating all of `ARMV8SVE`, lets just map this to `NEOVERSEV1`.
2024-01-19 19:05:50 +00:00
Martin Kroeker
152a6c43b6
Add blas_omp_threads_local
2024-01-14 19:59:55 +01:00
Martin Kroeker
8a9d492af7
Add default for blas_omp_threads_local
2024-01-14 19:58:49 +01:00
Martin Kroeker
87d31af2ae
Add openblas_set_num_threads_local()
2024-01-13 20:06:24 +01:00
Martin Kroeker
e7a895e714
Add Apple M as NeoverseN1
2023-12-25 12:36:05 +01:00
Chris Sidebottom
dc20a78188
Use functionally equivalent dynamic targets
...
Similar to `drivers/other/dynamic.c`, I've looked for functionally
equivalent targets and mapped them in the default DYNAMIC_ARCH build.
Users can still build specific cores using DYNAMIC_LIST.
2023-12-23 12:45:27 +00:00
Mark Seminatore
6bd7c54af5
introduce MT_TRACE to clean up SMP_DEBUG code
2023-12-11 15:13:04 -08:00
Mark Seminatore
edac80d7e8
some cleanup, dynamically scale threads, add missing WIN_CASE defn
2023-12-07 14:59:27 -08:00
Mark Seminatore
4ebf814b42
fix bug failing to mark task as finished.
2023-12-05 23:28:37 -08:00
Mark Seminatore
5f51811728
try at new threading model
2023-12-05 22:43:36 -08:00
Shiyou Yin
1310a0931b
loongarch: Refine build control for loongarch64.
...
1. Use getauxval instead of cpucfg to test hardware capability.
2. Remove unnecessary code and option for compiler check in c_check.
2023-11-28 20:23:55 +08:00
Chip-Kerchner
d99aad8ee3
Fix older version of gcc - missing __has_builtin, cpuid and no support of P10.
2023-11-14 11:07:08 -06:00
Martin Kroeker
9b5f8eb33a
Fix empty function prototypes
2023-11-12 19:35:53 +01:00
Martin Kroeker
9324520d0e
typo fix
2023-11-11 23:14:58 +01:00
Martin Kroeker
ff6437f2d7
Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14
2023-11-11 21:30:32 +01:00
Chip-Kerchner
4eecccd49b
Fix __builtin_cpu_is for AIX.
2023-11-08 07:12:21 -06:00
Chip-Kerchner
5e31c57083
Only define __builtin_cpu_is and __builtin_cpu_supports if not present.
2023-11-07 20:58:34 -06:00
Chip-Kerchner
7dcb2d67f2
Have POWER7 return arch=POWER6.
2023-11-01 15:23:28 -05:00
Chip-Kerchner
c8882bd9d8
Remove POWER7 from cpu list.
2023-11-01 14:53:55 -05:00
Chip Kerchner
badfb2e60f
Merge branch 'develop' into XLC-AIX
2023-10-26 09:19:31 -05:00
Martin Kroeker
e12aaed13d
Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure
2023-10-18 16:28:54 +02:00
Chip-Kerchner
880af052dd
Fix dynamic dispatch P9 for clang.
2023-10-06 13:41:49 -05:00
Chip-Kerchner
3655632611
Another small change.
2023-10-06 13:11:40 -05:00
Chip-Kerchner
36e08f6994
One more small change.
2023-10-06 13:08:41 -05:00
Chip-Kerchner
298bf1f240
Reduce differences.
2023-10-06 12:50:28 -05:00
Chip-Kerchner
71c6689af4
Fix dynamic dispatch to work for clang.
2023-10-06 12:20:40 -05:00
Chip-Kerchner
c60f9d9c08
Add missing CPU_POWER5.
2023-10-06 09:49:17 -05:00
Chip Kerchner
3cc72a3797
Only include cpu_id and cpu_supports in AIX and fix parameter types.
2023-10-04 09:54:37 -05:00
Chip-Kerchner
09212f84bf
Fix default case for cpu_is.
2023-10-03 12:23:21 -05:00
Chip-Kerchner
2d0b233425
Fix missing parens.
2023-10-03 10:26:14 -05:00
Chip-Kerchner
a8c90eb3ed
Added cpu_is
2023-10-03 10:24:04 -05:00
Chip-Kerchner
b677d0d5fd
Adding missing endif
2023-10-02 13:09:12 -05:00
Chip-Kerchner
e5dc376912
Remove duplicate defines.
2023-10-02 12:48:47 -05:00
Chip-Kerchner
10210748de
Revert PGI changes.
2023-10-02 12:44:07 -05:00
Chip-Kerchner
a922a07e61
Cleanup white spaces.
2023-10-02 12:24:30 -05:00
Chip-Kerchner
12130ee961
Remove tab.
2023-10-02 12:19:22 -05:00