724 Commits

Author SHA1 Message Date
Mark Seminatore b0ad8a78ff code to fix lost work in case of re-entrant calls to exec_blas_async() 2024-03-28 15:24:52 -07:00
Martin Kroeker 88b5330ae7 Restore outer loop of blas_buffer_inuse setup 2024-03-24 18:33:21 +01:00
shivammonaka d49ebc54e1 Merge branch 'shivam-develop' into shivam-Locks 2024-02-29 11:58:14 +05:30
shivammonaka bc191015e3 Using OpenMP locks with NUM_PARALLEL 2024-02-29 11:47:05 +05:30
Mark Seminatore b29fd48998 Merge branch 'develop' into win_tidy 2024-02-12 10:23:17 -08:00
Mark Seminatore 98c56a7314 more cleanup 2024-02-08 13:50:15 -08:00
Chip Kerchner d408ecedba Add environment variable to display coretype for dynamic arch. 2024-02-08 12:17:18 -06:00
Chip Kerchner ac6b4b7aa4 Make sure CPU ID works for all POWER_10 conditions 2024-02-08 08:56:30 -06:00
Chip Kerchner 08ce6b1c1c Add missing CPU ID definitions for old versions of AIX. 2024-02-07 07:54:06 -06:00
Martin Kroeker a4fde2c5ac Merge pull request #4451 from martin-frbg/overflow_reset
Reset "buffer management structure overflowed" state and free auxiliary struct on blas_shutdown
2024-02-05 07:27:04 +01:00
Martin Kroeker e61d96303d Fix missing NO_AVX2 fallback for SapphireRapids 2024-02-04 10:05:20 +01:00
Mark Seminatore 42cb567f0f more cleanup 2024-01-31 13:24:28 -08:00
Mark Seminatore 0d7fe5ea61 clean up whitespace 2024-01-29 22:33:47 -08:00
Martin Kroeker d938aed7fe reset "mem structure overflowed" state on shutdown 2024-01-23 17:15:53 +01:00
Chris Sidebottom aaf65210cc Add dynamic support for Arm(R) Neoverse(TM) V2 processor
Whilst I figure out how best to map the L2 parameters without
duplicating all of `ARMV8SVE`, let's just map this to `NEOVERSEV1`.
2024-01-19 19:05:50 +00:00
Martin Kroeker 152a6c43b6 Add blas_omp_threads_local 2024-01-14 19:59:55 +01:00
Martin Kroeker 8a9d492af7 Add default for blas_omp_threads_local 2024-01-14 19:58:49 +01:00
Martin Kroeker 87d31af2ae Add openblas_set_num_threads_local() 2024-01-13 20:06:24 +01:00
Martin Kroeker e7a895e714 Add Apple M as NeoverseN1 2023-12-25 12:36:05 +01:00
Chris Sidebottom dc20a78188 Use functionally equivalent dynamic targets
Similar to `drivers/other/dynamic.c`, I've looked for functionally
equivalent targets and mapped them in the default DYNAMIC_ARCH build.
Users can still build specific cores using DYNAMIC_LIST.
2023-12-23 12:45:27 +00:00
Mark Seminatore 6bd7c54af5 introduce MT_TRACE to clean up SMP_DEBUG code 2023-12-11 15:13:04 -08:00
Mark Seminatore edac80d7e8 some cleanup, dynamically scale threads, add missing WIN_CASE defn 2023-12-07 14:59:27 -08:00
Mark Seminatore 4ebf814b42 fix bug failing to mark task as finished. 2023-12-05 23:28:37 -08:00
Mark Seminatore 5f51811728 try at new threading model 2023-12-05 22:43:36 -08:00
Shiyou Yin 1310a0931b loongarch: Refine build control for loongarch64.
1. Use getauxval instead of cpucfg to test hardware capability.
2. Remove unnecessary code and option for compiler check in c_check.
2023-11-28 20:23:55 +08:00
Chip-Kerchner d99aad8ee3 Fix older version of gcc - missing __has_builtin, cpuid and no support of P10. 2023-11-14 11:07:08 -06:00
Martin Kroeker 9b5f8eb33a Fix empty function prototypes 2023-11-12 19:35:53 +01:00
Martin Kroeker 9324520d0e typo fix 2023-11-11 23:14:58 +01:00
Martin Kroeker ff6437f2d7 Add workaround for omp_get_max_threads hanging on FreeBSD with libomp from LLVM14 2023-11-11 21:30:32 +01:00
Chip-Kerchner 4eecccd49b Fix __builtin_cpu_is for AIX. 2023-11-08 07:12:21 -06:00
Chip-Kerchner 5e31c57083 Only define __builtin_cpu_is and __builtin_cpu_supports if not present. 2023-11-07 20:58:34 -06:00
Chip-Kerchner 7dcb2d67f2 Have POWER7 return arch=POWER6. 2023-11-01 15:23:28 -05:00
Chip-Kerchner c8882bd9d8 Remove POWER7 from cpu list. 2023-11-01 14:53:55 -05:00
Chip Kerchner badfb2e60f Merge branch 'develop' into XLC-AIX 2023-10-26 09:19:31 -05:00
Martin Kroeker e12aaed13d Fix unwanted fallthrough from Intel Family 6 to 15 in case of identification failure 2023-10-18 16:28:54 +02:00
Chip-Kerchner 880af052dd Fix dynamic dispatch P9 for clang. 2023-10-06 13:41:49 -05:00
Chip-Kerchner 3655632611 Another small change. 2023-10-06 13:11:40 -05:00
Chip-Kerchner 36e08f6994 One more small change. 2023-10-06 13:08:41 -05:00
Chip-Kerchner 298bf1f240 Reduce differences. 2023-10-06 12:50:28 -05:00
Chip-Kerchner 71c6689af4 Fix dynamic dispatch to work for clang. 2023-10-06 12:20:40 -05:00
Chip-Kerchner c60f9d9c08 Add missing CPU_POWER5. 2023-10-06 09:49:17 -05:00
Chip Kerchner 3cc72a3797 Only include cpu_id and cpu_supports in AIX and fix parameter types. 2023-10-04 09:54:37 -05:00
Chip-Kerchner 09212f84bf Fix default case for cpu_is. 2023-10-03 12:23:21 -05:00
Chip-Kerchner 2d0b233425 Fix missing parens. 2023-10-03 10:26:14 -05:00
Chip-Kerchner a8c90eb3ed Added cpu_is 2023-10-03 10:24:04 -05:00
Chip-Kerchner b677d0d5fd Adding missing endif 2023-10-02 13:09:12 -05:00
Chip-Kerchner e5dc376912 Remove duplicate defines. 2023-10-02 12:48:47 -05:00
Chip-Kerchner 10210748de Revert PGI changes. 2023-10-02 12:44:07 -05:00
Chip-Kerchner a922a07e61 Cleanup white spaces. 2023-10-02 12:24:30 -05:00
Chip-Kerchner 12130ee961 Remove tab. 2023-10-02 12:19:22 -05:00