OpenBLAS/driver/others
Chen, Guobing 0c1c903f1e Fix OMP num specify issue
In current code, no matter what number of threads specified, all
available CPU count is used when invoking OMP, which leads to very bad
performance if the workload is small while all available CPUs are big.
Lots of time are wasted on inter-thread sync. Fix this issue by really
using the number specified by the variable 'num' from calling API.

Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
2020-08-24 02:45:54 +08:00
..
CMakeLists.txt Fix typo in previous commit for arm dynamic arch 2018-12-07 19:37:33 +01:00
Makefile Protect against inadvertent activation of USE_CUDA 2020-08-01 12:31:39 +02:00
abs.c Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
blas_l1_thread.c modify the blas_l1_thread.c for support multi-threded for L1 fuction with return value 2017-01-10 11:47:06 +08:00
blas_server.c Update conditional for atomics to use HAVE_C11 2020-07-18 17:03:31 +00:00
blas_server_omp.c Fix OMP num specify issue 2020-08-24 02:45:54 +08:00
blas_server_win32.c driver: more reasonable thread wait timeout on Windows. 2019-12-13 09:52:33 +01:00
divtable.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
dynamic.c Add AMD Renoir/Matisse and preliminary support for Zen3 as Zen2 2020-07-28 13:53:17 +00:00
dynamic_arm64.c ARM64: Add THUNDERX3T110 Target 2020-07-26 23:32:24 -07:00
dynamic_power.c Change minimum gcc version for POWER10 2020-07-09 21:46:06 -05:00
dynamic_zarch.c s390x: choose SIMD kernels at run-time based on OS and compiler support 2020-05-12 11:01:16 +02:00
init.c Fix errors in cpu enumeration with glibc 2.6 2019-05-07 13:34:52 +02:00
lamc3.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lamch.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
lsame.c Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
memory.c Update conditional for atomics to HAVE_C11 2020-07-18 17:07:38 +00:00
memory_qalloc.c Disable the old QCDOC qalloc by default and copy utility functions from memory.c 2019-11-17 19:22:04 +01:00
openblas_env.c Refs #716. Only call getenv at init function. 2016-03-09 12:50:07 -05:00
openblas_error_handle.c Refs #716. Only call getenv at init function. 2016-03-09 12:50:07 -05:00
openblas_get_config.c address minor warnings from gcc7 2019-09-07 10:21:08 +03:00
openblas_get_num_procs.c Introduce openblas_get_num_threads and openblas_get_num_procs 2015-02-03 12:23:41 -05:00
openblas_get_num_threads.c Introduce openblas_get_num_threads and openblas_get_num_procs 2015-02-03 12:23:41 -05:00
openblas_get_parallel.c Update organization info. 2014-11-25 15:28:58 +08:00
openblas_set_num_threads.c Update organization info. 2014-11-25 15:28:58 +08:00
parameter.c Enable COOPERLAKE build target 2020-08-13 06:18:00 +08:00
profile.c Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
xerbla.c Update xerbla.c 2017-01-04 23:16:48 +01:00