ger and gemv call blas_memory_alloc/free which in their turn call blas_lock. blas_lock create thread contention when matrices are small and the number of thread is high enough. We avoid call blas_memory_alloc by replacing it with stack allocation. This can be enabled with: make -DMAX_STACK_ALLOC=2048 The given size (in byte) must be high enough to avoid thread contention and small enough to avoid stack overflow. Fix #478 |
||
|---|---|---|
| .. | ||
| alpha | ||
| arm | ||
| arm64 | ||
| generic | ||
| ia64 | ||
| mips64 | ||
| power | ||
| sparc | ||
| x86 | ||
| x86_64 | ||
| Makefile | ||
| Makefile.L1 | ||
| Makefile.L2 | ||
| Makefile.L3 | ||
| Makefile.LA | ||
| setparam-ref.c | ||