OpenBLAS/lapack/getrf
Martin Kroeker bb9876db33
Fix thread races and infinite looping on systems with many cpus
On systems with more than 64 cpus, blas_quickdivide will sometimes return zero which creates bogus workloads when used for the stride calculation. This then leads to threads spinning incessantly waiting for a status change that never happens, as seen in #1497.
This patch also fixes several data races that were found by helgrind and/or tsan while debugging the issue.
2018-04-04 18:16:52 +02:00
..
Makefile Import GotoBLAS2 1.13 BSD version codes. 2011-01-24 14:54:24 +00:00
getrf_parallel.c Fix thread races and infinite looping on systems with many cpus 2018-04-04 18:16:52 +02:00
getrf_parallel_omp.c prepared lapack/getrf functions for UNROLL values, that are not a power of two 2017-01-09 12:57:26 +01:00
getrf_single.c LAPACK helpers in C that need care too 2018-01-02 14:38:50 +01:00