mips32 was seen to return zero here, driving nthreads to zero with subsequent fpe in blas_quickdivide