OpenBLAS

History

Arjan van de Ven 5b708e5eb1 sgemm/dgemm: add a way for an arch kernel to specify prefered sizes The current gemm threading code can make very unfortunate choices, for example on my 10 core system a 1024x1024x1024 matrix multiply ends up chunking into blocks of 102... which is not a vector friendly size and performance ends up horrible. this patch adds a helper define where an architecture can specify a preference for size multiples. This is different from existing defines that are minimum sizes and such. The performance increase with this patch for the 1024x1024x1024 sgemm is 2.3x (!!)		2018-11-01 01:43:20 +00:00
..
level2	Merge pull request #1403 from brada4/develop	2017-12-30 14:51:34 +01:00
level3	sgemm/dgemm: add a way for an arch kernel to specify prefered sizes	2018-11-01 01:43:20 +00:00
mapper	Remove all trailing whitespace except lapack-netlib	2014-06-27 12:05:18 -07:00
others	Merge pull request #1843 from martin-frbg/aix_numprocs	2018-10-31 21:25:15 +01:00