Werner Saar | 8310d4d3f7 | optimized dgemm for 20 threads | 2016-05-16 14:14:25 +02:00
Werner Saar | 9276c9012f | Optimized sgemm and dgemm and tested again. | 2016-04-21 11:37:57 +02:00
Werner Saar | 9c42f0374a | Updated cgemm- and sgemm-kernel for POWER8 SMP | 2016-04-07 15:08:15 +02:00
Werner Saar | cc26d888b8 | BUGFIX: increased BUFFER_SIZE for POWER8 | 2016-03-04 10:26:53 +01:00
Werner Saar | b752858d6c | added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 | 2016-03-01 07:33:56 +01:00
Grazvydas Ignotas | 6b92204a7c | add fallback blas_lock implementation to be used on armv5 and new platforms | 2015-08-16 18:59:17 +02:00
Grazvydas Ignotas | e12cf1123e | add fallback rpcc implementation: use on arm, arm64 and any new platform; use faster integer math instead of double; use similar scale as rdtsc so that timeouts work | 2015-08-16 18:59:16 +02:00
Matthew Brandyberry | 7ba4fe5afb | ppc64le platform support (ELF ABI v2) | 2015-07-21 22:20:19 -05:00
Timothy Gu | 6c2ead30f0 | Remove all trailing whitespace except lapack-netlib. Signed-off-by: Timothy Gu <timothygu99@gmail.com> | 2014-06-27 12:05:18 -07:00
Xianyi Zhang | 342bbc3871 | Import GotoBLAS2 1.13 BSD version codes. | 2011-01-24 14:54:24 +00:00