Zhang Xianyi
|
2a7503e563
|
Refs #225. Fixed a bug in GEMM OpenMP threading.
|
2013-07-15 09:56:19 +08:00 |
Zhang Xianyi
|
d744c9590a
|
In OpenMP threading, preallocate the thread buffer instead of allocating the buffer every time. This patch improved the performance slightly.
|
2013-03-01 14:36:47 +08:00 |
Zhang Xianyi
|
3cc6ae793e
|
Refs #174. Return sb pointer when OpenMP or Windows.
|
2013-02-26 00:48:21 +08:00 |
Xianyi Zhang
|
4727fe8abf
|
Refs #47. On Loongson 3A, set DGEMM_R parameter depending on different number of threads. It would improve double precision BLAS3 on multi-threads.
|
2011-09-05 15:13:52 +00:00 |
Xianyi Zhang
|
82f5274828
|
Refs #39. It's unnecessary to include sys/mman.h file in blas_server_omp.c.
|
2011-06-22 01:52:20 +08:00 |
Xianyi Zhang
|
989c6f8b06
|
Fixed #14 the SEGFAULT bug on 64 cores. On SMP server, the number of CPUs or cores should be less than or equal to 64.
|
2011-04-07 14:48:10 +08:00 |
Xianyi Zhang
|
342bbc3871
|
Import GotoBLAS2 1.13 BSD version codes.
|
2011-01-24 14:54:24 +00:00 |