Zhang Xianyi
|
249917700d
|
Merge branch 'TimothyGu-develop' into develop
Fixed #398. Remove all trailing whitespace except lapack-netlib.
|
2014-06-28 20:52:07 +08:00 |
Zhang Xianyi
|
7a8949e0ce
|
Merge branch 'develop' of https://github.com/TimothyGu/OpenBLAS into TimothyGu-develop
Conflicts:
driver/others/memory.c
|
2014-06-28 20:51:31 +08:00 |
Zhang Xianyi
|
b82108f899
|
Merge pull request #399 from TimothyGu/upstr
Build import libs as .dll.a instead of .lib
|
2014-06-28 20:40:23 +08:00 |
Zhang Xianyi
|
8373ad4ec2
|
Merge pull request #397 from vtjnash/develop
fix #394
|
2014-06-28 20:38:48 +08:00 |
wernsaar
|
35d37e124f
|
bugfix for barcelona zgemv-kernel
|
2014-06-28 12:36:11 +02:00 |
wernsaar
|
d8ba46efdb
|
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
|
2014-06-28 12:16:20 +02:00 |
wernsaar
|
a15f22a1f6
|
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
|
2014-06-28 11:46:58 +02:00 |
wernsaar
|
b94ea89f52
|
bugfix for haswell cgemm- and zgemm-kernel
|
2014-06-28 10:22:40 +02:00 |
wernsaar
|
35f668bb14
|
bugfix for cgemm_kernel_8x2_sandy.S
|
2014-06-28 10:01:56 +02:00 |
Timothy Gu
|
4ebbf758f5
|
.gitignore: add some more entries concerned with kernel
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-06-27 13:58:42 -07:00 |
Timothy Gu
|
8615d6ec87
|
Build import libs as .dll.a instead of .lib
This is MinGW convention.
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-06-27 13:34:09 -07:00 |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-06-27 12:05:18 -07:00 |
Jameson Nash
|
f41f03ab83
|
fix #394. this cleans up some handles after using them, and doesn't disable ALL process privileges upon success
|
2014-06-27 12:16:57 -04:00 |
wernsaar
|
365e8de346
|
added optimized cgemm-kernel for SANDYBRIDGE
|
2014-06-27 13:40:29 +02:00 |
wernsaar
|
578d1b6219
|
added DSDOT definition and enabled optimized sdot kernel
|
2014-06-27 11:30:29 +02:00 |
wernsaar
|
a6ae079b17
|
added blas-test from lapack
|
2014-06-27 10:12:19 +02:00 |
Zhang Xianyi
|
d10db52edb
|
Merge pull request #390 from wernsaar/develop
Ref #103: enhancement for small matrix dimensions. Fixed some bugs. Enable sgemm for SNB and dgemm for NEHALEM
|
2014-06-27 14:57:06 +08:00 |
wernsaar
|
dabab2b5f4
|
added new optimized sgemm kernel for SANDYBRIGE
|
2014-06-26 21:42:08 +02:00 |
wernsaar
|
aa2709c4e0
|
enabled optimized dgemm kernel for NEHALEM
|
2014-06-26 12:22:29 +02:00 |
wernsaar
|
9d6f2b594e
|
Fortran flag -frecursive is disabled by default
|
2014-06-25 13:55:19 +02:00 |
wernsaar
|
a13bcc1716
|
enabled optimized sgemv kernel for barcelona and piledriver
|
2014-06-25 13:50:57 +02:00 |
wernsaar
|
d2c82d7543
|
enabled optimized sgemv kernel for HASWELL
|
2014-06-25 12:56:45 +02:00 |
wernsaar
|
0517672dd0
|
enabled optimized sgemv kernels for nehalem, sandybridge and bulldozer
|
2014-06-25 12:38:14 +02:00 |
wernsaar
|
15d5dfa92c
|
fixed compiler warnings
|
2014-06-25 11:32:44 +02:00 |
wernsaar
|
d83373db61
|
added parameter for gemm3m kernels
|
2014-06-25 10:40:25 +02:00 |
wernsaar
|
88b6bf251a
|
force fallback for x86 32bit
|
2014-06-22 17:27:11 +02:00 |
wernsaar
|
4a2ab7460b
|
Ref #391: force fallback for x86 32bit
|
2014-06-22 13:51:17 +02:00 |
wernsaar
|
86d8c8978b
|
Ref #391: disabled SMP in ger.c and zger.c
|
2014-06-22 12:01:24 +02:00 |
wernsaar
|
316df0e821
|
fixed bug for INTERFACE64
|
2014-06-22 09:49:20 +02:00 |
wernsaar
|
438002204d
|
Ref #393: fix for INTERFACE64=0 and ARCH_X86 in divtable
|
2014-06-21 12:29:23 +02:00 |
wernsaar
|
23203d52c1
|
Ref #380: lowered stack usage for haswell kernels
|
2014-06-19 14:31:52 +02:00 |
wernsaar
|
73545a79cd
|
Ref #380: lowered stack usage for piledriver and bulldozer kernels
|
2014-06-19 14:02:14 +02:00 |
wernsaar
|
a19d209005
|
Ref #103: enhancement for small matrix dimensions
|
2014-06-18 15:04:11 +02:00 |
Zhang Xianyi
|
8602816536
|
Merge pull request #387 from davidanthoff/fixbuilderroronwin
Add -lgfortran flag to gcc call in a makefile
|
2014-06-18 07:57:30 +08:00 |
Zhang Xianyi
|
d52863cfd7
|
Merge pull request #386 from wernsaar/develop
Some enhancements for dynamic_arch and some warning fixes
|
2014-06-18 07:56:08 +08:00 |
David Anthoff
|
c6361d63c2
|
Add -lgfortran flag to gcc call in a makefile
Adding $(EXTRALIB) adds this flag when things are built with
msys2 on windows. Without this the build fails.
|
2014-06-13 21:10:27 -07:00 |
wernsaar
|
53bfa51ee0
|
Ref #385: fixed warnings in dynamic.c
|
2014-06-12 18:17:08 +02:00 |
wernsaar
|
ff9cfca24c
|
Ref #385: added missing return instruction
|
2014-06-12 15:52:14 +02:00 |
wernsaar
|
a86d349a51
|
Ref #380: enhancements for dynamic_arch
|
2014-06-12 14:20:03 +02:00 |
Zhang Xianyi
|
7b277f0110
|
Merge pull request #384 from wernsaar/develop
Blas extensions
|
2014-06-11 09:49:27 +08:00 |
wernsaar
|
faeab93df0
|
Ref #51: added blas extensions simatcopy, dimatcopy, cimatcopy, zimatcopy
|
2014-06-10 16:14:34 +02:00 |
Zhang Xianyi
|
f773f492f3
|
Merge branch 'develop'
|
2014-06-10 21:55:47 +08:00 |
Zhang Xianyi
|
21a6b5f79e
|
OpenBLAS 0.2.9 Version.
|
2014-06-10 21:55:19 +08:00 |
wernsaar
|
cee257f384
|
Ref #51: added blas extensions zomatcopy and comatcopy
|
2014-06-10 10:34:54 +02:00 |
wernsaar
|
7bfb3011e8
|
Ref #51: added blas extension somatcopy
|
2014-06-09 20:21:13 +02:00 |
wernsaar
|
8c8f596238
|
Ref #51: added blas extension domatcopy as not opimized reference
|
2014-06-09 17:11:07 +02:00 |
wernsaar
|
bff575d0b1
|
Ref #375: added workaround for small sizes to scal.c and zscal.c
|
2014-06-08 13:49:19 +02:00 |
wernsaar
|
faf3ac0aad
|
Ref #285: added axpby kernels
|
2014-06-08 11:54:24 +02:00 |
Zhang Xianyi
|
a40116de25
|
Fixed generating DLL bug.
|
2014-06-06 16:13:08 +08:00 |
Zhang Xianyi
|
b31ec99372
|
Fixed #374.
Merge branch 'TimothyGu-develop' into develop
|
2014-06-05 17:01:44 +08:00 |