Werner Saar
63a7d7fb24
updated gemv_n_vfpv3.S for armv7
2016-01-25 15:00:13 +01:00
Werner Saar
b4ede558a5
updated nrm2 kernel for armv7
2016-01-25 11:55:25 +01:00
Werner Saar
de3e2d4349
updated trmm kernels for armv7
2016-01-25 11:08:56 +01:00
Werner Saar
a0e51e96f1
updated gemm kernels for armv7
2016-01-25 10:46:10 +01:00
Lauri Tirkkonen
d6afac9624
don't pass -Y at all to the linker on illumos
...
the illumos linker can't understand the "-Y/lib"... form that f_check
generates, and -Wl cannot pass options that include commas
2016-01-25 11:09:34 +02:00
Werner Saar
c2891330bc
updated KERNEL.ARMV6
2016-01-24 17:12:07 +01:00
Werner Saar
ceaa931e48
updated gemv kernel for armv6
2016-01-24 16:31:19 +01:00
Werner Saar
eaa63165df
updated cgemv and zgemv kernels for armv6
2016-01-24 14:42:38 +01:00
Werner Saar
c65357c566
updated trmm_kernels for armv6
2016-01-24 13:03:33 +01:00
Werner Saar
e63e9f9f26
updated gemm_kernels for armv6
2016-01-24 11:55:50 +01:00
Jerome Robert
1fe3aab047
Use GEMM_MULTITHREAD_THRESHOLD as a number of ops
...
...not a matrix size. For GEMM_MULTITHREAD_THRESHOLD=4
(the default value) this does not change anything but
for other values it make the GEMM and GEMV thresholds
changing in the same way.
Close #742
2016-01-24 11:31:40 +01:00
Werner Saar
aafd3ab60e
updated cdot and zdot on arm
2016-01-24 10:56:49 +01:00
Jerome Robert
1a1935507b
[z]ger: increase multithread threshold
...
The ones given in 3ae30cd
was by far to low because I
mixed m and m*n in my measures. Note that the new ones
are closed to the [z]gemv ones which is comforting
that both are right.
2016-01-24 10:46:35 +01:00
Werner Saar
d2f84c9c8a
Ref #740 : updated nrm2_vfp.S
2016-01-23 17:47:58 +01:00
Werner Saar
ca32253f32
Ref #740 : updated asum_vfp.S and iamax_vfp.S
2016-01-23 14:44:34 +01:00
Werner Saar
9066d1f982
Ref #750 and Ref #740 : bugfix for sdot, dsdot and ddot on arm
2016-01-23 11:59:51 +01:00
Lauri Tirkkonen
8d85be770d
actually install the shared lib on illumos
2016-01-22 18:56:03 +02:00
Lauri Tirkkonen
7ba1d9b9ca
actually build the shared lib on illumos
2016-01-22 18:55:59 +02:00
Lauri Tirkkonen
31aff441ce
use $(AWK) in Makefile.install and switch it to nawk
2016-01-22 18:55:55 +02:00
Lauri Tirkkonen
e737e32fd1
RLIMIT_NPROC doesn't exist on illumos
2016-01-22 18:55:51 +02:00
Lauri Tirkkonen
8635d425c1
make parallel make work on illumos
2016-01-22 18:55:48 +02:00
Lauri Tirkkonen
97cd4b8aee
illumos fixes to memory.c
2016-01-22 18:55:43 +02:00
wernsaar
72390e3ffb
Merge pull request #747 from wernsaar/develop
...
Ref #730 : added performance updates for syrk and syr2k
2016-01-21 14:21:59 +01:00
Werner Saar
b07d733a71
added updates for syrk and syr2k
2016-01-21 13:16:44 +01:00
Zhang Xianyi
fa3018c30e
Merge pull request #745 from jakirkham/minor_fix_scipy_prof
...
BENCH: Minor fixes in SciPy benchmarks
2016-01-20 11:24:22 -06:00
Zhang Xianyi
6caa40302e
Merge pull request #744 from jeromerobert/bug731
...
Bug731
2016-01-20 11:18:21 -06:00
John Kirkham
a48b247e9e
benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type.
2016-01-19 15:32:28 -05:00
John Kirkham
b1b115ecd6
benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type.
2016-01-19 15:31:37 -05:00
John Kirkham
07bba933ff
benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order.
2016-01-19 15:29:43 -05:00
John Kirkham
e85f8af519
benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order.
2016-01-19 15:28:22 -05:00
John Kirkham
adfa0ab878
benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues.
2016-01-19 15:06:17 -05:00
John Kirkham
cbb6649e97
benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues.
2016-01-19 15:05:18 -05:00
John Kirkham
77abc9b280
benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`.
2016-01-19 15:00:54 -05:00
John Kirkham
81e8690763
benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`.
2016-01-19 15:00:23 -05:00
John Kirkham
dd04a8ac22
benchmark/scripts/SCIPY/ssyrk.py: Use the environment python.
2016-01-19 14:05:14 -05:00
John Kirkham
cb554b3a9c
benchmark/scripts/SCIPY/dsyrk.py: Use the environment python.
2016-01-19 14:04:55 -05:00
John Kirkham
1153459d1b
benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons.
2016-01-19 14:00:51 -05:00
John Kirkham
1a73390ffe
benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons.
2016-01-19 14:00:51 -05:00
John Kirkham
8b981e41a1
benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it.
2016-01-19 14:00:48 -05:00
John Kirkham
c10b1f555d
benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it.
2016-01-19 14:00:26 -05:00
Jerome Robert
14db1ca508
update CONTRIBUTORS.md
2016-01-19 17:15:31 +01:00
Jerome Robert
66eafb16cf
swap: disable multi-threading for small matrices
...
Close #731
2016-01-19 17:14:46 +01:00
Jerome Robert
3ae30cd6b9
Disable multi-threading for small matrices in [z]ger
...
Ref #731
2016-01-19 17:14:31 +01:00
Werner Saar
692d9c881c
Ref #740 : simple solution to clear floating point register on arm
2016-01-17 15:37:12 +01:00
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
2016-01-15 06:42:54 +08:00
Zhang Xianyi
ce2b1edd4e
[av skip] Change test cmd on Travis.
2016-01-12 20:44:49 -06:00
Zhang Xianyi
8cf3657fb6
Refs #738 . Fix previous commit bug. Run BLAS and CBLAS test on Travis.
2016-01-12 20:01:49 -06:00
Zhang Xianyi
44222a7fe0
Refs #738 . Run test on Travis.
2016-01-12 22:52:47 +00:00
Zhang Xianyi
3ac153180c
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2016-01-12 22:25:36 +00:00
Zhang Xianyi
96b486acee
Merge branch 'jeromerobert-bug736' into develop
2016-01-12 22:25:08 +00:00