Zhang Xianyi
6caa40302e
Merge pull request #744 from jeromerobert/bug731
...
Bug731
2016-01-20 11:18:21 -06:00
John Kirkham
a48b247e9e
benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type.
2016-01-19 15:32:28 -05:00
John Kirkham
b1b115ecd6
benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type.
2016-01-19 15:31:37 -05:00
John Kirkham
07bba933ff
benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order.
2016-01-19 15:29:43 -05:00
John Kirkham
e85f8af519
benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order.
2016-01-19 15:28:22 -05:00
John Kirkham
adfa0ab878
benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues.
2016-01-19 15:06:17 -05:00
John Kirkham
cbb6649e97
benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues.
2016-01-19 15:05:18 -05:00
John Kirkham
77abc9b280
benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`.
2016-01-19 15:00:54 -05:00
John Kirkham
81e8690763
benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`.
2016-01-19 15:00:23 -05:00
John Kirkham
dd04a8ac22
benchmark/scripts/SCIPY/ssyrk.py: Use the environment python.
2016-01-19 14:05:14 -05:00
John Kirkham
cb554b3a9c
benchmark/scripts/SCIPY/dsyrk.py: Use the environment python.
2016-01-19 14:04:55 -05:00
John Kirkham
1153459d1b
benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons.
2016-01-19 14:00:51 -05:00
John Kirkham
1a73390ffe
benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons.
2016-01-19 14:00:51 -05:00
John Kirkham
8b981e41a1
benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it.
2016-01-19 14:00:48 -05:00
John Kirkham
c10b1f555d
benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it.
2016-01-19 14:00:26 -05:00
Jerome Robert
14db1ca508
update CONTRIBUTORS.md
2016-01-19 17:15:31 +01:00
Jerome Robert
66eafb16cf
swap: disable multi-threading for small matrices
...
Close #731
2016-01-19 17:14:46 +01:00
Jerome Robert
3ae30cd6b9
Disable multi-threading for small matrices in [z]ger
...
Ref #731
2016-01-19 17:14:31 +01:00
Werner Saar
692d9c881c
Ref #740: simple solution to clear floating-point registers on ARM
2016-01-17 15:37:12 +01:00
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
2016-01-15 06:42:54 +08:00
Zhang Xianyi
ce2b1edd4e
[av skip] Change test cmd on Travis.
2016-01-12 20:44:49 -06:00
Zhang Xianyi
8cf3657fb6
Refs #738. Fix bug in previous commit. Run BLAS and CBLAS tests on Travis.
2016-01-12 20:01:49 -06:00
Zhang Xianyi
44222a7fe0
Refs #738. Run tests on Travis.
2016-01-12 22:52:47 +00:00
Zhang Xianyi
3ac153180c
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2016-01-12 22:25:36 +00:00
Zhang Xianyi
96b486acee
Merge branch 'jeromerobert-bug736' into develop
2016-01-12 22:25:08 +00:00
Zhang Xianyi
3602a2cd1f
#736 Revert #733 patch to fix bus error on ARM.
2016-01-12 22:19:58 +00:00
Zhang Xianyi
b65de4947a
Merge pull request #739 from sebastien-villemot/develop
...
Fixes for old outstanding bugs in CBLAS test programs
2016-01-12 14:47:34 -06:00
Sébastien Villemot
04ad946fc8
Fix output descriptors of c_{s,d,c,z}blat3
...
The NTRA argument can be equal to -1 if one does not want a snapshot file
(and this is the case with sample data {s,d,c,z}in3).
The routines {S,D,C,Z}PRCN3 will try to use their first argument as an output
unit number, so we avoid calling them when NTRA < 0.
Patch originally written by Camm Maguire.
2016-01-11 11:22:17 +01:00
Sébastien Villemot
f704b8d32f
Fix CBLAS double complex level 2 tests
...
The SNAME variable contains names of C functions like "cblas_dgemv".
Apparently the code was not taking into account the 6-letter "cblas_"
prefix when determining the task to be done.
The issue does not affect c_{s,d,c}blat2.f, which use the correct
offsetting.
Patch originally written by Camm Maguire.
2016-01-11 11:15:33 +01:00
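The prefix issue described in the commit above can be sketched in a few lines. This is only an illustration in Python, not the Fortran code from the test programs: the helper names `routine_key` and `buggy_key` are hypothetical, and the way the original code indexed into `SNAME` is an assumption.

```python
# Illustrative sketch only; the actual tests are Fortran sources.
# Assumption: the routine is identified by the characters of SNAME
# that follow the "cblas_" prefix.

CBLAS_PREFIX = "cblas_"  # 6 characters


def routine_key(sname: str) -> str:
    """Return the routine name with the "cblas_" prefix skipped."""
    if sname.startswith(CBLAS_PREFIX):
        return sname[len(CBLAS_PREFIX):]
    return sname


def buggy_key(sname: str) -> str:
    """Hypothetical faulty lookup that ignores the prefix offset."""
    return sname[:5]


print(routine_key("cblas_dgemv"))  # dgemv
print(buggy_key("cblas_dgemv"))    # cblas
```

Without the offset, every name yields the same leading characters, so no routine-specific branch can be selected.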
Jerome Robert
708ad330ac
stack alloc: Fix stack smashing detection in 32bits
...
* Fix commit 87a2ccc
* Close #736
2016-01-10 19:04:37 +01:00
Werner Saar
c6a27bbe64
added benchmark tests for ssyrk and dsyrk
2016-01-10 12:19:03 +01:00
Zhang Xianyi
f16b4f10b6
Merge pull request #734 from jeromerobert/common_stackalloc
...
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
2016-01-08 22:13:37 -06:00
Jerome Robert
87a2ccc37c
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
...
Ref #727
2016-01-08 16:03:52 +01:00
Zhang Xianyi
e3e20e2242
Merge pull request #733 from yuyichao/arm-asm
...
Do not use vsub to clear the register values
2016-01-05 19:35:12 -06:00
Yichao Yu
594b9f4c73
Do not use vsub to clear the register values since it doesn't work with non-normal numbers.
2016-01-05 16:54:05 +00:00
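One reading of "non-normal numbers" in the commit above is IEEE 754 infinities and NaNs, for which `x - x` does not yield zero. The sketch below shows that arithmetic in Python rather than ARM assembly; the function names are hypothetical and it makes no claim about the actual kernel code.

```python
import math


def clear_by_subtract(x: float) -> float:
    """Mimic clearing a register by subtracting it from itself."""
    return x - x


def clear_by_assign(x: float) -> float:
    """Clear by writing zero, regardless of the previous content."""
    return 0.0


for value in (1.5, math.inf, math.nan):
    # Subtraction gives 0.0 only for finite inputs; inf and nan both give nan.
    print(value, clear_by_subtract(value), clear_by_assign(value))
```

A register that holds leftover infinity or NaN would therefore stay "dirty" after a subtract-based clear, while an explicit zero write is independent of the old value.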
wernsaar
c96c6a26fd
Merge pull request #732 from wernsaar/develop
...
added optimized trsm_kernels
2016-01-05 15:34:08 +01:00
Werner Saar
c8f2c5d636
added optimized trsm_kernels
2016-01-05 13:05:05 +01:00
Werner Saar
5f2fa15e04
include sched.h if OS is Android
2016-01-05 12:36:49 +01:00
Zhang Xianyi
7d144aaabc
Merge pull request #728 from jeromerobert/fix-no-stack-alloc
...
Fix make MAX_STACK_ALLOC=0
2016-01-04 15:04:24 -06:00
Jerome Robert
f9890a6452
Fix compilation when MAX_STACK_ALLOC is not set
...
Close #722
2015-12-31 14:43:09 +01:00
Jerome Robert
2c7143459f
Let make MAX_STACK_ALLOC=0 do what is expected
...
It's no longer required to modify Makefile.rule to disable
stack allocation. It's now possible to run:
make MAX_STACK_ALLOC=0
2015-12-31 14:43:09 +01:00
Zhang Xianyi
3857581adf
Merge pull request #726 from jeromerobert/amd-e2-3200
...
Fix detection of AMD E2-3200
2015-12-28 12:53:11 -06:00
Zhang Xianyi
e9754e6250
Merge pull request #725 from jeromerobert/make-nb-jobs
...
Allow forcing the number of parallel make jobs
2015-12-28 12:48:49 -06:00
Jerome Robert
76398c3233
Fix detection of AMD E2-3200
2015-12-28 19:45:47 +01:00
Jerome Robert
ba024fcfc0
Allow forcing the number of parallel make jobs
...
This is particularly useful when using distcc
2015-12-28 19:45:29 +01:00
Zhang Xianyi
b9b52c295d
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-12-14 10:07:10 -06:00
Zhang Xianyi
285d042b10
Fixed rotg bug on ARM.
2015-12-14 10:07:01 -06:00
Zhang Xianyi
01db7908b8
Merge pull request #713 from btracey/patch-2
...
Fix Dormbr to perform the correct size operations with RowMajor
2015-12-10 10:13:49 -06:00
Zhang Xianyi
5f75df40d5
Merge pull request #711 from btracey/patch-1
...
Fix Dormlq to perform the correct size operations with RowMajor
2015-12-10 10:13:12 -06:00
Brendan Tracey
b3f100dc25
Fix Dormbr to perform the correct size operations with RowMajor
...
Fixes issue #712
2015-12-09 00:50:22 -07:00