Commit Graph

7452 Commits

Author SHA1 Message Date
Zhang Xianyi 6caa40302e Merge pull request #744 from jeromerobert/bug731
Bug731
2016-01-20 11:18:21 -06:00
John Kirkham a48b247e9e benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type. 2016-01-19 15:32:28 -05:00
John Kirkham b1b115ecd6 benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type. 2016-01-19 15:31:37 -05:00
John Kirkham 07bba933ff benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order. 2016-01-19 15:29:43 -05:00
John Kirkham e85f8af519 benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order. 2016-01-19 15:28:22 -05:00
John Kirkham adfa0ab878 benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues. 2016-01-19 15:06:17 -05:00
John Kirkham cbb6649e97 benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues. 2016-01-19 15:05:18 -05:00
John Kirkham 77abc9b280 benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`. 2016-01-19 15:00:54 -05:00
John Kirkham 81e8690763 benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`. 2016-01-19 15:00:23 -05:00
John Kirkham dd04a8ac22 benchmark/scripts/SCIPY/ssyrk.py: Use the environment python. 2016-01-19 14:05:14 -05:00
John Kirkham cb554b3a9c benchmark/scripts/SCIPY/dsyrk.py: Use the environment python. 2016-01-19 14:04:55 -05:00
John Kirkham 1153459d1b benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons. 2016-01-19 14:00:51 -05:00
John Kirkham 1a73390ffe benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons. 2016-01-19 14:00:51 -05:00
John Kirkham 8b981e41a1 benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it. 2016-01-19 14:00:48 -05:00
John Kirkham c10b1f555d benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it. 2016-01-19 14:00:26 -05:00
Jerome Robert 14db1ca508 update CONTRIBUTORS.md 2016-01-19 17:15:31 +01:00
Jerome Robert 66eafb16cf swap: disable multi-threading for small matrices
Close #731
2016-01-19 17:14:46 +01:00
Jerome Robert 3ae30cd6b9 Disable multi-threading for small matrices in [z]ger
Ref #731
2016-01-19 17:14:31 +01:00
Werner Saar 692d9c881c Ref #740: simple solution to clear floating point register on arm 2016-01-17 15:37:12 +01:00
Zhang Xianyi 055b481386 Fixed CMake bug for single core. 2016-01-15 06:42:54 +08:00
Zhang Xianyi ce2b1edd4e [av skip] Change test cmd on Travis. 2016-01-12 20:44:49 -06:00
Zhang Xianyi 8cf3657fb6 Refs #738. Fix previous commit bug. Run BLAS and CBLAS test on Travis. 2016-01-12 20:01:49 -06:00
Zhang Xianyi 44222a7fe0 Refs #738. Run test on Travis. 2016-01-12 22:52:47 +00:00
Zhang Xianyi 3ac153180c Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop 2016-01-12 22:25:36 +00:00
Zhang Xianyi 96b486acee Merge branch 'jeromerobert-bug736' into develop 2016-01-12 22:25:08 +00:00
Zhang Xianyi 3602a2cd1f #736 Revert #733 patch to fix bus error on ARM. 2016-01-12 22:19:58 +00:00
Zhang Xianyi b65de4947a Merge pull request #739 from sebastien-villemot/develop
Fixes for old outstanding bugs in CBLAS test programs
2016-01-12 14:47:34 -06:00
Sébastien Villemot 04ad946fc8 Fix output descriptors of c_{s,d,c,z}blat3
The NTRA argument can be equal to -1 if one does not want a snapshot file
(and this is the case with sample data {s,d,c,z}in3).
The routines {S,D,C,Z}PRCN3 will try to use their first argument as an output
unit number, so we avoid calling them when NTRA < 0.

Patch originally written by Camm Maguire.
2016-01-11 11:22:17 +01:00
Sébastien Villemot f704b8d32f Fix CBLAS double complex level 2 tests
The SNAME variable contains names of C functions like "cblas_dgemv".
Apparently the code was not taking into account the 6-letter "cblas_"
prefix when determining the task to be done.

The issue does not affect c_{s,d,c}blat2.f, which use the correct
offsetting.

Patch originally written by Camm Maguire.
2016-01-11 11:15:33 +01:00
Jerome Robert 708ad330ac stack alloc: Fix stack smashing detection in 32bits
* Fix commit 87a2ccc
* Close #736
2016-01-10 19:04:37 +01:00
Werner Saar c6a27bbe64 added benchmark tests for ssyrk and dsyrk 2016-01-10 12:19:03 +01:00
Zhang Xianyi f16b4f10b6 Merge pull request #734 from jeromerobert/common_stackalloc
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
2016-01-08 22:13:37 -06:00
Jerome Robert 87a2ccc37c Factorize MAX_STACK_ALLOC code to common_stackalloc.h
Ref #727
2016-01-08 16:03:52 +01:00
Zhang Xianyi e3e20e2242 Merge pull request #733 from yuyichao/arm-asm
Do not use vsub to clear the register values
2016-01-05 19:35:12 -06:00
Yichao Yu 594b9f4c73 Do not use vsub to clear the register values since it doesn't work with non-normal numbers. 2016-01-05 16:54:05 +00:00
wernsaar c96c6a26fd Merge pull request #732 from wernsaar/develop
added optimized trsm_kernels
2016-01-05 15:34:08 +01:00
Werner Saar c8f2c5d636 added optimized trsm_kernels 2016-01-05 13:05:05 +01:00
Werner Saar 5f2fa15e04 include sched.h if OS is Android 2016-01-05 12:36:49 +01:00
Zhang Xianyi 7d144aaabc Merge pull request #728 from jeromerobert/fix-no-stack-alloc
Fix make MAX_STACK_ALLOC=0
2016-01-04 15:04:24 -06:00
Jerome Robert f9890a6452 Fix compilation when MAX_STACK_ALLOC is not set
Close #722
2015-12-31 14:43:09 +01:00
Jerome Robert 2c7143459f Let make MAX_STACK_ALLOC=0 do what expected
It's no longer required to modify Makefile.rule to disable
stack allocation. It's now possible to run:

make MAX_STACK_ALLOC=0
2015-12-31 14:43:09 +01:00
Zhang Xianyi 3857581adf Merge pull request #726 from jeromerobert/amd-e2-3200
Fix detection of AMD E2-3200
2015-12-28 12:53:11 -06:00
Zhang Xianyi e9754e6250 Merge pull request #725 from jeromerobert/make-nb-jobs
Allow to force the number of parallel make job
2015-12-28 12:48:49 -06:00
Jerome Robert 76398c3233 Fix detection of AMD E2-3200 2015-12-28 19:45:47 +01:00
Jerome Robert ba024fcfc0 Allow to force the number of parallel make job
This is particularly useful when using distcc
2015-12-28 19:45:29 +01:00
Zhang Xianyi b9b52c295d Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop 2015-12-14 10:07:10 -06:00
Zhang Xianyi 285d042b10 Fixed rotg bug on ARM. 2015-12-14 10:07:01 -06:00
Zhang Xianyi 01db7908b8 Merge pull request #713 from btracey/patch-2
Fix Dormbr to perform the correct size operations with RowMajor
2015-12-10 10:13:49 -06:00
Zhang Xianyi 5f75df40d5 Merge pull request #711 from btracey/patch-1
Fix Dormlq to perform the correct size operations with RowMajor
2015-12-10 10:13:12 -06:00
Brendan Tracey b3f100dc25 Fix Dormbr to perform the correct size operations with RowMajor
Fixes issue #712
2015-12-09 00:50:22 -07:00