Zhang Xianyi
3602a2cd1f
#736 Revert #733 patch to fix bus error on ARM.
2016-01-12 22:19:58 +00:00
Zhang Xianyi
b65de4947a
Merge pull request #739 from sebastien-villemot/develop
...
Fixes for old outstanding bugs in CBLAS test programs
2016-01-12 14:47:34 -06:00
Sébastien Villemot
04ad946fc8
Fix output descriptors of c_{s,d,c,z}blat3
...
The NTRA argument can be equal to -1 if one does not want a snapshot file
(and this is the case with sample data {s,d,c,z}in3).
The routines {S,D,C,Z}PRCN3 will try to use their first argument as an output
unit number, so we avoid calling them when NTRA < 0.
Patch originally written by Camm Maguire.
2016-01-11 11:22:17 +01:00
Sébastien Villemot
f704b8d32f
Fix CBLAS double complex level 2 tests
...
The SNAME variable contains names of C functions like "cblas_dgemv".
Apparently the code was not taking into account the 6-letter "cblas_"
prefix when determining the task to be done.
The issue does not affect c_{s,d,c}blat2.f, which use the correct
offsetting.
Patch originally written by Camm Maguire.
2016-01-11 11:15:33 +01:00
Jerome Robert
708ad330ac
stack alloc: Fix stack smashing detection in 32bits
...
* Fix commit 87a2ccc
* Close #736
2016-01-10 19:04:37 +01:00
Werner Saar
c6a27bbe64
added benchmark tests for ssyrk and dsyrk
2016-01-10 12:19:03 +01:00
Zhang Xianyi
f16b4f10b6
Merge pull request #734 from jeromerobert/common_stackalloc
...
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
2016-01-08 22:13:37 -06:00
Jerome Robert
87a2ccc37c
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
...
Ref #727
2016-01-08 16:03:52 +01:00
Zhang Xianyi
e3e20e2242
Merge pull request #733 from yuyichao/arm-asm
...
Do not use vsub to clear the register values
2016-01-05 19:35:12 -06:00
Yichao Yu
594b9f4c73
Do not use vsub to clear the register values since it doesn't work with non-normal numbers.
2016-01-05 16:54:05 +00:00
wernsaar
c96c6a26fd
Merge pull request #732 from wernsaar/develop
...
added optimized trsm_kernels
2016-01-05 15:34:08 +01:00
Werner Saar
c8f2c5d636
added optimized trsm_kernels
2016-01-05 13:05:05 +01:00
Werner Saar
5f2fa15e04
include sched.h if OS is Android
2016-01-05 12:36:49 +01:00
Zhang Xianyi
7d144aaabc
Merge pull request #728 from jeromerobert/fix-no-stack-alloc
...
Fix make MAX_STACK_ALLOC=0
2016-01-04 15:04:24 -06:00
Jerome Robert
f9890a6452
Fix compilation when MAX_STACK_ALLOC is not set
...
Close #722
2015-12-31 14:43:09 +01:00
Jerome Robert
2c7143459f
Let make MAX_STACK_ALLOC=0 do what expected
...
It's no longer required to modify Makefile.rule to disable
stack allocation. It's now possible to run:
make MAX_STACK_ALLOC=0
2015-12-31 14:43:09 +01:00
Zhang Xianyi
3857581adf
Merge pull request #726 from jeromerobert/amd-e2-3200
...
Fix detection of AMD E2-3200
2015-12-28 12:53:11 -06:00
Zhang Xianyi
e9754e6250
Merge pull request #725 from jeromerobert/make-nb-jobs
...
Allow to force the number of parallel make job
2015-12-28 12:48:49 -06:00
Jerome Robert
76398c3233
Fix detection of AMD E2-3200
2015-12-28 19:45:47 +01:00
Jerome Robert
ba024fcfc0
Allow to force the number of parallel make job
...
This is particularly useful when using distcc
2015-12-28 19:45:29 +01:00
Zhang Xianyi
b9b52c295d
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-12-14 10:07:10 -06:00
Zhang Xianyi
285d042b10
Fixed rotg bug on ARM.
2015-12-14 10:07:01 -06:00
Zhang Xianyi
01db7908b8
Merge pull request #713 from btracey/patch-2
...
Fix Dormbr to perform the correct size operations with RowMajor
2015-12-10 10:13:49 -06:00
Zhang Xianyi
5f75df40d5
Merge pull request #711 from btracey/patch-1
...
Fix Dormlq to perform the correct size operations with RowMajor
2015-12-10 10:13:12 -06:00
Brendan Tracey
b3f100dc25
Fix Dormbr to perform the correct size operations with RowMajor
...
Fixes issue #712
2015-12-09 00:50:22 -07:00
Brendan Tracey
2f65aad626
Fix Dormlq to perform the correct size operations with RowMajor
...
Fixes issue #615 .
2015-12-08 22:34:21 -07:00
Zhang Xianyi
25116788ef
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-12-05 00:46:42 +08:00
Zhang Xianyi
958f0de65e
Refs #708 . Modified config template for MSVC.
2015-12-05 00:45:29 +08:00
Zhang Xianyi
5d212f66a7
Refs #706 . Fixed lapacke installation error.
2015-12-03 01:32:39 +08:00
Zhang Xianyi
f88ee18409
Merge pull request #704 from tkelman/patch-1
...
fix makefile warning when renaming symbols
2015-11-30 22:37:25 -06:00
Tony Kelman
d22917a58a
fix makefile warning when renaming symbols
...
use different names for `openblas*.renamed` between osx and other unices, fixes
```
Makefile:121: warning: overriding commands for target `../libopenblas64_p-r0.2.15.a.renamed'
Makefile💯 warning: ignoring old commands for target `../libopenblas64_p-r0.2.15.a.renamed'
```
also clean `*.renamed`
2015-11-30 20:16:33 -08:00
Zhang Xianyi
640cccc2b1
Refs #697 . Fixed gemv bug for Windows.
...
Thank matzeri's patch.
2015-11-30 15:19:45 -06:00
Zhang Xianyi
fba6532502
Refs #702 . Delete redundant xerbla exporting
2015-11-30 11:08:33 -06:00
Zhang Xianyi
da7f69e8f4
Refs #699 . Split the obj list of LAPACKE 3.6.0.
2015-11-24 13:15:28 -06:00
Zhang Xianyi
044fb91ea5
Merge pull request #690 from rayglover/msvc-fix
...
(Visual Studio) Don't use C99 complex numbers when building C++ code.
2015-11-23 11:05:37 -06:00
Zhang Xianyi
b4380acf77
Merge pull request #696 from ashwinyes/develop_20151120_lapack_test_fixes
...
Cortex A57 fixes and Lapack 3.6.0
2015-11-23 11:04:42 -06:00
Werner Saar
d1dd4e302e
fix for bad or outdated mingw compiler
2015-11-23 16:20:14 +01:00
Ashwin Sekhar T K
318f0949c3
lapack-test fixes in nrm2 kernels for Cortex A57
2015-11-23 13:43:36 +05:30
Werner Saar
299cdcdc29
lapack fixes for Windos
2015-11-21 14:33:27 +01:00
Werner Saar
a8516c5b47
fixes for cross compile
2015-11-21 10:48:37 +01:00
Werner Saar
c40538eaeb
bugfix for cross compiling
2015-11-20 13:47:22 +01:00
Werner Saar
33e37d01b3
added lapack-3.6.0
2015-11-20 09:45:46 +01:00
Werner Saar
64db4576e6
removed lapack-3.5.0
2015-11-20 09:41:59 +01:00
Werner Saar
0d22551a6b
increase the stack size limit in the constructor
2015-11-20 09:23:01 +01:00
Ashwin Sekhar T K
1d121852c1
Fix blas_lock for arm64
2015-11-20 01:45:35 +05:30
Ashwin Sekhar T K
98965da2e8
lapack-test fixes for Cortex A57
2015-11-20 01:15:04 +05:30
Ashwin Sekhar T K
39937d15cd
Change BUFFER_SIZE for Cortex A57 to 20 MB
...
Change the GEMM_P, GEMM_Q, GEMM_R values for Cortex A57
2015-11-20 01:12:04 +05:30
Ray Glover
a9d7eee0dc
(Visual Studio) Don't use C99 complex numbers when building C++ code.
2015-11-17 17:29:30 +00:00
Zhang Xianyi
e31948ceb0
Fix #686 . Merge branch 'ashwinyes-develop' into develop
2015-11-11 04:30:26 +08:00
Zhang Xianyi
233ec2a1cc
Use 40 MB buffer for ARM Cortex A57.
2015-11-11 04:22:34 +08:00