Commit Graph

166 Commits

Author SHA1 Message Date
Martin Kroeker 677e42d7b0
Fixes from netlib PR 253
When minimal workspace is given in ?hesv_aa, ?sysv_aa, ?hesv_aa_2stage, ?sysv_aa_2stage, now no error is given
Quick return for ?laqr1
2018-06-01 15:12:59 +02:00
Martin Kroeker e2a8c35e5a
Fixes from netlib PR253
LAPACKE interfaces for Aasen's functions now call ?sytrf_aa and ?hetrf_aa instead of ?sytrf and ?hetrf
2018-06-01 15:08:14 +02:00
Martin Kroeker a7dbd4c57d
Fix paths to LIN and EIG tests
should fix 1574
2018-05-31 11:19:33 +02:00
Martin Kroeker c720f1f019
Merge pull request #1554 from martin-frbg/lapack-249
LAPACKE fixes from lapack PR249
2018-05-10 15:32:08 +02:00
Martin Kroeker d7d950fcf2
LAPACKE fixes from lapack PR249
Copied from Reference-LAPACK/lapack#249, this fixes out-of-bounds memory accesses
in the nancheck calls of the LAPACKE lacgv, lassq,larfg,larfb,larfx and mtr functions
2018-05-10 13:15:42 +02:00
Martin Kroeker 9795adc7ef
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:39:42 +02:00
Martin Kroeker 1a8e487c4a
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:38:25 +02:00
Martin Kroeker 5966fd52a2
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:36:56 +02:00
Martin Kroeker eaab622f03
Make "OMP task depend" sections conditional on OpenMP4, not just OpenMP
To allow compiling with gcc versions older than 4.9
2018-02-14 22:58:14 +01:00
Martin Kroeker 3cda1ce50a
Revert insiduous suppression of the -fopenmp flag in the LAPACK subtree
This was added in #1046 citing a problem with mingw, but in effect it quietly reduces thread safety on all non-Windows platforms (while -fopenmp is already disabled for Windows builds through the toplevel Makefile.system). Removing the filter fixes #1425
2018-02-13 22:44:45 +01:00
Martin Kroeker 150c7294a6
Remove spurious EXTERNAL reference
From Reference-LAPACK issue 228, remove spurious EXTERNAL reference to unused and nonexistent function xLACGV that could cause linking problems.
2018-02-08 14:57:13 +01:00
Martin Kroeker 374260027d
Add conditionals around ar calls for optional modules
The macOS ar aborts when it gets called with no input, see #1398
2017-12-21 20:42:30 +01:00
martin 3be5c3d343 Update LAPACK to 3.8.0 2017-11-23 18:13:35 +01:00
Martin Kroeker 2df1e3372d
Break out of potentially infinite rescaling loop after 1000 iterations
Inf values in the input vector will survive rescaling, causing an infinite loop. The value of 1000 is arbitrarily chosen as a large but finite value with the intention to never interfere with regular calculations.
2017-11-10 20:02:21 +01:00
Martin Kroeker d5ea8fd823 Fix workspace computation for side=L
From netlib PR#144
2017-05-04 20:01:41 +02:00
Martin Kroeker 4beffaaa4b Fix workspace computation for side=L
From netlib PR#144
2017-05-04 19:59:02 +02:00
Martin Kroeker fb28e4adc9 Fix workspace computation for side=L
From netlib PR#144
2017-05-04 19:55:02 +02:00
Martin Kroeker 26faa3ca47 Fix workspace allocation in lapacke_ctp for side=L
from netlib PR #144
2017-05-04 19:49:51 +02:00
Martin Kroeker 7908efafc8 Fix integer overflow in LAPACK DBDSQR, SBDSQR (#1135)
* Fix integer overflow in DBDSQR

As noted in lapack issue 135, an integer overflow in the calculation of the iteration limit could lead to an immediate return without any iterations having been performed if the input matrix is sufficiently big.

* Fix integer overflow in SBDSQR

As noted in lapack issue 135, an integer overflow in the calculation of the iteration limit could lead to an immediate return without any iterations having been performed if the input matrix is sufficiently big.

* Fix integer overflow in threshold calculation

Related to lapack issue 135, the threshold calculation can overflow as well as the multiplication is evaluated from left to right.
Without explicit parentheses, the calculation would overflow for N >= 18919

* Fix integer overflow in threshold calculation

Related to lapack issue 135, the threshold calculation can overflow as well as the multiplication is evaluated from left to right.
Without explicit parentheses, the calculation would overflow for N >= 18919
2017-03-24 22:05:22 +01:00
Martin Kroeker c7bc0ee823 Remove spurious names from EXTERNAL list
Remove unused (and nonexistent) functions ZHETRD_SY2SB and ZHETRD_SB2ST from comment and EXTERNAL declaration
2017-02-22 21:48:35 +01:00
Martin Kroeker 6bdee6d50a Remove spurious names from EXTERNAL list
Remove unused (and nonexistent) ZHETRD_SY2SB and ZHETRD_SB2ST
2017-02-22 21:45:27 +01:00
Martin Kroeker 009c0d2e5a Fix typo in EXTERNAL declaration
ZHBTRD_HB2ST  should be ZHETRD_HB2ST
2017-02-22 21:41:07 +01:00
Martin Kroeker 0958b49811 Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:40:39 +01:00
Martin Kroeker 09b240f1ef Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:39:06 +01:00
Martin Kroeker 69f4e8b86c Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:34:20 +01:00
Martin Kroeker e072e68aa0 Fix missing comma in continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:32:20 +01:00
Vladimir Chalupecky eeaee46e86 LAPACKE: fix wrong matrix size in ?ormbr
Changes made upstream in Reference LAPACK in
https://github.com/Reference-LAPACK/lapack/pull/128
2017-02-21 21:57:18 +01:00
Martin Kroeker 3f7720ec4b LAPACKE: fix wrong number of columns in ?ormlq
Copied from lapack https://github.com/Reference-LAPACK/lapack/pull/127  by vladimir-ch (with earlier changes from echeresh's  
PR 115 "lapacke_*ormlq_work: move declarations under if" there as they touched some of the same files)
2017-02-20 16:20:43 +01:00
Vladimir Chalupecky 4c2b713ce5 LAPACKE: fix incorrect value of lda_t in lapacke_?laswp_work
Fixed in Reference LAPACK in commit:

07e1fbd897
2017-02-07 09:21:46 +01:00
Martin Kroeker 7611a41f40 Split LAPACKE 3.7.0 obj list (take 2)
Missed the splitting of the actual ar call
2017-01-10 17:11:35 +01:00
Martin Kroeker 9f5cfd43dc Split the obj list of LAPACKE 3.7.0
Split obj list to allow building with mingw (argument list too long for the msys ar)
2017-01-09 18:29:53 +01:00
Werner Saar 1153e3ac39 filtered out -fopenmp and fix for mingw 2017-01-07 08:41:42 +01:00
Werner Saar 7c2c488c23 removed xerbla and lsame for Makefile 2017-01-06 16:35:20 +01:00
Werner Saar 4494d03a21 filtered out optimized functions 2017-01-06 13:42:31 +01:00
Werner Saar d35baf30cf added lapack 3.7.0 with latest patches from git 2017-01-06 11:48:40 +01:00
Werner Saar 8cd46acebb removed lapack 3.6.0 2017-01-06 11:44:57 +01:00
François Bissey c732f1a066 Never use "make" in makefiles. Only $(MAKE). 2016-12-15 11:38:23 +13:00
Martin Koehler 6f58271190 Move remaining OpenBLAS related changes from 3.6.0 to 3.6.1 2016-10-26 21:43:41 +02:00
Martin Koehler 311e0a912c Fix #971 2016-10-26 21:34:56 +02:00
Martin Koehler 429dfd83ee Fix threshold in nep.in 2016-10-26 21:17:12 +02:00
Martin Köhler 5af06c764a Fix MingW build 2016-10-26 16:03:00 +02:00
Martin Köhler 77006cc2a3 Import LAPACK: top directory 2016-10-26 15:14:13 +02:00
Martin Köhler 57eee3fa43 Import LAPACK: TESTING directory 2016-10-26 15:13:03 +02:00
Martin Köhler 92a858e69e Import LAPACK: SRC directory 2016-10-26 15:12:09 +02:00
Martin Köhler 13d40e7591 Import LAPACK: LAPACKE directory 2016-10-26 15:06:08 +02:00
Martin Köhler 4c29d20108 Import LAPACK: INSTALL directory 2016-10-26 15:04:39 +02:00
Martin Köhler 1357b8d93b Import LAPACK: DOCS directory 2016-10-26 15:03:51 +02:00
Martin Köhler fb7057babe Import LAPACK: CMAKE directory 2016-10-26 15:03:16 +02:00
Martin Köhler 358ee318ed Import LAPACK: CBLAS directory 2016-10-26 15:02:41 +02:00
Martin Köhler 4c024b85e4 Import LAPACK: BLAS directory 2016-10-26 15:02:09 +02:00
Vladimir Chalupecky 117d3371d4 LAPACKE: fix wrong direction check in LAPACKE_?larfb_work
Closes #971
2016-10-01 05:31:30 +09:00
Martin Kroeker 9b69d8a8e5 Update zgetrf2.f
Trivial typo correction (ZERBLA => XERBLA) to fix #910
2016-07-14 11:41:57 +02:00
John Biddiscombe 053044ae4d Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
2016-05-25 09:13:28 +02:00
Werner Saar 10c2ebdfc5 BUGFIX: removed fixes for bugs #148 and #149, because info for xerbla is wrong 2016-03-07 10:34:04 +01:00
Werner Saar 26b3b3a3e6 bugfixes form lapack svn for bugs #142 - #155 2016-03-07 10:10:00 +01:00
Werner Saar 8d652f11e7 updated smallscaling.c to build without C99 or C11
increased the threshold value of nep.in to 40
2016-03-06 08:40:51 +01:00
buffer51 855e0cb700 Restored LAPACK_COMPLEX_STRUCTURE for Android prior to 21. Refs #682. 2016-02-04 17:20:07 -05:00
Werner Saar ceaa931e48 updated gemv kernel for armv6 2016-01-24 16:31:19 +01:00
Zhang Xianyi 01db7908b8 Merge pull request #713 from btracey/patch-2
Fix Dormbr to perform the correct size operations with RowMajor
2015-12-10 10:13:49 -06:00
Brendan Tracey b3f100dc25 Fix Dormbr to perform the correct size operations with RowMajor
Fixes issue #712
2015-12-09 00:50:22 -07:00
Brendan Tracey 2f65aad626 Fix Dormlq to perform the correct size operations with RowMajor
Fixes issue #615.
2015-12-08 22:34:21 -07:00
Zhang Xianyi da7f69e8f4 Refs #699. Split the obj list of LAPACKE 3.6.0. 2015-11-24 13:15:28 -06:00
Werner Saar 299cdcdc29 lapack fixes for Windos 2015-11-21 14:33:27 +01:00
Werner Saar a8516c5b47 fixes for cross compile 2015-11-21 10:48:37 +01:00
Werner Saar c40538eaeb bugfix for cross compiling 2015-11-20 13:47:22 +01:00
Werner Saar 33e37d01b3 added lapack-3.6.0 2015-11-20 09:45:46 +01:00
Werner Saar 64db4576e6 removed lapack-3.5.0 2015-11-20 09:41:59 +01:00
Zhang Xianyi a550431ee6 Refs #682. Enable LAPACK_COMPLEX_STRUCTURE when __ANDROID_API_ < 21. 2015-11-06 23:46:20 -06:00
Zhang Xianyi 90aa8e24b9 Refs #615. Import bug fixes for LAPACKE dormlq. 2015-10-07 02:31:51 +08:00
kortschak d6e8459f20 Fix LAPACK_*lansy routines
Fixes #639.
2015-09-10 15:32:50 +09:30
kortschak efffd28739 Fix lantr preparation for row major matrices 2015-09-09 09:25:48 +09:30
Werner Saar 3814bf60d3 added optimized dsymv kernels for haswell 2015-04-22 10:42:50 +02:00
Werner Saar 6d0db0151f added optimized zaxpy-kernels 2015-04-16 11:19:37 +02:00
Werner Saar 3937e2a0a0 add optimized sdot-kernel for sandybridge 2015-04-05 19:47:05 +02:00
Erik Schnetter 29cb47fc06 Move #include statements outside extern "C" blocks 2015-01-12 21:27:52 -05:00
Erik Schnetter cd9868b1b4 Correct ilaver C declaration 2014-12-25 17:41:17 -05:00
Werner Saar 4de7b9ae47 increased NMAX to 128 2014-12-22 14:04:27 +01:00
Werner Saar ec85c4a51d Increased the Threshold value in sep.in 2014-12-11 14:57:41 +01:00
Werner Saar 97de657d38 added tests to sep.as as workaround for gfortran-4.8.x 2014-12-11 13:53:59 +01:00
wernsaar e2fc8c8c2c changed 1 test value (bug in lapack-testing?) 2014-08-30 13:58:02 +02:00
wernsaar dc05937313 added additional test values 2014-08-13 14:54:50 +02:00
wernsaar 339ab34c4c added additional test value to dstest.in 2014-07-13 18:29:19 +02:00
wernsaar cedc1f4b14 Ref #410: disabled optimized potri functions ( single threading bug) 2014-07-10 13:42:32 +02:00
wernsaar 2081f6e8ff Lapack bug114: replaced cgesvd.f and zgesvd.f 2014-07-08 10:21:10 +02:00
wernsaar dc6b809f15 Lapack bug117: replaced zstemr.f 2014-07-08 10:08:34 +02:00
wernsaar 0f08684649 Lapack bug118: replaced clanhf.f and zlanhf.f 2014-07-08 09:57:40 +02:00
wernsaar aa2709c4e0 enabled optimized dgemm kernel for NEHALEM 2014-06-26 12:22:29 +02:00
Zhang Xianyi a40116de25 Fixed generating DLL bug. 2014-06-06 16:13:08 +08:00
wernsaar 219bcb119d added lapack and lapacke timing libs by default 2014-05-24 15:53:25 +02:00
wernsaar 5664445543 changed threshold value for sep.in from 50.0 to 60.0 2014-05-23 17:26:50 +02:00
wernsaar 89da450800 enabled and tested optimized potri lapack functions 2014-05-23 12:14:30 +02:00
wernsaar c26bbee489 enabled abd tested optimized trtri lapack functions 2014-05-23 10:55:39 +02:00
wernsaar a748d3a75d enabled optimized trti2 lapack functions again 2014-05-21 11:02:07 +02:00
wernsaar a5ab231ad4 enabled optimized complex lauum lapack functions again 2014-05-21 10:35:28 +02:00
wernsaar dbaeea7b59 enabled lauu2 and lauum lapack functions again 2014-05-21 09:49:18 +02:00
wernsaar 0d75f3b6a2 enabled and tested optimized gesv lapack functions 2014-05-19 14:44:53 +02:00
wernsaar 2ff66e661d enabled and tested optimized laswp lapack function 2014-05-19 13:35:32 +02:00
wernsaar ebc95e6f11 enabled and tested optimized potf2 lapack functions 2014-05-18 22:41:43 +02:00
wernsaar 61a2c50e8e enabled and tested optimized getf2 lapack functions 2014-05-18 22:21:16 +02:00
wernsaar 4f98f8c9b3 enabled and tested optimized potrf lapack functions 2014-05-18 21:42:37 +02:00