Commit Graph

434 Commits

Author SHA1 Message Date
Martin Kroeker 95da5141f0
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-14 22:21:29 +01:00
Martin Kroeker 2592853fc7
Restore OpenBLAS-specific changes 2022-11-14 21:47:37 +01:00
Martin Kroeker 52c2a0397b
Restore OpenBLAS modifications to link line 2022-11-14 17:13:08 +01:00
Martin Kroeker c5c4888ed6
Merge pull request #3818 from martin-frbg/lapack665
Fix leading dimension check of eigen-/Schur vectors in LAPACKE (Reference-LAPACK PR 665)
2022-11-14 07:27:58 +01:00
Martin Kroeker 92411dfecb
Merge pull request #3817 from martin-frbg/lapack738742
Add NaN check functions for trapezoidal matrices to LAPACKE (Reference-LAPACK PR 738+742)
2022-11-13 23:49:15 +01:00
Martin Kroeker 13f3bbece1
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:18:09 +01:00
Martin Kroeker 92174725d9
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:16:12 +01:00
Martin Kroeker 6eb707d941
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:10:13 +01:00
Martin Kroeker 7eb2653268
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:07:10 +01:00
Martin Kroeker 4bc918a791
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651) 2022-11-13 23:03:31 +01:00
Martin Kroeker 29dc086f38
Add macros for 32/64bit integer printf 2022-11-13 22:50:02 +01:00
Martin Kroeker 8b3f9715ec
Add macros for 32/64bit integer printf 2022-11-13 22:48:37 +01:00
Martin Kroeker 35dac5677a
Merge pull request #3816 from martin-frbg/lapack638
Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638)
2022-11-13 20:38:42 +01:00
Martin Kroeker fdb012ceed
Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:37:18 +01:00
Martin Kroeker c99d27ae45
Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:33:20 +01:00
Martin Kroeker a485e4f515
Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:30:06 +01:00
Martin Kroeker 0c2aa0bed7
Fix implicit conversions and unused variables (Reference-LAPACK PR 703) 2022-11-13 20:29:08 +01:00
Martin Kroeker 2226a82f2e
Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) 2022-11-13 17:50:49 +01:00
Martin Kroeker 645633e321
Fix leading dimension check of eigen-/Schur vectors (Reference-LAPACK PR 665) 2022-11-13 17:48:02 +01:00
Martin Kroeker eba1112e38
Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 15:03:39 +01:00
Martin Kroeker 23cfe58ee3
Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 14:55:45 +01:00
Martin Kroeker 6dcf737c5d
Add NaN check functions for trapezoidal matrices (Reference-LAPACK PR738+742) 2022-11-13 14:51:39 +01:00
Martin Kroeker 3e2d52c502
Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638) 2022-11-13 13:00:52 +01:00
Martin Kroeker cb48c29b6f
Fix workspace calculation (Reference-LAPACK PR690) 2022-11-13 12:49:59 +01:00
Martin Kroeker 515cf26929
Fix pointer/integer argument mismatch in calls to pow() 2022-09-14 11:48:36 +02:00
Martin Kroeker f2cd238450
Restore conditional compilation and fix for parallel make in LAPACK MATGEN (#3733)
* Restore NOPARALLEL and conditional compilation
2022-08-13 20:03:16 +02:00
Martin Kroeker 704a024df4
Fix C99-style declaration of loop variable 2022-08-11 16:37:23 +02:00
Ivan Pribec 802e71bf05 Add const attribute to lsame 2022-08-08 15:15:52 +02:00
Martin Kroeker 9c4e91a77d
Increase threshold 2022-08-07 00:03:50 +02:00
Martin Kroeker 1fe06caf49
Increase threshold 2022-08-07 00:03:20 +02:00
martin-frbg cf61d48470 fix C_LAPACK build options for files that must be built unoptimized 2022-05-12 09:43:23 +02:00
martin-frbg 884e6da73b restore OpenBLAS-specific build file fixes 2022-05-12 00:20:54 +02:00
martin-frbg 56d238ff0f Merge branch 'lapack3101' of https://github.com/martin-frbg/OpenBLAS into lapack3101 2022-05-11 18:45:02 +02:00
martin-frbg ffb4f04fbb Update the LAPACK testsuite to match 3.10.1 2022-05-11 18:40:55 +02:00
Martin Kroeker 75cdc8cba5
Merge branch 'develop' into lapack3101 2022-04-30 21:43:45 +02:00
Martin Kroeker b4b9ccdbfa
Remove leftover debug output 2022-04-27 21:59:45 +02:00
martin-frbg 510c7222ef Add dummy C files for 3.10.1 modules 2022-04-22 13:09:12 +02:00
martin-frbg ee7f42217b Add dummy C versions of ?roundup_lwork for C_LAPACK 2022-04-22 12:28:15 +02:00
martin-frbg aea3d366f5 Add dummy C versions of the 3.10.1 ?laqz codes for C_LAPACK 2022-04-22 09:27:37 +02:00
martin-frbg 920199c577 fix module build and d/sroundup_lwork 2022-04-21 23:31:02 +02:00
martin-frbg 7c84a9f7be update suffix rules 2022-04-21 22:43:29 +02:00
martin-frbg 329cdc31b3 Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1 2022-04-21 20:39:29 +02:00
Martin Kroeker 18b19d135b
C_LAPACK: Fixes to make it compile with MSVC (#3605)
* Fix f2c-like support functions to compile with MSVC, and
re-enable C_LAPACK for MSVC in CMAKE

* Add MSVC&flang build to Azure CI in order to check C_LAPACK correctness
2022-04-17 17:49:38 +02:00
Martin Kroeker b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available (#3539)
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
2022-04-09 22:38:58 +02:00
Larson, Eric 8fe3555792 ILP support
long's in windows are 4 bytes (MSVS, intel compilers). Use int64_t and int32_t
to ensure 8 byte integers for ILP interface.

support 8 byte integer flag for intel ifort compiler
2022-03-24 19:09:23 +01:00
Aisha Tammy 3efbf968f1 create INDEX64 target 2022-03-24 19:09:23 +01:00
Martin Kroeker 1c1ffb0591
Annotate LAPACKE_lsame with the const attribute for GCC and compatible compilers 2022-02-26 19:27:34 +01:00
Martin Kroeker e2bf3f31a6
Add .NOTPARALLEL: as a workaround for builds on DFS 2022-02-09 22:09:25 +01:00
Martin Kroeker a3eea3e127
Fix input argument check (LAPACK PR 646) 2022-02-03 11:43:17 +01:00
Martin Kroeker 337b65133d
Fix out of bounds read in ?llarv (Reference-LAPACK PR 625) 2021-10-01 11:19:53 +02:00
Martin Kroeker ddb0ff5353
Fix out of bounds read in ?llarv (Reference-LAPACK PR 625) 2021-10-01 11:19:07 +02:00
Martin Kroeker fe497efa05
Fix out of bounds read in ?llarv (Reference-LAPACK PR 625) 2021-10-01 11:18:20 +02:00
Martin Kroeker 2be5ee3cca
Fix out of bounds read in ?llarv (Reference-LAPACK PR 625) 2021-10-01 11:17:21 +02:00
Isuru Fernando 02d4a49761 Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS 2021-07-27 23:44:51 +02:00
Martin Kroeker f176ff90af
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:42:43 +02:00
Martin Kroeker f4d4abd423
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:41:45 +02:00
Martin Kroeker 2b9443b7e7
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:40:29 +02:00
Martin Kroeker fe0e66564e
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:39:20 +02:00
Martin Kroeker 5958ffc9b6
Declare DZASUM as EXTERNAL 2021-06-16 09:43:39 +02:00
Martin Kroeker cd0e4aadb1
Declare ZDROT as EXTERNAL 2021-06-16 09:41:18 +02:00
Martin Kroeker e2621ef93a
Declare SROT as EXTERNAL 2021-06-16 09:40:15 +02:00
Martin Kroeker 9e1b43ea9b
Declare DROT as EXTERNAL 2021-06-16 09:39:28 +02:00
Martin Kroeker 5269348178
Declare CSROT as EXTERNAL 2021-06-16 09:35:12 +02:00
Martin Kroeker 92e024bbb3
Declare SCASUM as EXTERNAL 2021-06-16 09:33:23 +02:00
Martin Kroeker 3a53207cc9
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564) 2021-05-22 14:29:45 +02:00
Martin Kroeker 5c729c6dce
Correct function name in error message from SLASQ2 (Reference-LAPACK PR 555) 2021-05-17 14:47:14 +02:00
Martin Kroeker 15b9d6b4a7
Delete zchkaa.f 2021-05-14 19:55:31 +02:00
Martin Kroeker f7bcd962c1
Delete schkaa.f 2021-05-14 19:54:54 +02:00
Martin Kroeker 93cc066921
Delete dchkaa.f 2021-05-14 19:54:13 +02:00
Martin Kroeker 2c7d4a7766
Delete cchkaa.f 2021-05-14 19:53:38 +02:00
Martin Kroeker eef1c42f03
Convert ?chkaa to use dynamic allocation for the larger arrays 2021-05-14 19:53:03 +02:00
Martin Kroeker d00709e016
Add files via upload 2021-05-02 20:47:58 +02:00
Martin Kroeker d444344497
Add LAPACKE interfaces for the new Householder Reconstruction functions from 3.9.1 2021-05-02 19:57:47 +02:00
Martin Kroeker fb7308b9b5
Add entries for the new Householder Reconstruction functions from 3.9.1 2021-05-02 19:56:11 +02:00
Martin Kroeker 88b70fba3e
Add new tests for Householder reconstruction functions from 3.9.1 2021-05-02 19:28:21 +02:00
Martin Kroeker 4c1d47098b
Add new files for Householder reconstruction functions from 3.9.1 2021-05-02 19:25:43 +02:00
Martin Kroeker b8232c9054
Merge pull request #3213 from martin-frbg/lapack382
Avoid allocating the transposed triangular matrix in LAPACKE_xlantr_work (Reference-LAPACK 382)
2021-05-02 18:45:15 +02:00
Martin Kroeker 114bbbc6d7
Merge pull request #3212 from martin-frbg/lapack463
Initialize X and Y to zero for N=0 in xGGGLM (Reference-LAPACK PR463)
2021-05-02 18:44:59 +02:00
Martin Kroeker b67a92c19f
Merge pull request #3211 from martin-frbg/lapack471
Handle norm NaN value in xGESDD (Reference LAPACK PR471)
2021-05-02 18:44:29 +02:00
Martin Kroeker 4bf00da8fb
Avoid allocating the transposed triangular matrix (Reference-LAPACK PR382) 2021-05-02 12:18:17 +02:00
Martin Kroeker c26780d451
Initialize X and Y to zero for N=0 (Reference-LAPACK PR463) 2021-05-02 11:40:56 +02:00
Martin Kroeker d77d9bc920
Handle norm NaN value (Reference LAPACK PR471) 2021-05-02 11:24:50 +02:00
Martin Kroeker 37d3e2bd94
Merge pull request #3210 from martin-frbg/lapack502
Fix possible division by zero in LAPACK xTGSJA (Reference-LAPACK PR502)
2021-05-02 09:02:11 +02:00
Martin Kroeker de8656769c
Fix possible division by zero in xTGSJA (Reference-LAPACK PR502) 2021-05-01 21:31:13 +02:00
Martin Kroeker d43e07198d
Merge pull request #3208 from martin-frbg/lapack534
Apply MKL team fixes to the LAPACKE interfaces (Reference-LAPACK PR 534)
2021-05-01 20:18:29 +02:00
Martin Kroeker 5cc35abc3d
Apply MKL team fixes to the LAPACKE interfaces (Reference-LAPACK PR 534)
Removed spurious checks for INFO in xLACPY,xLASET after routines not returning any,and redundant requirements for ldvt in xGESVD_WORK
2021-05-01 13:22:10 +02:00
Martin Kroeker 254774f5a6
Add const qualifiers 2021-05-01 13:10:16 +02:00
Martin Kroeker 87d2e314db
Import packing improvements in LAPACK xLAQR from Reference-LAPACK PR 480+535 2021-04-30 13:50:55 +02:00
Martin Kroeker c9a82f54d1
Merge pull request #3204 from martin-frbg/lapack506
Correct INFO value returned by SLASQ2/DLASQ2 (Reference-LAPACK 506)
2021-04-30 13:25:48 +02:00
Martin Kroeker 444cb78be5
correct INFO value (Reference-LAPACK 506) 2021-04-30 09:26:54 +02:00
Martin Kroeker 9a36a283d3
Merge pull request #3199 from martin-frbg/lapack537
Add LAPACKE fixes from Reference-LAPACK PR 537
2021-04-29 05:39:50 +02:00
Martin Kroeker 7e35d25ea0
Merge pull request #3198 from martin-frbg/lapack539
Apply fixes from Reference-LAPACK PR468 and 539 for array declarations in ?ORGBR/?UNGBR
2021-04-29 05:39:35 +02:00
Martin Kroeker 3704f5e5b0
Add missing break statements in the ?lascl functions 2021-04-28 20:56:55 +02:00
Martin Kroeker 6b76066632
Add const qualifiers 2021-04-28 20:55:37 +02:00
Martin Kroeker 2b01132515
Clean up misdeclaration of the dummy stand-in for A in ?ORGBR/?UNGBR workspace queries (Reference-LAPACK PR 468 and 530) 2021-04-28 19:20:08 +02:00
Martin Kroeker 8e95a1e18d
Merge pull request #3195 from martin-frbg/lapack536
Apply lapack-testing fix from Reference-LAPACK PR536
2021-04-28 18:17:25 +02:00
Martin Kroeker 13a29d13fd
Apply lapack-testing fix from Reference-LAPACK PR536
fixes changing back from a single OMP thread for error exit testing to the originally requested number of threads for computational tests
2021-04-27 15:48:22 +02:00
Martin Kroeker 3526ff2507
Apply fixes from Reference-LAPACK PR538 2021-04-27 12:52:49 +02:00
Martin Kroeker 0dba04bb58
Merge pull request #3178 from martin-frbg/fix2864
Fix unwanted fallback to implicit typing in slanv2/dlanv2
2021-04-09 13:38:05 +02:00
Martin Kroeker e96f5e3c65
Fix implicit typing of new variable TWO 2021-04-09 10:04:15 +02:00
Martin Kroeker 558724e99f
Fix implicit typing of new variable TWO 2021-04-09 10:03:31 +02:00
Martin Kroeker 081d5ae971
Fix typo and potentially undefined variables
(copies fixes made in Reference-LAPACK PR 477 after the initial cherrypick)
2021-04-03 22:11:14 +02:00
Martin Kroeker 30d835168a
Merge pull request #3088 from xoviat/msvc
add misc fixes.
2021-03-14 17:14:28 +01:00
Martin Kroeker 9564f688c4
Adjust build rules for ?chkee.F 2021-02-28 18:57:05 +01:00
Martin Kroeker 90c1776c86
Adjust build rules for ?chkee.F 2021-02-28 18:53:20 +01:00
Martin Kroeker 9cf861e8fa
Add rewritten cchkee.F from Reference-LAPACK PR335 2021-02-28 18:51:03 +01:00
Martin Kroeker 9b7b1da133
Add rewritten dchkee.F from Reference-LAPACK PR335 2021-02-28 18:50:26 +01:00
Martin Kroeker a5ab891292
Add rewritten schkee.F from Reference-LAPACK PR335 2021-02-28 18:49:50 +01:00
Martin Kroeker 90bb4ac821
Add rewritten zchkee.F from Reference-LAPACK PR335 2021-02-28 18:49:10 +01:00
Martin Kroeker 23a0d1bc1f
Delete zchkee.f 2021-02-28 18:47:06 +01:00
Martin Kroeker 0e96c378fd
Delete schkee.f 2021-02-28 18:46:52 +01:00
Martin Kroeker ee16efff3c
Delete dchkee.f 2021-02-28 18:46:38 +01:00
Martin Kroeker 0197519dd7
Delete cchkee.f 2021-02-28 18:46:08 +01:00
Martin Kroeker c4b5abbe43
fix data type 2021-01-29 10:45:36 +01:00
Martin Kroeker f87842483e
fix calculation of non-exceptional shift (from Reference-LAPACK PR 477) 2021-01-29 09:56:12 +01:00
xoviat 3165c915b6 fix test helpers 2021-01-27 15:24:49 -06:00
Martin Kroeker 856bc36533
Add exceptional shift to fix rare convergence problems 2021-01-27 13:41:45 +01:00
Martin Kroeker f95031204e
Fix macro used in argument conversion (LAPACK PR 458) 2020-10-22 16:19:26 +02:00
Martin Kroeker a6570108c5
Add Makefile support for enabling only some variable types 2020-10-11 14:49:58 +02:00
Martin Kroeker ef552bc578
Add Makefile support for enabling only some variable types 2020-10-11 14:49:06 +02:00
Martin Kroeker efe1ad4700
Add Makefile support for enabling only some variable types 2020-10-11 14:48:23 +02:00
Martin Kroeker 7ed25e9e10
FIx underflow/rounding errors in LAPACK (S,D)LANV2
Reference-LAPACK PR 445, fixing their issue 263
2020-09-27 22:59:20 +02:00
Martin Kroeker 7e4d5c237c
Fix workspace query in xGELQ (Reference-LAPACK PR443) 2020-09-18 09:19:46 +02:00
Martin Kroeker 7d9c77f421
Correct dimension argument to xLASET
from Reference-LAPACK PR 438
2020-09-07 22:03:46 +02:00
Martin Kroeker 8a2a137a9e
Correct argument to SLASET (Improves fix from PR2778)
as explained by serguei-patchkovskii in Reference-LAPACK/lapack#438 (comment) , passing in an index of 1 instead of N leads to a standards violation accessing matrix A in SLASET, i.e. undefined behavior
2020-09-05 13:06:31 +02:00
H. Vetinari 3426519ae2 adapt ?ggsv?-functions to ambient code style in LAPACKE/include/lapack.h 2020-09-04 17:33:24 +02:00
H. Vetinari 1c6c71fa85 Follow-up to lapack#434 & lapack#409: add missing 'const' in signatures
Based on how the surrounding functions in lapack.h are handling the
parameters, particularly the ?ggsv?3-variants of the affected functions
2020-09-04 17:33:11 +02:00
H. Vetinari 860247b5da Follow-up to lapack#434 & lapack#409: fix signature mismatches 2020-09-04 17:32:53 +02:00
Martin Kroeker c61771e335
Merge pull request #2778 from martin-frbg/lapackeig
Fix various wrong calls to SLASET/DLASET in the EIG part of the LAPACK testsuite
2020-09-04 10:06:02 +02:00
Martin Kroeker c31b72965e
Fix data type of work array in zgesvdq prototype 2020-09-02 23:44:44 +02:00
Martin Kroeker 0ce2aa3163
Fix data type of rwork array 2020-09-02 23:41:51 +02:00
Martin Kroeker f42e84d46c
Fix misnaming of LAPACK_?ggsvp function prototypes as LAPACKE_ (#2808)
* Fix misnaming of LAPACK_?ggsvp and ?ggsvd function prototypes as LAPACKE_

* Drop the LAPACKE matrix_layout parameter from the argument lists, change ints to pointers and add missing work arguments.
2020-09-01 10:44:48 +02:00
Martin Kroeker 597010a968
Fix incorrect argument to SLASET
Reference-LAPACK issue 425 (and 318)
2020-08-14 00:41:56 +02:00
Martin Kroeker d64f1ef26b
Fix incorrect argument to SLASET
Reference-LAPACK issue 425 (and 318)
2020-08-14 00:40:24 +02:00
Martin Kroeker c62aad62e5
Fix incorrect calls to DLASET
Reference-LAPACK issue 429
2020-08-14 00:35:45 +02:00
Martin Kroeker 7219c9cb87
Merge pull request #2764 from martin-frbg/lapacktests
Fix array overruns in the LIN part of the LAPACK testsuite
2020-08-10 13:27:51 +02:00
Martin Kroeker 64259d521a
Fix use of unallocated array in workspace query and wrong type of argument to xSCAL 2020-08-09 13:02:27 +02:00
Martin Kroeker 6f5ca44c1a
Expand TAU array as SGEMQR/DGEMQR read elements 2 and 3 2020-08-09 12:59:20 +02:00
Martin Kroeker aaf1a17168
Apply current library name suffix 2020-08-02 17:58:33 +02:00
Martin Kroeker 53add6a80d
Apply library name suffix to openblas if any 2020-08-02 17:57:12 +02:00
Martin Kroeker f766024749
enable fortran for cmake 2020-07-09 13:44:25 +02:00
Martin Kroeker c502760bef
Modify for building with OpenBLAS 2020-07-09 13:13:16 +02:00
Martin Kroeker 29b5887d5f
Modify for building with OpenBLAS 2020-07-09 13:12:35 +02:00
Martin Kroeker 60188a8c82
Append crude hack for enabling lapack tests in the OpenBLAS build 2020-07-09 11:44:31 +02:00
Martin Kroeker 522aaf53bf
Break out of potentially infinite rescaling loop in LAPACK xLARGV/xLARTG/xLARTGP
Reference-LAPACK issue 411
2020-06-07 14:30:20 +02:00
Ilhan Polat 76d2612e0c
BUG: Fix the loop range in ZHEEQUB.f 2020-05-30 14:11:11 +02:00
Martin Kroeker 6baa9a778d
Improve declaration of LAPACKE_get_nancheck 2020-05-19 17:59:31 +02:00
Martin Kroeker c775458299
Merge pull request #2512 from martin-frbg/lapackh
Move declarations of lapack_complex_custom types outside the extern C
2020-03-14 13:27:40 +01:00
Martin Kroeker ee2e758278
Move declarations of lapack_complex_custom types outside the extern C
fixes #2510
2020-03-13 20:34:13 +01:00
Martin Kroeker b25ae1fc60
Apply fix for Reference-LAPACK issue 394
reference to XERBLA extending beyond column 72, breaking builds with compilers that default to traditional punch card format
2020-03-10 13:37:41 +01:00
Martin Kroeker 87ac1ceb0b
Apply fix from Reference-LAPACK PR390, NaN not propagating 2020-02-23 22:40:40 +01:00
Martin Kroeker 9e40c080f2
Apply fix from Reference-LAPACK PR390, NaN not propagating 2020-02-23 22:39:01 +01:00
Martin Kroeker 2ea2bd99c7
Apply LAPACKE fix for eigenvector transposition in symmetric eigensolvers
from Reference-LAPACK PR 330
2020-01-03 11:10:00 +01:00
Martin Kroeker 375b1875c8
[WIP] Update LAPACK to 3.9.0 (#2353)
* Update make.inc entries for LAPACK 3.9.0

Reference-LAPACK PR 347 changed some variable names and relative paths

* Update LAPACK to 3.9.0

* Add new functions from LAPACK 3.9.0

* Add new functions from LAPACK 3.9.0

* Restore LOADER command 

as it makes it easier to specify pthread as needed

* Restore LOADER

* Restore EIG/LIN prefixes in cmdbase

* add binary path to lapack_testing.py call

* Restore OpenMP version check

* Restore OpenMP version check

* Restore fix for out-of-bounds array accesses

from #2096
2020-01-01 13:18:53 +01:00
Guillaume Horel 4b21b646ea turn on optimized code 2019-09-08 11:14:49 -04:00
Martin Kroeker 2aad88d5b9 Avoid out-of-bounds accesses in LAPACK EIG tests
see https://github.com/Reference-LAPACK/lapack/issues/333
2019-04-27 23:01:49 +02:00
Martin Kroeker 1391fc46d2
fix second instance of complex.h for c++ as well 2019-02-05 19:29:33 +01:00
Martin Kroeker ae02a57261
Merge pull request #1866 from martin-frbg/issue1859
Fix argument in SLASET call to zero S
2018-11-10 19:23:31 +01:00
Martin Kroeker a6a52a73f7
Fix argument in SLASET call to zero S
fixes #1859 in accordance with https://github.com/LAPACK-Reference/issue/296
2018-11-10 17:16:53 +01:00
Martin Kroeker 9c177d270b
Restore Android/ARMv7 build fix from #778
for #1811
2018-11-01 18:50:25 +01:00
Tiziano Müller 79ea839b63 fix parallel build issues with APFS/HFS+/ext2/3 in netlib-lapack
The problem is that OpenBLAS sets the LAPACKE_LIB and the TMGLIB to the
same object and uses the `ar` feature to update the archive file. If the
underlying filesystem does not have sub-second timestamp resolution and
the system is fast enough (or `ccache` is used), the timestamp of the
builds which should be added to the previously generated archive is the
same as the archive file itself and therefore `make` does not update the
archive.

Since OpenBLAS takes care to not run the different targets updating the
archive in parallel, the easiest solution is to declare the respective
targets `.PHONY`, forcing `make` to always update them.

fixes #1682
2018-10-06 14:10:05 +02:00
Martin Kroeker fd081a91e4
Merge pull request #1759 from martin-frbg/lapack283
Remove an unused variable from several LAPACKE 2stage_work functions
2018-09-11 13:52:09 +02:00
Martin Kroeker 094f8c3b57
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
2018-09-11 10:53:47 +02:00
Martin Kroeker 5cf090f516
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
2018-09-11 10:52:30 +02:00
Martin Kroeker 58363542e7
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
2018-09-11 10:51:17 +02:00
Martin Kroeker 9e917b16db
Fix missing replacements of ILAENV by ILAENV_2STAGE (lapack PR 272)
This could cause spurious "parameter has an illegal value" errors in DSYEVR and related routines, see https://github.com/Reference-LAPACK/lapack/issues/262
2018-08-28 21:11:54 +02:00
Martin Kroeker 401adddb2b
Merge pull request #1585 from martin-frbg/lapack-253
Fixes from Lapack-Reference PR 253
2018-06-01 18:59:33 +02:00
Martin Kroeker c5b13d4e10
Fixes from netlib PR 253 2018-06-01 15:14:45 +02:00
Martin Kroeker 677e42d7b0
Fixes from netlib PR 253
When minimal workspace is given in ?hesv_aa, ?sysv_aa, ?hesv_aa_2stage, ?sysv_aa_2stage, now no error is given
Quick return for ?laqr1
2018-06-01 15:12:59 +02:00
Martin Kroeker e2a8c35e5a
Fixes from netlib PR253
LAPACKE interfaces for Aasen's functions now call ?sytrf_aa and ?hetrf_aa instead of ?sytrf and ?hetrf
2018-06-01 15:08:14 +02:00
Martin Kroeker a7dbd4c57d
Fix paths to LIN and EIG tests
should fix 1574
2018-05-31 11:19:33 +02:00
Martin Kroeker c720f1f019
Merge pull request #1554 from martin-frbg/lapack-249
LAPACKE fixes from lapack PR249
2018-05-10 15:32:08 +02:00
Martin Kroeker d7d950fcf2
LAPACKE fixes from lapack PR249
Copied from Reference-LAPACK/lapack#249, this fixes out-of-bounds memory accesses
in the nancheck calls of the LAPACKE lacgv, lassq,larfg,larfb,larfx and mtr functions
2018-05-10 13:15:42 +02:00
Martin Kroeker 9795adc7ef
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:39:42 +02:00
Martin Kroeker 1a8e487c4a
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:38:25 +02:00
Martin Kroeker 5966fd52a2
Drop C-style "L" suffix from OPENMP version number in check 2018-05-08 21:36:56 +02:00
Martin Kroeker eaab622f03
Make "OMP task depend" sections conditional on OpenMP4, not just OpenMP
To allow compiling with gcc versions older than 4.9
2018-02-14 22:58:14 +01:00
Martin Kroeker 3cda1ce50a
Revert insiduous suppression of the -fopenmp flag in the LAPACK subtree
This was added in #1046 citing a problem with mingw, but in effect it quietly reduces thread safety on all non-Windows platforms (while -fopenmp is already disabled for Windows builds through the toplevel Makefile.system). Removing the filter fixes #1425
2018-02-13 22:44:45 +01:00
Martin Kroeker 150c7294a6
Remove spurious EXTERNAL reference
From Reference-LAPACK issue 228, remove spurious EXTERNAL reference to unused and nonexistent function xLACGV that could cause linking problems.
2018-02-08 14:57:13 +01:00
Martin Kroeker 374260027d
Add conditionals around ar calls for optional modules
The macOS ar aborts when it gets called with no input, see #1398
2017-12-21 20:42:30 +01:00
martin 3be5c3d343 Update LAPACK to 3.8.0 2017-11-23 18:13:35 +01:00
Martin Kroeker 2df1e3372d
Break out of potentially infinite rescaling loop after 1000 iterations
Inf values in the input vector will survive rescaling, causing an infinite loop. The value of 1000 is arbitrarily chosen as a large but finite value with the intention to never interfere with regular calculations.
2017-11-10 20:02:21 +01:00
Martin Kroeker d5ea8fd823 Fix workspace computation for side=L
From netlib PR#144
2017-05-04 20:01:41 +02:00
Martin Kroeker 4beffaaa4b Fix workspace computation for side=L
From netlib PR#144
2017-05-04 19:59:02 +02:00
Martin Kroeker fb28e4adc9 Fix workspace computation for side=L
From netlib PR#144
2017-05-04 19:55:02 +02:00
Martin Kroeker 26faa3ca47 Fix workspace allocation in lapacke_ctp for side=L
from netlib PR #144
2017-05-04 19:49:51 +02:00
Martin Kroeker 7908efafc8 Fix integer overflow in LAPACK DBDSQR, SBDSQR (#1135)
* Fix integer overflow in DBDSQR

As noted in lapack issue 135, an integer overflow in the calculation of the iteration limit could lead to an immediate return without any iterations having been performed if the input matrix is sufficiently big.

* Fix integer overflow in SBDSQR

As noted in lapack issue 135, an integer overflow in the calculation of the iteration limit could lead to an immediate return without any iterations having been performed if the input matrix is sufficiently big.

* Fix integer overflow in threshold calculation

Related to lapack issue 135, the threshold calculation can overflow as well as the multiplication is evaluated from left to right.
Without explicit parentheses, the calculation would overflow for N >= 18919

* Fix integer overflow in threshold calculation

Related to lapack issue 135, the threshold calculation can overflow as well as the multiplication is evaluated from left to right.
Without explicit parentheses, the calculation would overflow for N >= 18919
2017-03-24 22:05:22 +01:00
Martin Kroeker c7bc0ee823 Remove spurious names from EXTERNAL list
Remove unused (and nonexistent) functions ZHETRD_SY2SB and ZHETRD_SB2ST from comment and EXTERNAL declaration
2017-02-22 21:48:35 +01:00
Martin Kroeker 6bdee6d50a Remove spurious names from EXTERNAL list
Remove unused (and nonexistent) ZHETRD_SY2SB and ZHETRD_SB2ST
2017-02-22 21:45:27 +01:00
Martin Kroeker 009c0d2e5a Fix typo in EXTERNAL declaration
ZHBTRD_HB2ST  should be ZHETRD_HB2ST
2017-02-22 21:41:07 +01:00
Martin Kroeker 0958b49811 Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:40:39 +01:00
Martin Kroeker 09b240f1ef Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:39:06 +01:00
Martin Kroeker 69f4e8b86c Fix missing comma on continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:34:20 +01:00
Martin Kroeker e072e68aa0 Fix missing comma in continued line
EXTERNAL declaration of subroutines missed a comma before the continuation line,
causing a strange run-together name to appear in the object when compiled with ifort.
2017-02-22 08:32:20 +01:00
Vladimir Chalupecky eeaee46e86 LAPACKE: fix wrong matrix size in ?ormbr
Changes made upstream in Reference LAPACK in
https://github.com/Reference-LAPACK/lapack/pull/128
2017-02-21 21:57:18 +01:00
Martin Kroeker 3f7720ec4b LAPACKE: fix wrong number of columns in ?ormlq
Copied from lapack https://github.com/Reference-LAPACK/lapack/pull/127  by vladimir-ch (with earlier changes from echeresh's  
PR 115 "lapacke_*ormlq_work: move declarations under if" there as they touched some of the same files)
2017-02-20 16:20:43 +01:00
Vladimir Chalupecky 4c2b713ce5 LAPACKE: fix incorrect value of lda_t in lapacke_?laswp_work
Fixed in Reference LAPACK in commit:

07e1fbd897
2017-02-07 09:21:46 +01:00
Martin Kroeker 7611a41f40 Split LAPACKE 3.7.0 obj list (take 2)
Missed the splitting of the actual ar call
2017-01-10 17:11:35 +01:00
Martin Kroeker 9f5cfd43dc Split the obj list of LAPACKE 3.7.0
Split obj list to allow building with mingw (argument list too long for the msys ar)
2017-01-09 18:29:53 +01:00
Werner Saar 1153e3ac39 filtered out -fopenmp and fix for mingw 2017-01-07 08:41:42 +01:00