Martin Kroeker
|
d8d7bd33cb
|
Update Changelog for 0.3.15
|
2021-05-02 23:46:55 +02:00 |
Martin Kroeker
|
e72420e8c5
|
Merge pull request #3214 from martin-frbg/lapack-3.9.1hrt
Add new Householder Reconstruction functions from LAPACK 3.9.1
|
2021-05-02 23:40:03 +02:00 |
Martin Kroeker
|
d00709e016
|
Add files via upload
|
2021-05-02 20:47:58 +02:00 |
Martin Kroeker
|
d444344497
|
Add LAPACKE interfaces for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:57:47 +02:00 |
Martin Kroeker
|
fb7308b9b5
|
Add entries for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:56:11 +02:00 |
Martin Kroeker
|
db50b24a4a
|
Add entries for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:55:15 +02:00 |
Martin Kroeker
|
88b70fba3e
|
Add new tests for Householder reconstruction functions from 3.9.1
|
2021-05-02 19:28:21 +02:00 |
Martin Kroeker
|
4c1d47098b
|
Add new files for Householder reconstruction functions from 3.9.1
|
2021-05-02 19:25:43 +02:00 |
Martin Kroeker
|
40000d1f64
|
Add entries for Householder reconstruction functions from 3.9.1
|
2021-05-02 19:21:59 +02:00 |
Martin Kroeker
|
dc3664993c
|
Merge pull request #26 from xianyi/develop
rebase
|
2021-05-02 19:19:28 +02:00 |
Martin Kroeker
|
b8232c9054
|
Merge pull request #3213 from martin-frbg/lapack382
Avoid allocating the transposed triangular matrix in LAPACKE_xlantr_work (Reference-LAPACK 382)
|
2021-05-02 18:45:15 +02:00 |
Martin Kroeker
|
114bbbc6d7
|
Merge pull request #3212 from martin-frbg/lapack463
Initialize X and Y to zero for N=0 in xGGGLM (Reference-LAPACK PR463)
|
2021-05-02 18:44:59 +02:00 |
Martin Kroeker
|
b67a92c19f
|
Merge pull request #3211 from martin-frbg/lapack471
Handle norm NaN value in xGESDD (Reference LAPACK PR471)
|
2021-05-02 18:44:29 +02:00 |
Martin Kroeker
|
4bf00da8fb
|
Avoid allocating the transposed triangular matrix (Reference-LAPACK PR382)
|
2021-05-02 12:18:17 +02:00 |
Martin Kroeker
|
c26780d451
|
Initialize X and Y to zero for N=0 (Reference-LAPACK PR463)
|
2021-05-02 11:40:56 +02:00 |
Martin Kroeker
|
d77d9bc920
|
Handle norm NaN value (Reference LAPACK PR471)
|
2021-05-02 11:24:50 +02:00 |
Martin Kroeker
|
37d3e2bd94
|
Merge pull request #3210 from martin-frbg/lapack502
Fix possible division by zero in LAPACK xTGSJA (Reference-LAPACK PR502)
|
2021-05-02 09:02:11 +02:00 |
Martin Kroeker
|
de8656769c
|
Fix possible division by zero in xTGSJA (Reference-LAPACK PR502)
|
2021-05-01 21:31:13 +02:00 |
Martin Kroeker
|
d43e07198d
|
Merge pull request #3208 from martin-frbg/lapack534
Apply MKL team fixes to the LAPACKE interfaces (Reference-LAPACK PR 534)
|
2021-05-01 20:18:29 +02:00 |
Martin Kroeker
|
da16764c7a
|
Merge pull request #3209 from martin-frbg/issue3160
Add casts to prevent overflow of intermediate results
|
2021-05-01 20:08:24 +02:00 |
Martin Kroeker
|
98ebc8ac59
|
Add casts to prevent overflow of intermediate result
|
2021-05-01 14:48:19 +02:00 |
Martin Kroeker
|
904b221f03
|
Add cast to prevent overflow of intermediate result
|
2021-05-01 14:47:22 +02:00 |
Martin Kroeker
|
5cc35abc3d
|
Apply MKL team fixes to the LAPACKE interfaces (Reference-LAPACK PR 534)
Removed spurious checks for INFO in xLACPY,xLASET after routines not returning any,and redundant requirements for ldvt in xGESVD_WORK
|
2021-05-01 13:22:10 +02:00 |
Martin Kroeker
|
254774f5a6
|
Add const qualifiers
|
2021-05-01 13:10:16 +02:00 |
Martin Kroeker
|
ae9cdee753
|
Merge pull request #3207 from hjl-tools/hjl/cet/develop
x86: Enable Intel CET
|
2021-05-01 12:42:54 +02:00 |
H.J. Lu
|
53ee0b76bb
|
x86: Enable Intel CET
When Intel CET is enabled, we need to include <cet.h> in assembly codes
to mark Intel CET support and place _CET_ENDBR at the function entry.
|
2021-04-30 19:45:39 -07:00 |
Martin Kroeker
|
dc6b04c375
|
Merge pull request #3206 from martin-frbg/lapack480535
Import packing improvements to LAPACK xLAQR from Reference-LAPACK (PR 480+535)
|
2021-04-30 21:42:44 +02:00 |
pnp
|
3d4ccd2a13
|
fix for build error
|
2021-04-30 12:25:33 -04:00 |
pnp
|
c59652f0ce
|
optimize on sgemv_n for small n
|
2021-04-30 12:14:58 -04:00 |
Martin Kroeker
|
87d2e314db
|
Import packing improvements in LAPACK xLAQR from Reference-LAPACK PR 480+535
|
2021-04-30 13:50:55 +02:00 |
Martin Kroeker
|
3a30c12019
|
Merge pull request #25 from xianyi/develop
rebase
|
2021-04-30 13:47:17 +02:00 |
Martin Kroeker
|
c9a82f54d1
|
Merge pull request #3204 from martin-frbg/lapack506
Correct INFO value returned by SLASQ2/DLASQ2 (Reference-LAPACK 506)
|
2021-04-30 13:25:48 +02:00 |
Martin Kroeker
|
444cb78be5
|
correct INFO value (Reference-LAPACK 506)
|
2021-04-30 09:26:54 +02:00 |
Martin Kroeker
|
171c20e3b6
|
Merge pull request #3202 from martin-frbg/issue3201
Fix division by zero in the non-x86 codepath of C/ZROTG
|
2021-04-29 18:58:27 +02:00 |
Martin Kroeker
|
c5fb91f1bc
|
Fix division by zero in the non-x86 codepath
|
2021-04-29 09:47:18 +02:00 |
Martin Kroeker
|
9a36a283d3
|
Merge pull request #3199 from martin-frbg/lapack537
Add LAPACKE fixes from Reference-LAPACK PR 537
|
2021-04-29 05:39:50 +02:00 |
Martin Kroeker
|
7e35d25ea0
|
Merge pull request #3198 from martin-frbg/lapack539
Apply fixes from Reference-LAPACK PR468 and 539 for array declarations in ?ORGBR/?UNGBR
|
2021-04-29 05:39:35 +02:00 |
Martin Kroeker
|
3704f5e5b0
|
Add missing break statements in the ?lascl functions
|
2021-04-28 20:56:55 +02:00 |
Martin Kroeker
|
6b76066632
|
Add const qualifiers
|
2021-04-28 20:55:37 +02:00 |
Martin Kroeker
|
2b01132515
|
Clean up misdeclaration of the dummy stand-in for A in ?ORGBR/?UNGBR workspace queries (Reference-LAPACK PR 468 and 530)
|
2021-04-28 19:20:08 +02:00 |
Martin Kroeker
|
8e95a1e18d
|
Merge pull request #3195 from martin-frbg/lapack536
Apply lapack-testing fix from Reference-LAPACK PR536
|
2021-04-28 18:17:25 +02:00 |
Wangyang Guo
|
aa7b3dc3db
|
GEMM: skylake: improve the performance when m is small
|
2021-04-28 13:56:06 +00:00 |
Martin Kroeker
|
13a29d13fd
|
Apply lapack-testing fix from Reference-LAPACK PR536
fixes changing back from a single OMP thread for error exit testing to the originally requested number of threads for computational tests
|
2021-04-27 15:48:22 +02:00 |
Martin Kroeker
|
a6c2cb8417
|
Merge pull request #3193 from martin-frbg/lapack538
Apply lapack-testing fixes from Reference-LAPACK PR538
|
2021-04-27 15:40:51 +02:00 |
Martin Kroeker
|
d511a7bb4f
|
Merge pull request #3191 from martin-frbg/issue3188
Delay creation of the (soft)link until after the library has been built
|
2021-04-27 13:35:16 +02:00 |
Martin Kroeker
|
3526ff2507
|
Apply fixes from Reference-LAPACK PR538
|
2021-04-27 12:52:49 +02:00 |
Martin Kroeker
|
adcfe7b789
|
Merge pull request #3190 from martin-frbg/issue3128-2
Replace spurious AVX512 requirement in the Haswell drot microkernel with an AVX2/FMA3 guard
|
2021-04-27 06:36:28 +02:00 |
damonyu
|
ceb44bef14
|
update the intrinsic api to the offical name.
|
2021-04-27 11:12:29 +08:00 |
damonyu1989
|
ed473267df
|
Merge pull request #1 from xianyi/develop
update
|
2021-04-27 10:53:59 +08:00 |
Martin Kroeker
|
0608bc5d82
|
delay creation of the softlink until after the library has been created
|
2021-04-26 22:32:23 +02:00 |