Martin Kroeker
|
5f677e782e
|
Merge pull request #3196 from guowangy/skylakex-gemm-batch-k
GEMM: skylake: improve the performance when m is small
|
2021-05-22 19:25:28 +02:00 |
Martin Kroeker
|
04c60cee5d
|
Merge pull request #3242 from martin-frbg/issue3239
Handle inadvertent use of DYNAMIC_ARCH=0
|
2021-05-22 19:24:46 +02:00 |
Martin Kroeker
|
3a53207cc9
|
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
|
2021-05-22 14:29:45 +02:00 |
Martin Kroeker
|
0e73d20629
|
Handle inadvertent use of DYNAMIC_ARCH=0
|
2021-05-22 14:23:49 +02:00 |
Martin Kroeker
|
02087a62e7
|
Merge pull request #3205 from intelmy/sgemv_n_opt
optimize on sgemv_n for small n
|
2021-05-17 17:49:01 +02:00 |
Martin Kroeker
|
03b4d79a7e
|
Merge pull request #3238 from martin-frbg/lapack555
Correct function name in error message from SLASQ2 (LAPACK PR555)
|
2021-05-17 17:32:23 +02:00 |
Martin Kroeker
|
5c729c6dce
|
Correct function name in error message from SLASQ2 (Reference-LAPACK PR 555)
|
2021-05-17 14:47:14 +02:00 |
Martin Kroeker
|
e1911b2e60
|
Merge pull request #3236 from martin-frbg/issue3234
Add -lm for FreeBSD on ARM/ARM64
|
2021-05-16 17:17:18 +02:00 |
Martin Kroeker
|
8f33da4f94
|
Merge pull request #3235 from dnoan/develop
Update Makefile.arm64
|
2021-05-16 17:15:45 +02:00 |
Martin Kroeker
|
26ccf643a3
|
Add -lm for FreeBSD on ARM/ARM64
|
2021-05-16 13:04:38 +02:00 |
Noan
|
32264ba496
|
Update Makefile.arm64
Added -march and -mtune flags for EMAG processors when GCC 9 or later
|
2021-05-16 09:49:13 +00:00 |
Martin Kroeker
|
4ecf631f95
|
Merge pull request #3228 from martin-frbg/issue3226
filter out -mavx flag on Sandybridge zgemm/ztrmm kernels
|
2021-05-15 09:06:12 +02:00 |
Martin Kroeker
|
5af510081d
|
Merge pull request #3233 from martin-frbg/issue3230
Add autodetection for Intel Ice Lake SP
|
2021-05-15 01:04:09 +02:00 |
Martin Kroeker
|
164551d5a2
|
Merge pull request #3232 from martin-frbg/lapack553
Reduce stack size requirements in the LAPACK LIN tests (LAPACK PR 553)
|
2021-05-14 23:28:45 +02:00 |
Martin Kroeker
|
310b76aad7
|
Merge pull request #3231 from martin-frbg/issue3227
Support compilation with pre-C99 versions of MSVC
|
2021-05-14 23:28:06 +02:00 |
Martin Kroeker
|
c4da892ba0
|
Only filter out -mavx on Sandybridge ZGEMM/ZTRMM kernels
|
2021-05-14 23:19:10 +02:00 |
Martin Kroeker
|
cbfd3c87e1
|
Recognize Intel Ice Lake SP as Cooper Lake
|
2021-05-14 20:44:06 +02:00 |
Martin Kroeker
|
26e87ac517
|
Support Intel Ice Lake SP as Cooper Lake
|
2021-05-14 20:39:55 +02:00 |
Martin Kroeker
|
15b9d6b4a7
|
Delete zchkaa.f
|
2021-05-14 19:55:31 +02:00 |
Martin Kroeker
|
f7bcd962c1
|
Delete schkaa.f
|
2021-05-14 19:54:54 +02:00 |
Martin Kroeker
|
93cc066921
|
Delete dchkaa.f
|
2021-05-14 19:54:13 +02:00 |
Martin Kroeker
|
2c7d4a7766
|
Delete cchkaa.f
|
2021-05-14 19:53:38 +02:00 |
Martin Kroeker
|
eef1c42f03
|
Convert ?chkaa to use dynamic allocation for the larger arrays
|
2021-05-14 19:53:03 +02:00 |
Martin Kroeker
|
73f637e584
|
Support compilation with pre-C99 versions of MSVC
|
2021-05-14 15:08:12 +02:00 |
Martin Kroeker
|
8b90e5f202
|
Drop redundant inclusion of complex.h
|
2021-05-14 15:06:44 +02:00 |
Martin Kroeker
|
bd60fb6ffc
|
filter out -mavx flag on zgemm kernels as it can cause problems with older gcc
|
2021-05-13 23:05:00 +02:00 |
Martin Kroeker
|
37ea8702ee
|
Merge pull request #3192 from damonyu1989/develop
Update the intrinsic api to the offical name.
|
2021-05-11 16:00:45 +02:00 |
Martin Kroeker
|
ec7d6c02bc
|
Add an Android crossbuild on OSX to Azure CI (#3224)
* Add an Android crossbuild on OSX
|
2021-05-10 08:02:01 +02:00 |
Martin Kroeker
|
c90c23e78f
|
Merge pull request #3223 from martin-frbg/develop
Use percent instead of ampersand as placeholder for substitutions
|
2021-05-07 08:51:45 +02:00 |
Martin Kroeker
|
bda8820da7
|
Use percent instead of ampersand as placeholder for substitutions
|
2021-05-06 20:20:08 +02:00 |
Martin Kroeker
|
c0ca63ea46
|
Fix missing conditionals for non-SKX kernels
|
2021-05-05 14:55:36 +02:00 |
Martin Kroeker
|
f497bb949b
|
Merge pull request #3219 from austinpagan/Gemm.ErrorFix
Add error message token for SBGEMM in gemm.c
|
2021-05-05 14:30:41 +02:00 |
Martin Kroeker
|
f86b1bc3da
|
Merge pull request #3220 from drhpc/drhpc-fixup
Delete lapack_wrappers.c.orig
|
2021-05-05 14:30:24 +02:00 |
drhpc
|
206e03fdac
|
Delete lapack_wrappers.c.orig
This looks like a leftover from patching and confuses further patching;-)
|
2021-05-04 21:02:07 +02:00 |
Gordon Fossum
|
8b599836db
|
Add error message token for SBGEMM in gemm.c
|
2021-05-04 13:55:02 -05:00 |
Martin Kroeker
|
9721b57ecf
|
Update version to 0.3.15.dev
|
2021-05-03 00:01:08 +02:00 |
Martin Kroeker
|
380f955078
|
Update version to 0.3.15.dev
|
2021-05-03 00:00:29 +02:00 |
Martin Kroeker
|
49d18e65e3
|
Merge pull request #3217 from xianyi/release-0.3.0
merge 0.3.15 back into develop to copy tag
|
2021-05-02 23:59:55 +02:00 |
Martin Kroeker
|
904f9a267d
|
Update version to 0.3.15
|
2021-05-02 23:50:22 +02:00 |
Martin Kroeker
|
4c033730bb
|
Update version to 0.3.15
|
2021-05-02 23:49:49 +02:00 |
Martin Kroeker
|
65502c6af6
|
Merge pull request #3216 from xianyi/develop
Update from develop for 0.3.15 release
|
2021-05-02 23:48:28 +02:00 |
Martin Kroeker
|
f71627fa2e
|
Merge pull request #3215 from martin-frbg/cl0315
Update Changelog for 0.3.15
|
2021-05-02 23:47:24 +02:00 |
Martin Kroeker
|
d8d7bd33cb
|
Update Changelog for 0.3.15
|
2021-05-02 23:46:55 +02:00 |
Martin Kroeker
|
e72420e8c5
|
Merge pull request #3214 from martin-frbg/lapack-3.9.1hrt
Add new Householder Reconstruction functions from LAPACK 3.9.1
|
2021-05-02 23:40:03 +02:00 |
Martin Kroeker
|
d00709e016
|
Add files via upload
|
2021-05-02 20:47:58 +02:00 |
Martin Kroeker
|
d444344497
|
Add LAPACKE interfaces for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:57:47 +02:00 |
Martin Kroeker
|
fb7308b9b5
|
Add entries for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:56:11 +02:00 |
Martin Kroeker
|
db50b24a4a
|
Add entries for the new Householder Reconstruction functions from 3.9.1
|
2021-05-02 19:55:15 +02:00 |
Martin Kroeker
|
88b70fba3e
|
Add new tests for Householder reconstruction functions from 3.9.1
|
2021-05-02 19:28:21 +02:00 |
Martin Kroeker
|
4c1d47098b
|
Add new files for Householder reconstruction functions from 3.9.1
|
2021-05-02 19:25:43 +02:00 |