Commit Graph

7452 Commits

Author SHA1 Message Date
Martin Kroeker 1217eb910d
Fix copy-paste errors in variables used 2021-05-28 09:38:48 +02:00
Martin Kroeker d6d7a6685d
Add shortcuts for (small) cases that do not need expensive buffer allocation 2021-05-27 22:39:18 +02:00
Martin Kroeker f0e7345fb8
Add shortcut for small-size gemv_n with increments of one 2021-05-26 22:02:34 +02:00
Martin Kroeker 42f048cf6c
Merge pull request #3249 from MikaelUrankar/develop
Fix typo
2021-05-26 15:26:30 +02:00
MikaelUrankar 4fbc0777f4
Fix typo 2021-05-26 12:14:57 +02:00
Martin Kroeker d7472606d5
Merge pull request #3244 from martin-frbg/issue3237
Add fast path for small xSYR with INCX==1
2021-05-22 22:38:09 +02:00
Martin Kroeker 03297ff9f0
Add fast path for small xSYR with INCX==1 2021-05-22 20:41:18 +02:00
Martin Kroeker 2d8d0af0ea
Merge pull request #3243 from martin-frbg/lapack564
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
2021-05-22 19:25:56 +02:00
Martin Kroeker 5f677e782e
Merge pull request #3196 from guowangy/skylakex-gemm-batch-k
GEMM: skylake: improve the performance when m is small
2021-05-22 19:25:28 +02:00
Martin Kroeker 04c60cee5d
Merge pull request #3242 from martin-frbg/issue3239
Handle inadvertent use of DYNAMIC_ARCH=0
2021-05-22 19:24:46 +02:00
Martin Kroeker 3a53207cc9
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564) 2021-05-22 14:29:45 +02:00
Martin Kroeker 0e73d20629
Handle inadvertent use of DYNAMIC_ARCH=0 2021-05-22 14:23:49 +02:00
Martin Kroeker 02087a62e7
Merge pull request #3205 from intelmy/sgemv_n_opt
optimize on sgemv_n for small n
2021-05-17 17:49:01 +02:00
Martin Kroeker 03b4d79a7e
Merge pull request #3238 from martin-frbg/lapack555
Correct function name in error message from SLASQ2 (LAPACK PR555)
2021-05-17 17:32:23 +02:00
Martin Kroeker 5c729c6dce
Correct function name in error message from SLASQ2 (Reference-LAPACK PR 555) 2021-05-17 14:47:14 +02:00
Martin Kroeker e1911b2e60
Merge pull request #3236 from martin-frbg/issue3234
Add -lm for FreeBSD on ARM/ARM64
2021-05-16 17:17:18 +02:00
Martin Kroeker 8f33da4f94
Merge pull request #3235 from dnoan/develop
Update Makefile.arm64
2021-05-16 17:15:45 +02:00
Martin Kroeker 26ccf643a3
Add -lm for FreeBSD on ARM/ARM64 2021-05-16 13:04:38 +02:00
Noan 32264ba496
Update Makefile.arm64
Added -march and -mtune flags for EMAG processors when GCC 9 or later
2021-05-16 09:49:13 +00:00
Martin Kroeker 4ecf631f95
Merge pull request #3228 from martin-frbg/issue3226
filter out -mavx flag on Sandybridge zgemm/ztrmm kernels
2021-05-15 09:06:12 +02:00
Martin Kroeker 5af510081d
Merge pull request #3233 from martin-frbg/issue3230
Add autodetection for Intel Ice Lake SP
2021-05-15 01:04:09 +02:00
Martin Kroeker 164551d5a2
Merge pull request #3232 from martin-frbg/lapack553
Reduce stack size requirements in the LAPACK LIN tests (LAPACK PR 553)
2021-05-14 23:28:45 +02:00
Martin Kroeker 310b76aad7
Merge pull request #3231 from martin-frbg/issue3227
Support compilation with pre-C99 versions of MSVC
2021-05-14 23:28:06 +02:00
Martin Kroeker c4da892ba0
Only filter out -mavx on Sandybridge ZGEMM/ZTRMM kernels 2021-05-14 23:19:10 +02:00
Martin Kroeker cbfd3c87e1
Recognize Intel Ice Lake SP as Cooper Lake 2021-05-14 20:44:06 +02:00
Martin Kroeker 26e87ac517
Support Intel Ice Lake SP as Cooper Lake 2021-05-14 20:39:55 +02:00
Martin Kroeker 15b9d6b4a7
Delete zchkaa.f 2021-05-14 19:55:31 +02:00
Martin Kroeker f7bcd962c1
Delete schkaa.f 2021-05-14 19:54:54 +02:00
Martin Kroeker 93cc066921
Delete dchkaa.f 2021-05-14 19:54:13 +02:00
Martin Kroeker 2c7d4a7766
Delete cchkaa.f 2021-05-14 19:53:38 +02:00
Martin Kroeker eef1c42f03
Convert ?chkaa to use dynamic allocation for the larger arrays 2021-05-14 19:53:03 +02:00
Martin Kroeker 73f637e584
Support compilation with pre-C99 versions of MSVC 2021-05-14 15:08:12 +02:00
Martin Kroeker 8b90e5f202
Drop redundant inclusion of complex.h 2021-05-14 15:06:44 +02:00
Martin Kroeker bd60fb6ffc
filter out -mavx flag on zgemm kernels as it can cause problems with older gcc 2021-05-13 23:05:00 +02:00
Martin Kroeker 37ea8702ee
Merge pull request #3192 from damonyu1989/develop
Update the intrinsic api to the offical name.
2021-05-11 16:00:45 +02:00
Martin Kroeker ec7d6c02bc
Add an Android crossbuild on OSX to Azure CI (#3224)
* Add an Android crossbuild on OSX
2021-05-10 08:02:01 +02:00
Martin Kroeker c90c23e78f
Merge pull request #3223 from martin-frbg/develop
Use percent instead of ampersand as placeholder for substitutions
2021-05-07 08:51:45 +02:00
Martin Kroeker bda8820da7
Use percent instead of ampersand as placeholder for substitutions 2021-05-06 20:20:08 +02:00
Martin Kroeker c0ca63ea46
Fix missing conditionals for non-SKX kernels 2021-05-05 14:55:36 +02:00
Martin Kroeker f497bb949b
Merge pull request #3219 from austinpagan/Gemm.ErrorFix
Add error message token for SBGEMM in gemm.c
2021-05-05 14:30:41 +02:00
Martin Kroeker f86b1bc3da
Merge pull request #3220 from drhpc/drhpc-fixup
Delete lapack_wrappers.c.orig
2021-05-05 14:30:24 +02:00
drhpc 206e03fdac
Delete lapack_wrappers.c.orig
This looks like a leftover from patching and confuses further patching;-)
2021-05-04 21:02:07 +02:00
Gordon Fossum 8b599836db
Add error message token for SBGEMM in gemm.c 2021-05-04 13:55:02 -05:00
Martin Kroeker 9721b57ecf
Update version to 0.3.15.dev 2021-05-03 00:01:08 +02:00
Martin Kroeker 380f955078
Update version to 0.3.15.dev 2021-05-03 00:00:29 +02:00
Martin Kroeker 49d18e65e3
Merge pull request #3217 from xianyi/release-0.3.0
merge 0.3.15 back into develop to copy tag
2021-05-02 23:59:55 +02:00
Martin Kroeker 904f9a267d
Update version to 0.3.15 2021-05-02 23:50:22 +02:00
Martin Kroeker 4c033730bb
Update version to 0.3.15 2021-05-02 23:49:49 +02:00
Martin Kroeker 65502c6af6
Merge pull request #3216 from xianyi/develop
Update from develop for 0.3.15 release
2021-05-02 23:48:28 +02:00
Martin Kroeker f71627fa2e
Merge pull request #3215 from martin-frbg/cl0315
Update Changelog for 0.3.15
2021-05-02 23:47:24 +02:00