Martin Kroeker
|
1217eb910d
|
Fix copy-paste errors in variables used
|
2021-05-28 09:38:48 +02:00 |
Martin Kroeker
|
d6d7a6685d
|
Add shortcuts for (small) cases that do not need expensive buffer allocation
|
2021-05-27 22:39:18 +02:00 |
Martin Kroeker
|
f0e7345fb8
|
Add shortcut for small-size gemv_n with increments of one
|
2021-05-26 22:02:34 +02:00 |
Martin Kroeker
|
42f048cf6c
|
Merge pull request #3249 from MikaelUrankar/develop
Fix typo
|
2021-05-26 15:26:30 +02:00 |
MikaelUrankar
|
4fbc0777f4
|
Fix typo
|
2021-05-26 12:14:57 +02:00 |
Martin Kroeker
|
d7472606d5
|
Merge pull request #3244 from martin-frbg/issue3237
Add fast path for small xSYR with INCX==1
|
2021-05-22 22:38:09 +02:00 |
Martin Kroeker
|
03297ff9f0
|
Add fast path for small xSYR with INCX==1
|
2021-05-22 20:41:18 +02:00 |
Martin Kroeker
|
2d8d0af0ea
|
Merge pull request #3243 from martin-frbg/lapack564
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
|
2021-05-22 19:25:56 +02:00 |
Martin Kroeker
|
5f677e782e
|
Merge pull request #3196 from guowangy/skylakex-gemm-batch-k
GEMM: skylake: improve the performance when m is small
|
2021-05-22 19:25:28 +02:00 |
Martin Kroeker
|
04c60cee5d
|
Merge pull request #3242 from martin-frbg/issue3239
Handle inadvertent use of DYNAMIC_ARCH=0
|
2021-05-22 19:24:46 +02:00 |
Martin Kroeker
|
3a53207cc9
|
Fix spurious error exit test failures in the ?chktsqr tests (LAPACK564)
|
2021-05-22 14:29:45 +02:00 |
Martin Kroeker
|
0e73d20629
|
Handle inadvertent use of DYNAMIC_ARCH=0
|
2021-05-22 14:23:49 +02:00 |
Martin Kroeker
|
02087a62e7
|
Merge pull request #3205 from intelmy/sgemv_n_opt
optimize on sgemv_n for small n
|
2021-05-17 17:49:01 +02:00 |
Martin Kroeker
|
03b4d79a7e
|
Merge pull request #3238 from martin-frbg/lapack555
Correct function name in error message from SLASQ2 (LAPACK PR555)
|
2021-05-17 17:32:23 +02:00 |
Martin Kroeker
|
5c729c6dce
|
Correct function name in error message from SLASQ2 (Reference-LAPACK PR 555)
|
2021-05-17 14:47:14 +02:00 |
Martin Kroeker
|
e1911b2e60
|
Merge pull request #3236 from martin-frbg/issue3234
Add -lm for FreeBSD on ARM/ARM64
|
2021-05-16 17:17:18 +02:00 |
Martin Kroeker
|
8f33da4f94
|
Merge pull request #3235 from dnoan/develop
Update Makefile.arm64
|
2021-05-16 17:15:45 +02:00 |
Martin Kroeker
|
26ccf643a3
|
Add -lm for FreeBSD on ARM/ARM64
|
2021-05-16 13:04:38 +02:00 |
Noan
|
32264ba496
|
Update Makefile.arm64
Added -march and -mtune flags for EMAG processors when GCC 9 or later
|
2021-05-16 09:49:13 +00:00 |
Martin Kroeker
|
4ecf631f95
|
Merge pull request #3228 from martin-frbg/issue3226
filter out -mavx flag on Sandybridge zgemm/ztrmm kernels
|
2021-05-15 09:06:12 +02:00 |
Martin Kroeker
|
5af510081d
|
Merge pull request #3233 from martin-frbg/issue3230
Add autodetection for Intel Ice Lake SP
|
2021-05-15 01:04:09 +02:00 |
Martin Kroeker
|
164551d5a2
|
Merge pull request #3232 from martin-frbg/lapack553
Reduce stack size requirements in the LAPACK LIN tests (LAPACK PR 553)
|
2021-05-14 23:28:45 +02:00 |
Martin Kroeker
|
310b76aad7
|
Merge pull request #3231 from martin-frbg/issue3227
Support compilation with pre-C99 versions of MSVC
|
2021-05-14 23:28:06 +02:00 |
Martin Kroeker
|
c4da892ba0
|
Only filter out -mavx on Sandybridge ZGEMM/ZTRMM kernels
|
2021-05-14 23:19:10 +02:00 |
Martin Kroeker
|
cbfd3c87e1
|
Recognize Intel Ice Lake SP as Cooper Lake
|
2021-05-14 20:44:06 +02:00 |
Martin Kroeker
|
26e87ac517
|
Support Intel Ice Lake SP as Cooper Lake
|
2021-05-14 20:39:55 +02:00 |
Martin Kroeker
|
15b9d6b4a7
|
Delete zchkaa.f
|
2021-05-14 19:55:31 +02:00 |
Martin Kroeker
|
f7bcd962c1
|
Delete schkaa.f
|
2021-05-14 19:54:54 +02:00 |
Martin Kroeker
|
93cc066921
|
Delete dchkaa.f
|
2021-05-14 19:54:13 +02:00 |
Martin Kroeker
|
2c7d4a7766
|
Delete cchkaa.f
|
2021-05-14 19:53:38 +02:00 |
Martin Kroeker
|
eef1c42f03
|
Convert ?chkaa to use dynamic allocation for the larger arrays
|
2021-05-14 19:53:03 +02:00 |
Martin Kroeker
|
73f637e584
|
Support compilation with pre-C99 versions of MSVC
|
2021-05-14 15:08:12 +02:00 |
Martin Kroeker
|
8b90e5f202
|
Drop redundant inclusion of complex.h
|
2021-05-14 15:06:44 +02:00 |
Martin Kroeker
|
bd60fb6ffc
|
filter out -mavx flag on zgemm kernels as it can cause problems with older gcc
|
2021-05-13 23:05:00 +02:00 |
Martin Kroeker
|
37ea8702ee
|
Merge pull request #3192 from damonyu1989/develop
Update the intrinsic api to the official name.
|
2021-05-11 16:00:45 +02:00 |
Martin Kroeker
|
ec7d6c02bc
|
Add an Android crossbuild on OSX to Azure CI (#3224)
* Add an Android crossbuild on OSX
|
2021-05-10 08:02:01 +02:00 |
Martin Kroeker
|
c90c23e78f
|
Merge pull request #3223 from martin-frbg/develop
Use percent instead of ampersand as placeholder for substitutions
|
2021-05-07 08:51:45 +02:00 |
Martin Kroeker
|
bda8820da7
|
Use percent instead of ampersand as placeholder for substitutions
|
2021-05-06 20:20:08 +02:00 |
Martin Kroeker
|
c0ca63ea46
|
Fix missing conditionals for non-SKX kernels
|
2021-05-05 14:55:36 +02:00 |
Martin Kroeker
|
f497bb949b
|
Merge pull request #3219 from austinpagan/Gemm.ErrorFix
Add error message token for SBGEMM in gemm.c
|
2021-05-05 14:30:41 +02:00 |
Martin Kroeker
|
f86b1bc3da
|
Merge pull request #3220 from drhpc/drhpc-fixup
Delete lapack_wrappers.c.orig
|
2021-05-05 14:30:24 +02:00 |
drhpc
|
206e03fdac
|
Delete lapack_wrappers.c.orig
This looks like a leftover from patching and confuses further patching;-)
|
2021-05-04 21:02:07 +02:00 |
Gordon Fossum
|
8b599836db
|
Add error message token for SBGEMM in gemm.c
|
2021-05-04 13:55:02 -05:00 |
Martin Kroeker
|
9721b57ecf
|
Update version to 0.3.15.dev
|
2021-05-03 00:01:08 +02:00 |
Martin Kroeker
|
380f955078
|
Update version to 0.3.15.dev
|
2021-05-03 00:00:29 +02:00 |
Martin Kroeker
|
49d18e65e3
|
Merge pull request #3217 from xianyi/release-0.3.0
merge 0.3.15 back into develop to copy tag
|
2021-05-02 23:59:55 +02:00 |
Martin Kroeker
|
904f9a267d
|
Update version to 0.3.15
|
2021-05-02 23:50:22 +02:00 |
Martin Kroeker
|
4c033730bb
|
Update version to 0.3.15
|
2021-05-02 23:49:49 +02:00 |
Martin Kroeker
|
65502c6af6
|
Merge pull request #3216 from xianyi/develop
Update from develop for 0.3.15 release
|
2021-05-02 23:48:28 +02:00 |
Martin Kroeker
|
f71627fa2e
|
Merge pull request #3215 from martin-frbg/cl0315
Update Changelog for 0.3.15
|
2021-05-02 23:47:24 +02:00 |