Martin Kroeker
04255be948
Merge pull request #3344 from gxw-loongson/develop
...
Delete the macro instruction "li" and use "li.d" instead
2021-08-12 15:16:46 +02:00
gxw
a7bc8ec1f1
Delete the macro instruction "li" and use "li.d" instead
...
Change-Id: Icff7981e2eb7df29ba5af1f8eb5be8443c67450f
2021-08-12 17:02:54 +08:00
Martin Kroeker
8cd2b32fef
Merge pull request #3343 from cianciosa/develop
...
Fix undefined behavior checking the size of ARGC
2021-08-12 01:28:18 +02:00
cianciosa
4c766cd11f
Fix a small syntax error. A ( was accidently deleted.
2021-08-11 12:08:34 -04:00
cianciosa
c28560129f
Check the total number of arguments passed insead of if the ARGV# is defined. This fixes a problem when compling openblas as a subproject of another code.
2021-08-11 12:00:07 -04:00
Martin Kroeker
b9e4fb206d
Merge pull request #3341 from RajalakshmiSR/dasump10
...
POWER10: Improving dasum performance
2021-08-11 09:39:10 +02:00
Rajalakshmi Srinivasaraghavan
b06880c2cd
POWER10: Improving dasum performance
...
Unrolling a loop in dasum micro code to help in improving
POWER10 performance.
2021-08-10 22:06:04 -05:00
Zhang Xianyi
cbc583eb54
Merge pull request #3336 from martin-frbg/traviscom
...
Disable all x86 jobs on Travis
2021-08-05 19:13:19 +08:00
Martin Kroeker
e5ba7c3235
Disable all x86 jobs
2021-08-05 11:08:18 +02:00
Martin Kroeker
435d84a7ce
Merge pull request #3332 from martin-frbg/travisbadge
...
Update Travis badge in README
2021-08-05 09:36:59 +02:00
Martin Kroeker
139f632ca4
Merge pull request #3334 from Guobing-Chen/BF16_gemm_full_kernel
...
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
2021-08-05 08:01:13 +02:00
Chen, Guobing
5d86becdae
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
...
Added all SBGEMM kernels including NN/NT/TN/TT for both ColMajor and
RowMajor, based on AVX512-BF16 ISA set on IA.
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
2021-08-05 11:11:29 +08:00
Martin Kroeker
93c8bafff5
Update Travis badge in README
2021-08-03 10:45:45 +02:00
Martin Kroeker
b5858c4472
Merge pull request #3330 from xianyi/issue3321
...
Improve the "tried to allocate too many buffers" error message
2021-08-02 22:36:05 +02:00
Martin Kroeker
898212efcd
Actually add the message to the TLS section
2021-08-02 14:50:14 +02:00
Martin Kroeker
210a1584c5
Rebase source and edit TLS version of the message as well
2021-08-02 14:19:16 +02:00
Martin Kroeker
e6d6d3ee43
Merge pull request #3331 from gxw-loongson/develop
...
Fixed typos about LOONGARCH64
2021-08-02 07:21:46 +02:00
gxw
0b8f7c8c10
Add cmake support for LOONGARCH64
2021-08-02 10:00:41 +08:00
Martin Kroeker
f2a7a67f5a
Improve the "tried to allocate too many buffers" error message
2021-07-31 17:23:40 +02:00
Martin Kroeker
e0e88f9edc
Merge pull request #3329 from martin-frbg/issue3272
...
Work around gcc11+ miscompiling C/ZBLAS3 tests at -O3
2021-07-30 20:39:38 +02:00
Martin Kroeker
5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
2021-07-30 14:46:19 +02:00
Martin Kroeker
e78fbe4654
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
2021-07-30 14:44:54 +02:00
Martin Kroeker
b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
2021-07-30 14:21:08 +02:00
Martin Kroeker
cbc41973fd
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
2021-07-30 14:20:12 +02:00
gxw
34207bdf5b
Fixed typos about LOONGARCH64
2021-07-30 18:11:12 +08:00
Martin Kroeker
1b6db3dbba
Merge pull request #3327 from h-vetinari/lapack597_redux
...
Complete the carry of lapack PR 597
2021-07-28 23:04:02 +02:00
Martin Kroeker
f681553c6a
Merge pull request #3326 from wattoc/develop
...
Include Haiku in processor count checks
2021-07-28 23:03:37 +02:00
Martin Kroeker
afadeeba2a
Merge pull request #3325 from gxw-loongson/develop
...
Add support for LOONGARCH64
2021-07-28 23:03:15 +02:00
Isuru Fernando
02d4a49761
Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS
2021-07-27 23:44:51 +02:00
Craig Watson
4d7dfe4845
Include Haiku in processor count checks
2021-07-27 09:00:30 +00:00
gxw
af0a69f355
Add support for LOONGARCH64
2021-07-27 15:29:12 +08:00
Martin Kroeker
5a2fe5bfb9
Merge pull request #3323 from martin-frbg/issue3322
...
GCC did not support -mtune for ARM64 before 5.1
2021-07-23 22:46:02 +02:00
Martin Kroeker
342d3e8b5c
Merge pull request #3314 from martin-frbg/lapack597
...
Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)
2021-07-23 15:30:27 +02:00
Martin Kroeker
efbd7c7840
GCC did not support -mtune for ARM64 before 5.1
2021-07-23 13:42:52 +02:00
Martin Kroeker
3a7955cd93
Merge pull request #3320 from martin-frbg/issue3318
...
Empirical workaround for numpy SVD NaN problem from issue 3318
2021-07-22 21:28:50 +02:00
Martin Kroeker
47ba85f314
Fix regex to match kernels suffixed with cpuname too
2021-07-22 17:24:15 +02:00
Martin Kroeker
30f23be0f9
Rework setting of -mfma to only apply it where necessary
2021-07-22 12:00:03 +02:00
Martin Kroeker
49bbf330ca
Empirical workaround for numpy SVD NaN problem from issue 3318
2021-07-18 22:19:19 +02:00
Martin Kroeker
38d5b4b124
Update version to 0.3.17.dev
2021-07-15 15:00:01 +02:00
Martin Kroeker
6e3fbe8ac5
Update version to 0.3.17.dev
2021-07-15 14:59:15 +02:00
Martin Kroeker
86273392e5
Merge pull request #3317 from xianyi/release-0.3.0
...
merge 0.3.17 back into develop to copy tag
2021-07-15 14:58:20 +02:00
Martin Kroeker
d909f9f3d4
Update version to 0.3.17
2021-07-15 14:52:54 +02:00
Martin Kroeker
12d3d94e2e
Merge pull request #3316 from xianyi/develop
...
Merge develop for bugfix release 0.3.17
2021-07-15 14:51:50 +02:00
Martin Kroeker
f349be3bdb
Merge branch 'release-0.3.0' into develop
2021-07-15 14:50:20 +02:00
Martin Kroeker
4777eb678f
Update version to 0.3.17
2021-07-15 14:46:24 +02:00
Martin Kroeker
415876d117
Merge pull request #3315 from martin-frbg/changelog0317
...
Update Changelog for 0.3.17
2021-07-15 14:44:59 +02:00
Martin Kroeker
da8435dc36
Update Changelog for 0.3.17
2021-07-15 14:44:17 +02:00
Martin Kroeker
4c7065f3ee
Merge pull request #3313 from martin-frbg/3266-2
...
Remove BLASLONG casts from SPARC parameter entries
2021-07-15 08:00:57 +02:00
Martin Kroeker
f62bfaafe8
Merge pull request #3312 from martin-frbg/revert_3260
...
Temporarily disable the SkylakeX sgemv_t microkernel
2021-07-15 08:00:34 +02:00
Martin Kroeker
d947116390
Merge pull request #3311 from martin-frbg/issue3309
...
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe …
2021-07-15 07:58:47 +02:00