Commit Graph

7452 Commits

Author SHA1 Message Date
Martin Kroeker 5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:46:19 +02:00
Martin Kroeker e78fbe4654
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:44:54 +02:00
Martin Kroeker b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:21:08 +02:00
Martin Kroeker cbc41973fd
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:20:12 +02:00
gxw 34207bdf5b Fixed typos about LOONGARCH64 2021-07-30 18:11:12 +08:00
Martin Kroeker 1b6db3dbba
Merge pull request #3327 from h-vetinari/lapack597_redux
Complete the carry of lapack PR 597
2021-07-28 23:04:02 +02:00
Martin Kroeker f681553c6a
Merge pull request #3326 from wattoc/develop
Include Haiku in processor count checks
2021-07-28 23:03:37 +02:00
Martin Kroeker afadeeba2a
Merge pull request #3325 from gxw-loongson/develop
Add support for LOONGARCH64
2021-07-28 23:03:15 +02:00
Isuru Fernando 02d4a49761 Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS 2021-07-27 23:44:51 +02:00
Craig Watson 4d7dfe4845 Include Haiku in processor count checks 2021-07-27 09:00:30 +00:00
gxw af0a69f355 Add support for LOONGARCH64 2021-07-27 15:29:12 +08:00
Martin Kroeker 5a2fe5bfb9
Merge pull request #3323 from martin-frbg/issue3322
GCC did not support -mtune for ARM64 before 5.1
2021-07-23 22:46:02 +02:00
Martin Kroeker 342d3e8b5c
Merge pull request #3314 from martin-frbg/lapack597
Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)
2021-07-23 15:30:27 +02:00
Martin Kroeker efbd7c7840
GCC did not support -mtune for ARM64 before 5.1 2021-07-23 13:42:52 +02:00
Martin Kroeker 3a7955cd93
Merge pull request #3320 from martin-frbg/issue3318
Empirical workaround for numpy SVD NaN problem from issue 3318
2021-07-22 21:28:50 +02:00
Martin Kroeker 47ba85f314
Fix regex to match kernels suffixed with cpuname too 2021-07-22 17:24:15 +02:00
Martin Kroeker 30f23be0f9
Rework setting of -mfma to only apply it where necessary 2021-07-22 12:00:03 +02:00
Martin Kroeker 49bbf330ca
Empirical workaround for numpy SVD NaN problem from issue 3318 2021-07-18 22:19:19 +02:00
Martin Kroeker 38d5b4b124
Update version to 0.3.17.dev 2021-07-15 15:00:01 +02:00
Martin Kroeker 6e3fbe8ac5
Update version to 0.3.17.dev 2021-07-15 14:59:15 +02:00
Martin Kroeker 86273392e5
Merge pull request #3317 from xianyi/release-0.3.0
merge 0.3.17 back into develop to copy tag
2021-07-15 14:58:20 +02:00
Martin Kroeker d909f9f3d4
Update version to 0.3.17 2021-07-15 14:52:54 +02:00
Martin Kroeker 12d3d94e2e
Merge pull request #3316 from xianyi/develop
Merge develop for bugfix release 0.3.17
2021-07-15 14:51:50 +02:00
Martin Kroeker f349be3bdb
Merge branch 'release-0.3.0' into develop 2021-07-15 14:50:20 +02:00
Martin Kroeker 4777eb678f
Update version to 0.3.17 2021-07-15 14:46:24 +02:00
Martin Kroeker 415876d117
Merge pull request #3315 from martin-frbg/changelog0317
Update Changelog for 0.3.17
2021-07-15 14:44:59 +02:00
Martin Kroeker da8435dc36
Update Changelog for 0.3.17 2021-07-15 14:44:17 +02:00
Martin Kroeker 4c7065f3ee
Merge pull request #3313 from martin-frbg/3266-2
Remove BLASLONG casts from SPARC parameter entries
2021-07-15 08:00:57 +02:00
Martin Kroeker f62bfaafe8
Merge pull request #3312 from martin-frbg/revert_3260
Temporarily disable the SkylakeX sgemv_t microkernel
2021-07-15 08:00:34 +02:00
Martin Kroeker d947116390
Merge pull request #3311 from martin-frbg/issue3309
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe …
2021-07-15 07:58:47 +02:00
Martin Kroeker f176ff90af
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:42:43 +02:00
Martin Kroeker f4d4abd423
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:41:45 +02:00
Martin Kroeker 2b9443b7e7
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:40:29 +02:00
Martin Kroeker fe0e66564e
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 2021-07-14 22:39:20 +02:00
Martin Kroeker a6351e32f0
Remove BLASLONG casts from SPARC entries
in response to https://github.com/xianyi/OpenBLAS/pull/3266#issuecomment-878637675
2021-07-14 21:09:36 +02:00
Martin Kroeker 5b4b385ecf
Temporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures 2021-07-14 20:50:14 +02:00
Martin Kroeker 1dea57ab25
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64 2021-07-14 20:32:57 +02:00
Martin Kroeker 54ffe280df
Merge pull request #3310 from jeromerobert/develop
Avoid redefinition of _GNU_SOURCE
2021-07-14 16:06:13 +02:00
Jerome Robert 029d1e16b9 Avoid redefinition of _GNU_SOURCE
* _GNU_SOURCE may have been set by the application and redefinition
  trigger warnings or error with -Werror
* Fix for 220f6a1c5
2021-07-14 12:21:58 +02:00
Martin Kroeker ea8e208029
Merge pull request #3306 from jonaszhou1/develop
Add cpu detection support for Zhaoxin processors
2021-07-12 16:30:10 +02:00
JonasZhou 0fca36c8c3 Add cpu detection support for Zhaoxin processors
Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
2021-07-12 13:43:45 +08:00
Martin Kroeker 44cc7cdecc
Update version to 0.3.16.dev 2021-07-12 00:16:59 +02:00
Martin Kroeker 6492131792
Update version to 0.3.16.dev 2021-07-12 00:16:29 +02:00
Martin Kroeker 6c8ec55fb7
Merge pull request #3305 from xianyi/release-0.3.0
Merge release branch back into develop to copy tag
2021-07-12 00:15:52 +02:00
Martin Kroeker fab746240c
Merge pull request #3304 from xianyi/develop
Merge develop int0 0.3.0 for release 0.3.16
2021-07-12 00:12:52 +02:00
Martin Kroeker 847607c768
Merge branch 'release-0.3.0' into develop 2021-07-12 00:12:25 +02:00
Martin Kroeker 4c81d1c3fe
Update version to 0.3.16 2021-07-12 00:09:35 +02:00
Martin Kroeker db4908ebfa
Update version to 0.3.16 2021-07-12 00:08:55 +02:00
Martin Kroeker ed3eb18cb2
Merge pull request #3303 from martin-frbg/changelog16
Update Changelog for 0.3.16
2021-07-11 23:50:02 +02:00
Martin Kroeker 239ff330f8
Update Changelog for 0.3.16 2021-07-11 23:48:39 +02:00