Martin Kroeker
1f4b6a5d5d
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
...
from #2143 , -march=native precludes use of more specific options like -march=skylake-avx512 in individual kernels, and defeats the purpose of dynamic arch anyway.
2019-06-10 09:50:13 +02:00
Martin Kroeker
be8f70d269
Merge pull request #2157 from martin-frbg/2154-2
...
Add gfortran workaround for potential ABI violation
2019-06-09 12:19:08 +02:00
Martin Kroeker
e674e1c735
Update fc.cmake
2019-06-09 09:31:13 +02:00
Martin Kroeker
6ca898b63b
Add gfortran workaround for potential ABI violation
...
for #2154
2019-06-08 23:17:03 +02:00
Martin Kroeker
26411acd56
Merge pull request #2148 from TiborGY/cpp_thread_test_2
...
Thread safety tester using C++11 threading (cleaned history)
2019-06-07 13:23:07 +02:00
Martin Kroeker
0ab4076dd8
Merge pull request #2156 from martin-frbg/issue2154
...
Add gfortran workaround for C->FORTRAN ABI violation
2019-06-06 13:43:12 +02:00
Martin Kroeker
a0caa762b3
Add gfortran workaround for ABI violations
...
for #2154 (see gcc bug 90329)
2019-06-06 10:24:16 +02:00
Martin Kroeker
900d5a3205
Add gfortran workaround for ABI violations in LAPACKE
...
for #2154 (see gcc bug 90329)
2019-06-06 10:18:40 +02:00
Martin Kroeker
a17cf36225
Merge pull request #2153 from quickwritereader/develop
...
improved power9 zgemm,sgemm
2019-06-06 07:42:56 +02:00
AbdelRauf
148c4cc5fd
conflict resolve
2019-06-05 20:50:50 +00:00
AbdelRauf
d0c3543c3f
power9 zgemm ztrmm optimized
2019-06-05 20:07:16 +00:00
Martin Kroeker
909ad04aef
Merge pull request #2145 from martin-frbg/1912-3
...
Separate implementations of AMAX and IAMAX on arm
2019-06-05 20:27:45 +02:00
Martin Kroeker
417efd41c6
Merge pull request #2110 from pc2/cpu-detection
...
Fix detection of Skylake processors when using GCC
2019-06-05 20:27:05 +02:00
Michael Lass
9cdc828afa
c_check: Unlink correct file
2019-06-05 17:31:01 +02:00
Michael Lass
7a9a4dbc4f
Fix detection of AVX512 capable compilers in getarch
...
21eda8b5
introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
2019-06-05 17:30:56 +02:00
AbdelRauf
a469b32cf4
sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52
2019-06-04 07:11:30 +00:00
Martin Kroeker
27649b9543
Document NO_AVX512
...
for #2151
2019-06-03 11:01:33 +02:00
TiborGY
16f3df5d35
add c++ thread test option to Makefile.rule
2019-06-01 21:36:41 +02:00
TiborGY
1aded69821
hook up c++ thread safety test (main Makefile)
2019-06-01 21:32:52 +02:00
TiborGY
c00289ba54
upload thread safety test folder
2019-06-01 21:30:06 +02:00
AbdelRauf
8fe794f059
improved zgemm power9 based on power8
2019-05-30 15:31:25 +00:00
Martin Kroeker
74c10b57c6
Use generic kernels for complex (I)AMAX to support softfp
2019-05-30 11:38:11 +02:00
Martin Kroeker
c5495d2056
Ensure correct output for DAMAX with softfp
2019-05-30 11:25:43 +02:00
Martin Kroeker
c70496b108
Separate implementations of AMAX and IAMAX on arm
...
As noted in #1912 and comment on #1942 , the combined implementation happens to "do the right thing" on hardfp, but cannot return both value and index on softfp where they would have to share the return register
2019-05-29 15:02:51 +02:00
Martin Kroeker
ca8d8835f5
Merge pull request #2144 from xianyi/revert-2142-issue1912-2
...
Revert "Add softfp support in min/max kernels"
2019-05-29 14:09:10 +02:00
Martin Kroeker
d76b20b4d2
Revert "Add softfp support in min/max kernels"
2019-05-29 14:07:17 +02:00
Martin Kroeker
85af04da3c
Merge pull request #2142 from martin-frbg/issue1912-2
...
Add softfp support in min/max kernels
2019-05-28 22:56:08 +02:00
Martin Kroeker
11e0dcbffb
Merge pull request #2141 from martin-frbg/issue1912
...
Build and run utests independently of fortran
2019-05-28 20:50:40 +02:00
Martin Kroeker
79366ff7a9
Add softfp support in min/max kernels
...
fix for #1912
2019-05-28 20:34:22 +02:00
Martin Kroeker
21d05a4835
Merge pull request #2140 from martin-frbg/pgi19
...
Do not try ancient PGI hacks with recent versions of that compiler
2019-05-26 12:39:20 +02:00
Martin Kroeker
940f38f6dd
Build and run utests in any case, they do their own checks for fortran availability
2019-05-24 13:02:23 +02:00
Martin Kroeker
1778fd4219
Do not try ancient PGI hacks with recent versions of that compiler
...
should fix #2139
2019-05-22 13:48:27 +02:00
Martin Kroeker
969dd6175e
Merge pull request #2136 from martin-frbg/issue2126
...
Add option to allow combining USE_THREAD=0 with thread locking support
2019-05-16 12:08:16 +02:00
Martin Kroeker
d8d5682481
Merge pull request #2134 from tylerjereddy/skylake_regress_guard_may14
...
TST: add SkylakeX AVX512 CI test
2019-05-15 23:40:06 +02:00
Martin Kroeker
f66c11fc22
Remove unrelated change
2019-05-15 23:38:12 +02:00
Martin Kroeker
5ecffc28f2
Add option USE_LOCKING but keep default settings intact
2019-05-15 23:36:17 +02:00
Martin Kroeker
86dda5c2fa
Add option USE_LOCKING for SMP-like locking in USE_THREAD=0 builds
2019-05-15 23:21:20 +02:00
Martin Kroeker
1e52572be3
Add option USE_LOCKING for single-threaded build with locking support
2019-05-15 23:19:30 +02:00
Martin Kroeker
d2cb610272
Add option USE_LOCKING for single-threaded build with locking support
...
for calling from concurrent threads
2019-05-15 23:18:43 +02:00
Tyler Reddy
a211bc9b6a
TST: add SkylakeX AVX512 CI test
...
* adapt the C-level reproducer code for some
recent SkylakeX AVX512 kernel issues, provided
by Isuru Fernando and modified by Martin Kroeker,
for usage in the utest suite
* add an Intel SDE SkylakeX emulation utest run to
the Azure CI matrix; a custom Docker build was required
because Ubuntu image provided by Azure does not support
AVX512VL instructions
2019-05-14 11:32:23 -07:00
Martin Kroeker
9208ab8603
Merge pull request #2130 from isuruf/drone
...
Drone CI for arm64 native builds
2019-05-14 09:37:00 +02:00
Isuru Fernando
b43deb4ad6
Fix typo
2019-05-12 15:26:18 -05:00
Isuru Fernando
b911525c81
arm32 build
2019-05-12 15:21:43 -05:00
Isuru Fernando
7ff44e0016
Remove qemu armv8 builds
2019-05-12 15:09:53 -05:00
Isuru Fernando
e3cb8ad2d6
See if ubuntu 19.04 fixes the ICE
2019-05-12 14:28:48 -05:00
Isuru Fernando
7aa6faad5f
parallel build
2019-05-12 14:22:36 -05:00
Isuru Fernando
3d94ab660f
build without lapack on cmake
2019-05-12 14:17:12 -05:00
Isuru Fernando
cd99dfe034
Add cmake builds and print options
2019-05-12 14:10:10 -05:00
Isuru Fernando
dadafcdcd8
Add a cmake build as well
2019-05-12 14:10:10 -05:00
Isuru Fernando
d40c109eb0
no need of gcc in clang build
2019-05-12 14:10:10 -05:00