Martin Kroeker
321288597c
Merge pull request #2212 from martin-frbg/nofort-nolib
...
Avoid spurious dependency on the fortran runtime despite NOFORTRAN=1
2019-08-11 20:26:34 +02:00
Martin Kroeker
be147a9f28
Avoid adding a spurious dependency on the fortran runtime despite NOFORTRAN=1
...
for cases where a fortran compiler is present but not wanted (e.g. not fully functional)
2019-08-11 16:24:39 +02:00
Martin Kroeker
b89c781637
Fix surprising behaviour of NO_AFFINITY=0
2019-07-07 16:04:45 +02:00
Martin Kroeker
abea977ded
Merge pull request #2162 from martin-frbg/pgi
...
Fixes for PGI compiler
2019-07-03 19:16:30 +02:00
Martin Kroeker
7684c4f8f8
PGI compiler does not like -march=native
2019-06-20 19:56:01 +02:00
Martin Kroeker
4ea794a522
Avoid unintentional activation of TLS code via USE_TLS=0
...
fixes #2149
2019-06-10 17:24:15 +02:00
Martin Kroeker
0ab4076dd8
Merge pull request #2156 from martin-frbg/issue2154
...
Add gfortran workaround for C->FORTRAN ABI violation
2019-06-06 13:43:12 +02:00
Martin Kroeker
900d5a3205
Add gfortran workaround for ABI violations in LAPACKE
...
for #2154 (see gcc bug 90329)
2019-06-06 10:18:40 +02:00
Michael Lass
7a9a4dbc4f
Fix detection of AVX512 capable compilers in getarch
...
21eda8b5
introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
2019-06-05 17:30:56 +02:00
Martin Kroeker
f66c11fc22
Remove unrelated change
2019-05-15 23:38:12 +02:00
Martin Kroeker
d2cb610272
Add option USE_LOCKING for single-threaded build with locking support
...
for calling from concurrent threads
2019-05-15 23:18:43 +02:00
Rashmica Gupta
bcdf1d4917
Add in runtime CPU detection for POWER.
2019-04-09 14:20:16 +10:00
Martin Kroeker
c4868d11c0
Make sure that AVX512 is disabled in 32bit builds
...
for #2033
2019-03-01 09:23:03 +01:00
Martin Kroeker
d66214c946
Make x86_32 imply NO_AVX2, NO_AVX512 in addition to NO_AVX
...
fixes #2033
2019-02-28 09:58:25 +01:00
Martin Kroeker
36b844af88
Change ARMV8 target to ARMV7 when BINARY32 is set
...
fixes #1961
2019-01-26 17:47:22 +01:00
TiborGY
d11554c88f
Validate user supplied TARGET ( #1941 )
...
the build will now abort with an error message when an undefined build TARGET is named
Fixes #1938
2018-12-31 23:19:44 +01:00
Martin Kroeker
5bd21ab6e1
Make sure that -fPIC is present when needed
...
override user-provided FFLAGS if necessary
2018-12-23 23:46:48 +01:00
Martin Kroeker
c5f8aeff2d
Merge branch 'develop' into fbsd12
2018-12-03 12:50:14 +01:00
Martin Kroeker
ea6d1b96bd
Update Makefile.system
2018-12-03 08:59:10 +01:00
Martin Kroeker
3c9e3faedb
fixup BSD naming of powerpc arch
2018-12-02 23:24:53 +01:00
Andrew
44c81fd135
oops
2018-12-02 20:27:53 +01:00
Andrew
26b3710485
Add architecture mappings for FreeBSD12
2018-12-02 12:07:41 +01:00
pkubaj
f85ce54d4a
Use correct Makefile on powerpc64
...
FreeBSD uses powerpc64 name for POWER architecture. Use correct Makefile for this platform.
2018-11-30 16:05:49 +00:00
Martin Kroeker
a29ec458c2
propagate verison number for openblas_config_version
2018-11-29 00:10:49 +01:00
Martin Kroeker
b0c15bacc1
Merge pull request #1863 from martin-frbg/aix_install3
...
Set LIBSONAME suffix to .a for AIX
2018-11-09 13:12:06 +01:00
Martin Kroeker
cfb0f5b0f8
Set LIBSONAME suffix to .a for AIX
...
another fix for #1803
2018-11-08 22:39:10 +01:00
Martin Kroeker
5f8f0583d4
Merge branch 'develop' into fc-1847
2018-11-07 08:47:52 +01:00
Andrew
7d3502b500
Add -frecursive gfortran option by default
2018-11-06 08:20:55 +00:00
Ashwin Sekhar T K
d5aeff636f
ARM64: Enable DYNAMIC_ARCH
...
Enable DYNAMIC_ARCH feature on ARM64. This patch uses the cpuid
feature in linux kernel to detect the core type at runtime
(https://www.kernel.org/doc/Documentation/arm64/cpu-feature-registers.txt ).
If this feature is missing in kernel, then the user should use the
OPENBLAS_CORETYPE env variable to select the desired core type.
2018-10-22 01:49:35 -07:00
Martin Kroeker
c0d7cd3dac
Merge pull request #1799 from martin-frbg/issue1796
...
Handle conflicting usage of ARCH in at least some BSD environments
2018-10-09 08:20:52 +02:00
Martin Kroeker
d3d58f8ee5
Catch conflicting usage of ARCH in at least some BSD environments
...
fixes #1796
2018-10-08 22:29:35 +02:00
Martin Kroeker
2caa2210bb
Add USE_TLS option to choose between old and new implementation of memory.c
2018-08-25 19:37:11 +02:00
Martin Kroeker
0b2b83d9ed
Add support for a user-defined list of dynamic targets
2018-06-23 19:41:32 +02:00
Martin Kroeker
6c2d90ba77
Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
2018-06-09 16:29:17 +02:00
Martin Kroeker
f6021c798d
Re-enable QUIET_MAKE
2018-06-05 19:09:38 +02:00
Martin Kroeker
e8002536ec
disable quiet_make for the moment
2018-06-05 18:23:01 +02:00
Martin Kroeker
15a78d6b66
export NO_AVX512 setting
2018-06-05 15:58:34 +02:00
Martin Kroeker
b7feded85a
Propagate NO_AVX512 via CCOMMON_OPT
2018-06-05 10:24:05 +02:00
Martin Kroeker
a7d0f49cec
Add SKYLAKEX to DYNAMIC_CORE list only if AVX512 is available
2018-06-03 23:13:25 +02:00
Martin Kroeker
f1fb9a4745
Propagate NO_AVX512 if needed
2018-06-03 13:48:27 +02:00
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
...
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
2018-06-03 07:58:52 +00:00
Zhang Xianyi
50acc40613
Merge pull request #1536 from WestAlgo/develop
...
Fix race condition in blas_server_omp.c
2018-05-11 10:09:14 +08:00
Martin Kroeker
12398e53ce
Merge pull request #1553 from martin-frbg/ifort-openmpflag
...
Change -openmp to -fopenmp for ifort entry as well
2018-05-09 14:39:52 +02:00
Martin Kroeker
193f835662
Change -openmp to -fopenmp for ifort entry as well
2018-05-09 12:34:09 +02:00
Martin Kroeker
e3a069f108
Merge pull request #1550 from martin-frbg/ifort-openmpflag
...
Update compiler flag for openmp use with ICC
2018-05-09 09:02:38 +02:00
Martin Kroeker
65b8a5c5d8
Update compiler flag for openmp use with ICC
...
The deprecated -openmp option was finally removed in favor of -qopenmp or -fopenmp, picking the latter to stay compatible with Intel compiler versions before 2015 (when -q options were introduced). Fixes #1546
2018-05-08 21:47:10 +02:00
Martin Kroeker
71051259e0
Restore compiler options for mips P5600 target
2018-05-02 20:37:06 +02:00
Martin Kroeker
d94d7baf7e
Add mips32r2 api target
2018-05-02 20:17:26 +02:00
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
...
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
2018-04-27 17:00:42 +08:00
Alex Arslan
8f811a9312
Reinstate macOS logic
2018-04-04 11:41:45 -07:00