Martin Kroeker
|
97d6eb97b1
|
Report availability of C11 support
|
2020-07-18 16:59:33 +00:00 |
Martin Kroeker
|
4afd11dae5
|
Add a check for C11 atomics and stdatomic.h
|
2020-07-18 16:57:41 +00:00 |
Martin Kroeker
|
72ec6280c7
|
Merge pull request #2724 from martin-frbg/loongsonreadme
Update cross-compiling example in README to reflect change in Loongson gcc
|
2020-07-18 18:08:40 +02:00 |
Martin Kroeker
|
26b7f24d16
|
Update cross-compiling example to reflect change in Loongson gcc
for #2723
|
2020-07-18 12:51:37 +00:00 |
Martin Kroeker
|
0db4218fed
|
Merge pull request #2722 from martin-frbg/cmakefcheck
Handle lack of fortran compiler more gracefully in cmake
|
2020-07-17 10:33:03 +02:00 |
Martin Kroeker
|
9d000ecaa2
|
include CheckLanguage module
|
2020-07-16 22:36:35 +00:00 |
Martin Kroeker
|
a847d00366
|
handle lack of fortran compiler more gracefully
|
2020-07-16 22:17:39 +00:00 |
Martin Kroeker
|
0033f8be0d
|
Use vec_vsx_ld/st to fix misaligned accesses flagged by asan
|
2020-07-16 23:32:54 +02:00 |
Martin Kroeker
|
f308e741b2
|
remove debug output and revert changes to cdot and crot
|
2020-07-15 10:00:07 +02:00 |
Martin Kroeker
|
4f5d26bb02
|
Merge pull request #2716 from RajalakshmiSR/p10_ldflag
Add new linker option for POWER10
|
2020-07-15 01:20:54 +02:00 |
Rajalakshmi Srinivasaraghavan
|
417c4e8af8
|
Add new linker option for POWER10
While building with DYNAMIC_ARCH on POWER9 with a POWER10-aware
toolchain, a new LDFLAG is needed to avoid POWER10
instructions on PLT calls.
|
2020-07-14 11:54:04 -05:00 |
Martin Kroeker
|
da17abec87
|
fix trailing whitespace
|
2020-07-14 18:20:03 +02:00 |
Martin Kroeker
|
f8c2697701
|
Use POWER6 GEMM, TRMM and DTRSM on 32bit POWER8
|
2020-07-14 18:11:19 +02:00 |
Martin Kroeker
|
b144423f0f
|
Do not define USE_TRMM for 32bit POWER8
|
2020-07-14 18:10:12 +02:00 |
Martin Kroeker
|
bd2498c886
|
Use POWER6 GEMM parameters on 32bit POWER8
|
2020-07-14 18:07:58 +02:00 |
Martin Kroeker
|
d8e2edfc20
|
Merge pull request #71 from xianyi/develop
rebase
|
2020-07-14 18:01:34 +02:00 |
Martin Kroeker
|
419b8686d1
|
Merge pull request #2682 from martin-frbg/aix
[WIP] fix compilation on AIX
|
2020-07-13 14:43:24 +02:00 |
Martin Kroeker
|
3ab15ff34c
|
Merge pull request #2651 from leezu/actionsflang
Add flang build to GitHub Actions
|
2020-07-13 13:00:39 +02:00 |
Martin Kroeker
|
8916c4ae2c
|
Merge branch 'develop' into actionsflang
|
2020-07-12 20:37:29 +02:00 |
Martin Kroeker
|
4fa283de66
|
Merge pull request #2706 from jussienko/use-always-omp-threads
Fix OpenMP builds defaulting to singlethreading with OMP_PLACES or OMP_PROC_BIND set
|
2020-07-12 20:17:11 +02:00 |
Martin Kroeker
|
5865c7d4d6
|
Make 32bit POWER8 use POWER6 kernels for now
|
2020-07-12 18:59:01 +02:00 |
Martin Kroeker
|
ae3a90f78f
|
merge overwritten part of power10 support
|
2020-07-12 18:51:58 +02:00 |
Martin Kroeker
|
009864edde
|
Merge pull request #2710 from martin-frbg/cmake-lapacktest
Add LAPACK-TESTING to the cmake build
|
2020-07-10 12:06:50 +02:00 |
Martin Kroeker
|
3de80b3f5a
|
Merge pull request #2713 from RajalakshmiSR/p10-gcc10
Change minimum gcc version for POWER10
|
2020-07-10 10:43:33 +02:00 |
Rajalakshmi Srinivasaraghavan
|
af1e140e35
|
Change minimum gcc version for POWER10
As the MMA patches for POWER10 have been backported to gcc 10.2, change
the minimum gcc version needed to build OpenBLAS for POWER10.
|
2020-07-09 21:46:06 -05:00 |
Martin Kroeker
|
d4a0299e16
|
Do not build lapack-test on MSVC for now (same as with BLAS test)
|
2020-07-09 13:57:27 +02:00 |
Martin Kroeker
|
f766024749
|
enable fortran for cmake
|
2020-07-09 13:44:25 +02:00 |
Martin Kroeker
|
c502760bef
|
Modify for building with OpenBLAS
|
2020-07-09 13:13:16 +02:00 |
Martin Kroeker
|
29b5887d5f
|
Modify for building with OpenBLAS
|
2020-07-09 13:12:35 +02:00 |
Martin Kroeker
|
60188a8c82
|
Append crude hack for enabling lapack tests in the OpenBLAS build
|
2020-07-09 11:44:31 +02:00 |
Martin Kroeker
|
1d63631afe
|
Add lapack-test
|
2020-07-09 11:42:02 +02:00 |
Martin Kroeker
|
e82bb953a7
|
Merge pull request #2708 from RajalakshmiSR/p10_future
Changing mcpu option as power10
|
2020-07-08 12:26:44 +02:00 |
Martin Kroeker
|
ed7e155c35
|
Merge branch 'develop' into aix
|
2020-07-07 18:52:06 +02:00 |
Rajalakshmi Srinivasaraghavan
|
45d819ca82
|
Changing mcpu option as power10
The compiler now accepts power10 as the mcpu option, so change it from future.
|
2020-07-07 11:25:20 -05:00 |
Martin Kroeker
|
8751a69271
|
Obtain actual cpu count on AIX and suppress spurious NO_AVX512 on non-x86
|
2020-07-07 15:46:32 +02:00 |
Jussi Enkovaara
|
10a2923f64
|
fixes #2238
Always obey omp_get_max_threads() when built with USE_OPENMP
|
2020-07-07 13:35:43 +03:00 |
Martin Kroeker
|
5ff83a4261
|
Merge pull request #2670 from mhillenibm/dumpfullversion_on_gcc7
RFC: Use -dumpfullversion to get minor version on gcc-7 and newer
|
2020-07-07 00:12:28 +02:00 |
Martin Kroeker
|
5bc9680a86
|
Merge pull request #2703 from martin-frbg/issue2702
Compatibility fix for gcc < 4.7
|
2020-07-02 22:32:51 +02:00 |
Martin Kroeker
|
4ab3651591
|
Option -mavx2 requires at least gcc 4.7
|
2020-07-02 17:00:15 +02:00 |
Martin Kroeker
|
a83680b40b
|
Merge pull request #69 from xianyi/develop
rebase
|
2020-07-02 16:56:00 +02:00 |
Martin Kroeker
|
c3aa036e99
|
Merge pull request #2693 from EGuesnet/AIX-build-on-POWER8-32bits
AIX build on POWER8 32bits
|
2020-07-01 08:29:52 +02:00 |
EGuesnet
|
634e1305f9
|
Update cgemm_kernel_8x4_power8.S
|
2020-06-30 15:16:39 +02:00 |
Martin Kroeker
|
c467516132
|
Merge pull request #2688 from martin-frbg/cometlake
Add autodetection of Intel Comet Lake H and S models
|
2020-06-27 17:47:24 +02:00 |
Martin Kroeker
|
83f4746825
|
Add support for Comet Lake H and S
|
2020-06-27 14:41:24 +02:00 |
Martin Kroeker
|
584ef8d4ae
|
Add support for Comet Lake H & S
|
2020-06-27 14:36:37 +02:00 |
Martin Kroeker
|
8dfda02e89
|
Merge pull request #68 from xianyi/develop
rebase
|
2020-06-27 14:29:29 +02:00 |
Martin Kroeker
|
28d69e0097
|
Merge pull request #2687 from martin-frbg/utfbom
Strip UTF8 byte order marker from source files
|
2020-06-26 22:53:09 +02:00 |
Martin Kroeker
|
c2467c9619
|
Merge pull request #2686 from RajalakshmiSR/p10_shgemm
powerpc: Optimized SHGEMM kernel for POWER10
|
2020-06-26 22:52:45 +02:00 |
Martin Kroeker
|
f86e749df4
|
Merge pull request #2683 from mtreinish/add-comet-lake-support
Add cpu detection support for comet lake U
|
2020-06-26 12:11:03 +02:00 |
Martin Kroeker
|
d199c2787d
|
Merge pull request #2680 from kavanabhat/aix_makefile_fix
Fix for #2671
|
2020-06-26 11:27:28 +02:00 |