Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers ( #3712 )
...
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
d0ba257de0
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
...
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 20:31:20 +02:00
Rajalakshmi Srinivasaraghavan
1d97405c02
POWER: Enable bfloat16 kernels by default
...
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
gxw
fbfe1daf6e
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 14:28:45 +08:00
gxw
3573306a69
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
2022-07-25 16:04:56 +08:00
Martin Kroeker
407a1a242c
Merge pull request #3670 from martin-frbg/osxvermin
...
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
2022-06-29 08:31:04 +02:00
Martin Kroeker
be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel
...
POWER: fix issues with the small matrix kernel
2022-06-28 16:09:36 +02:00
Martin Kroeker
914c4d0fe8
Add C versions of the CBLAS test sources ( #3656 )
...
* Add C conversions of the CBLAS tests for NOFORTRAN=1 builds
* Enable CTEST without Fortran and fix passing of BUILD_vartype options to exports/gensymbol
2022-06-28 11:52:48 +02:00
Martin Kroeker
2857987ff6
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
2022-06-28 11:46:25 +02:00
VFerrari
2062280c6f
Power: Enable SMALL_MATRIX_OPT as default for dynamic arch
2022-06-25 03:47:03 -03:00
Martin Kroeker
8f13ab94d2
Merge pull request #3613 from Rabenda/fix-riscv
...
Fix riscv64 detect
2022-05-04 07:22:47 +02:00
Martin Kroeker
24e99eca31
Avoid adding -lgfortran with NOFORTRAN
2022-04-27 20:31:42 +02:00
Han Gao
3fc52ebcfb
Fix other arch build in detect.
...
When CORE is empty, -march=loongson3a was being used. Fix it.
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-04-27 01:34:55 +08:00
Niyas Sait
3f5d145cd4
build: minor fixes to build on windows with make
...
This patch contains the following fixes:
1. Fix to build without PIC flag
2. Define LAPACK_COMPLEX_STRUCTURE for windows. Builds are failing
without it and changes are consistent with the CMake rules defined
in system.cmake (line 576)
2022-04-25 00:01:12 +01:00
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available ( #3539 )
...
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
2022-04-09 22:38:58 +02:00
Martin Kroeker
499ae5e8f7
Merge pull request #3510 from martin-frbg/issue3505
...
Fix recent SkylakeX/DYNAMIC_ARCH DGEMM breakage
2022-01-09 14:50:51 +01:00
Martin Kroeker
f1ac59f200
Forward DYNAMIC_ARCH option to Makefile.prebuild
2022-01-08 23:48:58 +01:00
Sunita Nadampalli
19c8f615dc
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
2022-01-07 00:28:17 +00:00
Martin Kroeker
ed430cd963
Update -tp option for recent nvfortran on x86_64
2021-12-18 21:56:26 +01:00
kavanabhat
eee3381cbe
Fallback for Power kernels
2021-12-08 03:52:23 -06:00
Martin Kroeker
54d321d742
Merge pull request #3466 from rafaelcfsousa/rafael/small_matrix_p10
...
[POWER] Add small matrix for sgemm/dgemm on Power10
2021-12-03 12:12:20 +01:00
kavanabhat
9a45b5123f
Update Makefile.system
2021-12-02 13:29:38 +05:30
kavanabhat
7b5b93037d
Fix truncated assembler checks
2021-12-01 19:30:40 +05:30
Rafael Cardoso Fernandes Sousa
c78fdcc80d
[POWER] Add support for SMALL_MATRIX_OPT
2021-11-28 12:41:16 -06:00
Martin Kroeker
46947efb83
Ignore compiler support for MIPS MSA if the cpu lacks this capability
2021-11-13 23:32:26 +01:00
Martin Kroeker
9cc0098ce2
Fix potentially wrong HOSTARCH definition in cross-compilation
2021-11-10 22:27:14 +01:00
Martin Kroeker
a6fd497820
Fix nvidia HPC version checks
2021-10-30 17:31:19 +02:00
Martin Kroeker
bb01e26cfe
Adjust compiler options for nvidia hpc 21.9 (and fix a long-standing typo in dynamic_arch settings)
2021-10-29 16:39:03 +02:00
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
2021-10-12 01:30:40 -07:00
Martin Kroeker
8e4c209002
Merge pull request #3398 from kavanabhat/aix_p10_gnuas
...
Big Endian Changes for Power10 kernels
2021-10-05 18:59:47 +02:00
Martin Kroeker
04f3ecd026
Fix minor typo
2021-10-04 16:14:32 +02:00
kavanabhat
9cc95e5657
AIX changes for P10 with GNU Compiler
2021-10-01 05:18:35 -05:00
Alexandru Ardelean
b7bb2e36b8
Makefile.system: adjust mipsel/mips64el ARCH variables
...
When building for MIPS{64} little-endian variants, the included makefiles
should be the same as for the big-endian.
There are already some adjustments being done for some ARCH names.
This change adds the ones for the `mipsel` and `mips64el` names, so that
the Makefile.mips{64} files get included.
This comes as a result of: https://github.com/openwrt/packages/issues/16649
Signed-off-by: Alexandru Ardelean <ardeleanalex@gmail.com>
2021-09-26 12:20:16 +03:00
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
...
If no customized GEMM_SMALL_M_PERMIT kernel is defined, the call simply bypasses to the normal path.
2021-08-05 02:59:36 +00:00
Xianyi Zhang
0a2077901c
Add small matrix optimization kernel interface.
...
make SMALL_MATRIX_OPT=1
2021-08-02 07:01:47 +00:00
gxw
34207bdf5b
Fixed typos about LOONGARCH64
2021-07-30 18:11:12 +08:00
gxw
af0a69f355
Add support for LOONGARCH64
2021-07-27 15:29:12 +08:00
User User-User
9335d42740
add gcc8 version matching
2021-06-19 22:21:39 +02:00
User User-User
b7da75e4fd
WiP CORTEX A55 support
2021-06-19 21:37:51 +02:00
MikaelUrankar
4fbc0777f4
Fix typo
2021-05-26 12:14:57 +02:00
Martin Kroeker
26ccf643a3
Add -lm for FreeBSD on ARM/ARM64
2021-05-16 13:04:38 +02:00
Martin Kroeker
3c356b1a1f
Support compilation with the NAG Fortran compiler
2021-03-11 11:51:09 +01:00
Martin Kroeker
20f492c298
Fix AMD AOCC compiler detection
2021-03-01 21:00:10 +01:00
Martin Kroeker
9b2d69aa80
Add DYNAMIC_LIST option for ARM64
2021-01-24 23:18:01 +01:00
Martin Kroeker
6bbe6d5b92
Make compile-time BUFFERSIZE setting actually reach the compiler/preprocessor
2021-01-13 22:36:04 +01:00
pkubaj
7aa1ff8ff6
Fix build on FreeBSD/powerpc64le
2021-01-01 21:19:57 +00:00
Martin Kroeker
75b1f3becc
Limit POWERPC DYNAMIC_CORE list to P8 and P9 for NVIDIA compilers
2020-12-19 23:17:40 +01:00
Martin Kroeker
b212a2fb9f
Add/modify "PGI" compiler options for NVIDIA SDK 20.11
2020-12-19 22:08:37 +01:00
Martin Kroeker
18d8a67485
Merge pull request #2994 from antonblanchard/power10-fixes
...
Power10 fixes
2020-12-11 23:37:30 +01:00
gxw
4b548857d6
Add msa support for loongson
...
1. Using core loongson3r3 and loongson3r4 for loongson
2. Add DYNAMIC_ARCH for loongson
Change-Id: I1c6b54dbeca3a0cc31d1222af36a7e9bd6ab54c1
2020-12-09 10:28:46 +08:00