Martin Kroeker
ebe50458f3
Do not add a -tp to the flags of the nvc compiler if there is one already in CFLAGS
2023-02-09 09:29:27 +01:00
Martin Kroeker
3e64fa72c4
Settings from Makefile(_kernel).conf should be available to DYNAMIC_ARCH kernel builds
2022-12-29 23:05:22 +01:00
Martin Kroeker
ca3b5ae3f0
Pass NO_SVE if set
2022-12-25 12:19:20 +01:00
Martin Kroeker
d16261fbc6
SVE-enabled targets in ARM64 DYNAMIC_ARCH require a recent compiler
2022-12-25 10:19:02 +01:00
Martin Kroeker
57151b97aa
Fix INTERFACE64 builds on riscv and loongarch
2022-12-15 18:52:46 +01:00
Martin Kroeker
62341ac5e1
Fix missing parenthesis
2022-12-15 12:30:16 +01:00
Martin Kroeker
5a294b0c8a
Add -lm on any arm/arm64 BSD, not just FreeBSD
2022-12-15 10:35:47 +01:00
Martin Kroeker
ea6c5f3cf5
Add option RELAPACK_REPLACE
2022-10-30 12:55:23 +01:00
Martin Kroeker
bd30120ba7
Merge pull request #3720 from FlyGoat/mips64
...
Make it work on general MIPS64 processors
2022-08-19 20:24:27 +02:00
Jiaxun Yang
fae9368f14
Implement DYNAMIC_LIST for MIPS64
...
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-12 13:13:31 +01:00
Jiaxun Yang
a50b29c540
Provide a fallback MIPS64_GENERIC target
...
It is really dangerous to fallback to Loongson core on other
MIPS64 processors.
Signed-off-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
2022-08-12 13:13:28 +01:00
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers ( #3712 )
...
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
d0ba257de0
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
...
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 20:31:20 +02:00
Rajalakshmi Srinivasaraghavan
1d97405c02
POWER: Enable bfloat16 kernels by default
...
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
gxw
fbfe1daf6e
LoongArch64: Add DYNAMIC_ARCH support
2022-07-28 14:28:45 +08:00
gxw
3573306a69
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
2022-07-25 16:04:56 +08:00
Martin Kroeker
407a1a242c
Merge pull request #3670 from martin-frbg/osxvermin
...
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
2022-06-29 08:31:04 +02:00
Martin Kroeker
be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel
...
POWER: fix issues with the small matrix kernel
2022-06-28 16:09:36 +02:00
Martin Kroeker
914c4d0fe8
Add C versions of the CBLAS test sources ( #3656 )
...
* Add C conversions of the CBLAS tests for NOFORTRAN=1 builds
* Enable CTEST without Fortran and fix passing of BUILD_vartype options to exports/gensymbol
2022-06-28 11:52:48 +02:00
Martin Kroeker
2857987ff6
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
2022-06-28 11:46:25 +02:00
VFerrari
2062280c6f
Power: Enable SMALL_MATRIX OPT as default for dynamic arch
2022-06-25 03:47:03 -03:00
Martin Kroeker
8f13ab94d2
Merge pull request #3613 from Rabenda/fix-riscv
...
Fix riscv64 detect
2022-05-04 07:22:47 +02:00
Martin Kroeker
24e99eca31
Avoid adding -lgfortran with NOFORTRAN
2022-04-27 20:31:42 +02:00
Han Gao
3fc52ebcfb
Fix other arch build in detect.
...
When CORE is empty, use -march=loongson3a. Fix it.
Signed-off-by: Han Gao <gaohan@uniontech.com>
2022-04-27 01:34:55 +08:00
Niyas Sait
3f5d145cd4
build: minor fixes to build on windows with make
...
This patch contains following fixes
1. Fix to build without PIC flag
2. Define LAPACK_COMPLEX_STRUCTURE for windows. Builds are failing
without it and changes are consistent with the CMake rules defined
in system.cmake (line 576)
2022-04-25 00:01:12 +01:00
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available ( #3539 )
...
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
2022-04-09 22:38:58 +02:00
Martin Kroeker
499ae5e8f7
Merge pull request #3510 from martin-frbg/issue3505
...
Fix recent SkylakeX/DYNAMIC_ARCH DGEMM breakage
2022-01-09 14:50:51 +01:00
Martin Kroeker
f1ac59f200
Forward DYNAMIC_ARCH option to Makefile.prebuild
2022-01-08 23:48:58 +01:00
Sunita Nadampalli
19c8f615dc
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
2022-01-07 00:28:17 +00:00
Martin Kroeker
ed430cd963
Update -tp option for recent nvfortran on x86_64
2021-12-18 21:56:26 +01:00
kavanabhat
eee3381cbe
Fallback for Power kernels
2021-12-08 03:52:23 -06:00
Martin Kroeker
54d321d742
Merge pull request #3466 from rafaelcfsousa/rafael/small_matrix_p10
...
[POWER] Add small matrix for sgemm/dgemm on Power10
2021-12-03 12:12:20 +01:00
kavanabhat
9a45b5123f
Update Makefile.system
2021-12-02 13:29:38 +05:30
kavanabhat
7b5b93037d
Fix truncated assembler checks
2021-12-01 19:30:40 +05:30
Rafael Cardoso Fernandes Sousa
c78fdcc80d
[POWER] Add support for SMALL_MATRIX_OPT
2021-11-28 12:41:16 -06:00
Martin Kroeker
46947efb83
Ignore compiler support for MIPS MSA if the cpu lacks this capability
2021-11-13 23:32:26 +01:00
Martin Kroeker
9cc0098ce2
Fix potentially wrong HOSTARCH definition in cross-compilation
2021-11-10 22:27:14 +01:00
Martin Kroeker
a6fd497820
Fix nvidia HPC version checks
2021-10-30 17:31:19 +02:00
Martin Kroeker
bb01e26cfe
Adjust compiler options for nvidia hpc 21.9 (and fix a long-standing typo in dynamic_arch settings)
2021-10-29 16:39:03 +02:00
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
2021-10-12 01:30:40 -07:00
Martin Kroeker
8e4c209002
Merge pull request #3398 from kavanabhat/aix_p10_gnuas
...
Big Endian Changes for Power10 kernels
2021-10-05 18:59:47 +02:00
Martin Kroeker
04f3ecd026
Fix minor typo
2021-10-04 16:14:32 +02:00
kavanabhat
9cc95e5657
AIX changes for P10 with GNU Compiler
2021-10-01 05:18:35 -05:00
Alexandru Ardelean
b7bb2e36b8
Makefile.system: adjust mipsel/mips64el ARCH variables
...
When building for MIPS{64} little-endian variants, the included makefiles
should be the same as for the big-endian.
There are already some adjustments being done for some ARCH names.
This change adds the ones for the `mipsel` and `mips64el` names, so that
the Makefile.mips{64} files get included.
This comes as a result of: https://github.com/openwrt/packages/issues/16649
Signed-off-by: Alexandru Ardelean <ardeleanalex@gmail.com>
2021-09-26 12:20:16 +03:00
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
...
If no customized GEMM_SMALL_M_PERMIT kernel defined, it will just by pass to normal path.
2021-08-05 02:59:36 +00:00
Xianyi Zhang
0a2077901c
Add small marix optimization kernel interface.
...
make SMALL_MATRIX_OPT=1
2021-08-02 07:01:47 +00:00
gxw
34207bdf5b
Fixed typos about LOONGARCH64
2021-07-30 18:11:12 +08:00
gxw
af0a69f355
Add support for LOONGARCH64
2021-07-27 15:29:12 +08:00
User User-User
9335d42740
add gcc8 version matching
2021-06-19 22:21:39 +02:00
User User-User
b7da75e4fd
WiP CORTEX A55 support
2021-06-19 21:37:51 +02:00