OpenBLAS/kernel
Martin Kroeker 673e5a0495
Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263)
* Add gcc7-generated assembly files for POWER8/9 isa/ica-min/max and POWER9 caxpy

To work around internal compiler errors encountered when compiling the original C source with gcc 4 and 5, and wrong code generated by gcc 8.3.0

* Use gcc-generated assembly instead of original C sources

to work around internal compiler errors encountered with gcc 4.8/5.4 and wrong code generation by gcc 8.3

* Use gcc-generated assembly instead of the original C source

to work around internal compiler errors encountered with gcc 4.8 and 5.4, and wrong code generation by gcc 8.3

* Add gcc7-generated assembler version of caxpy for power8

to work around wrong code generated by gcc 8.3

* Handle CONJ define for caxpyc

* Handle CONJ define for caxpyc

* Add gcc7-generated assembly cdot for POWER9

* Use prebuilt assembly for POWER9 cdot

created with gcc 7.3.1 to work around ICE in older gcc versions

* Exclude POWER9 from DYNAMIC_ARCH when gcc versions is lower than 6

* Update Makefile.system

* Use PROLOGUE macro to ensure correct function name for DYNAMIC_ARCH

* Disable POWER9 with old gcc versions
2019-09-22 22:35:22 +02:00
..
alpha Add implementations of ssum/dsum and csum/zsum 2019-03-30 22:05:11 +01:00
arm Use generic kernels for complex (I)AMAX to support softfp 2019-05-30 11:38:11 +02:00
arm64 Add ARM64 implementations of ?sum 2019-03-30 22:13:36 +01:00
generic fix small typo 2018-09-09 16:52:25 +02:00
ia64 Add ia64 implementation of ?sum 2019-03-30 22:18:03 +01:00
mips Add MIPS implementation of ?sum 2019-03-30 22:20:14 +01:00
mips64 Add MIPS64 implementation of ?sum 2019-03-30 22:22:15 +01:00
power Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 2019-09-22 22:35:22 +02:00
sparc Add SPARC implementation of ?sum 2019-03-30 22:25:06 +01:00
x86 Fix unwanted case-sensitivity in x86 LSAME for (AMD) processors without CMOV 2019-08-13 10:19:10 +02:00
x86_64 Keep both PGI/SUN and default code paths to avoid breaking Clang/WIndows 2019-08-28 18:07:44 +02:00
zarch Add ZARCH implementation of ?sum 2019-03-30 22:49:05 +01:00
CMakeLists.txt Add ?sum 2019-03-30 22:01:13 +01:00
Makefile Update Makefile 2018-12-16 18:14:40 +01:00
Makefile.L1 Add ?sum 2019-03-30 22:01:13 +01:00
Makefile.L2 Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
Makefile.L3 Revert #2051 and replace with a better fix (#2261) 2019-09-17 18:56:04 +02:00
Makefile.LA Support NO_LAPACK=1 to build the lib without LAPACK functions. 2011-03-04 11:51:32 +08:00
setparam-ref.c Merge pull request #2072 from martin-frbg/sum 2019-04-23 20:11:36 +02:00