OpenBLAS/kernel at 1938819c25d7dd4ba995900797f5123e4cfd6fa4 - OpenBLAS - Trustie: Git with trustie

floraachy/OpenBLAS

Files

History

Arjan van de Ven 1938819c25 skylake dgemm: Add a 16x8 kernel

The next step for the avx512 dgemm code is adding a 16x8 kernel.
In the 8x8 kernel, each FMA has a matching load (the broadcast);
in the 16x8 kernel we can reuse this load for 2 FMAs, which
in turn reduces pressure on the load ports of the CPU and gives
a nice performance boost (in the 25% range).

2018-10-05 13:11:35 +00:00

..

Remove all trailing whitespace except lapack-netlib

2014-06-27 12:05:18 -07:00

Convert fldmia/fstmia instructions to UAL syntax for clang7

2018-09-28 23:05:15 +02:00

Conditional compilation of assembly files that IOS does not like

2018-09-04 11:06:51 +02:00

fix small typo

2018-09-09 16:52:25 +02:00

Remove all trailing whitespace except lapack-netlib

2014-06-27 12:05:18 -07:00

Merge pull request #1565 from martin-frbg/mipstypo

2018-05-17 20:22:58 +02:00

test_axpy work error on LOONGSON3A platform #1777

2018-09-26 15:14:04 +08:00

Use the new zrot.c on POWER8 for crot as well

2018-05-23 22:54:39 +02:00

Remove all trailing whitespace except lapack-netlib

2014-06-27 12:05:18 -07:00

Typo fix (misplaced parenthesis)

2018-06-03 13:22:59 +02:00

skylake dgemm: Add a 16x8 kernel

2018-10-05 13:11:35 +00:00

Merge pull request #1499 from quickwritereader/develop

2018-03-27 21:43:23 +02:00

CMakeLists.txt

Initial support for SkylakeX / AVX512

2018-06-03 07:58:52 +00:00

Makefile

MIPS n32 ABI and build time mips simd support check

2016-08-10 17:44:22 +05:30

Makefile.L1

Remove duplicate -D args in kernel/Makefile.L1

2015-11-09 14:15:48 +05:30

Makefile.L2

Remove all trailing whitespace except lapack-netlib

2014-06-27 12:05:18 -07:00

Makefile.L3

Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic

2018-08-28 21:34:07 +02:00

Makefile.LA

Support NO_LAPACK=1 to build the lib without LAPACK functions.

2011-03-04 11:51:32 +08:00

setparam-ref.c

Return a somewhat sane default value for L2 cache size if cpuid retur… (#1611 )

2018-06-11 13:26:19 +02:00