OpenBLAS/driver
Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)

This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".

Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
2018-06-03 07:58:52 +00:00
..
level2 Merge pull request #1403 from brada4/develop 2017-12-30 14:51:34 +01:00
level3 Work around name clash with Windows10's winnt.h 2018-05-31 13:26:00 +02:00
mapper Remove all trailing whitespace except lapack-netlib 2014-06-27 12:05:18 -07:00
others Initial support for SkylakeX / AVX512 2018-06-03 07:58:52 +00:00