Update with changes from 0.3.7
This commit is contained in:
parent
02d9203981
commit
303869f572
|
@ -1,4 +1,46 @@
|
||||||
OpenBLAS ChangeLog
|
OpenBLAS ChangeLog
|
||||||
|
====================================================================
|
||||||
|
Version 0.3.7
|
||||||
|
11-Aug 2019
|
||||||
|
|
||||||
|
common:
|
||||||
|
* having the gmake special variables TARGET_ARCH or TARGET_MACH
|
||||||
|
defined no longer causes build failures in ctest or utest
|
||||||
|
* defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer
|
||||||
|
has the same effect as setting them to 1
|
||||||
|
* a new test program was added to allow checking the library for
|
||||||
|
thread safety
|
||||||
|
* a new option USE_LOCKING was added to ensure thread safety when
|
||||||
|
OpenBLAS itself is built without multithreading but will be
|
||||||
|
called from multiple threads.
|
||||||
|
* a build failure on Linux with glibc versions earlier than 2.5
|
||||||
|
was fixed
|
||||||
|
* a runtime error with CPU enumeration (and NO_AFFINITY not set)
|
||||||
|
on glibc 2.6 was fixed
|
||||||
|
* NO_AFFINITY was added to the CMAKE options (and defaults to being
|
||||||
|
active on Linux, as in the gmake builds)
|
||||||
|
|
||||||
|
x86_64:
|
||||||
|
* the build-time logic for detection of AVX512 availability in
|
||||||
|
the processor and compiler was fixed
|
||||||
|
* gmake builds on OSX now set the internal name of the library to
|
||||||
|
libopenblas.0.dylib (consistent with CMAKE)
|
||||||
|
* the Haswell DGEMM kernel received a significant speedup through
|
||||||
|
improved prefetch and load instructions
|
||||||
|
* performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly
|
||||||
|
increased by avoiding vpermpd instructions
|
||||||
|
* the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled
|
||||||
|
to fix remaining errors in DGEMM, DSYMM and DTRMM
|
||||||
|
|
||||||
|
## POWER:
|
||||||
|
* added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970
|
||||||
|
* added optimized kernels for POWER9 SGEMM and STRMM
|
||||||
|
|
||||||
|
## ARMV7:
|
||||||
|
* fixed the softfp implementations of xAMAX and IxAMAX
|
||||||
|
* removed the predefined -march= flags on both ARMV5 and ARMV6 as
|
||||||
|
they were appropriate for only a subset of platforms
|
||||||
|
|
||||||
====================================================================
|
====================================================================
|
||||||
Version 0.3.6
|
Version 0.3.6
|
||||||
29-Apr-2019
|
29-Apr-2019
|
||||||
|
|
Loading…
Reference in New Issue