From 8447498b504a3a903507715b3144e8b2a05ec0a0 Mon Sep 17 00:00:00 2001 From: Zhang Xianyi Date: Tue, 27 Oct 2015 15:44:35 -0500 Subject: [PATCH] Update doc for OpenBLAS 0.2.15 version. [CI skipped] --- Changelog.txt | 53 +++++++++++++++++++++++++++++++++++++++++++++++++++ Makefile.rule | 2 +- appveyor.yml | 2 +- 3 files changed, 55 insertions(+), 2 deletions(-) diff --git a/Changelog.txt b/Changelog.txt index 6941a9f96..422b8b519 100644 --- a/Changelog.txt +++ b/Changelog.txt @@ -1,4 +1,57 @@ OpenBLAS ChangeLog +==================================================================== +Version 0.2.15 +27-Oct-2015 +common: + * Support cmake on x86/x86-64. Natively compiling on MS Visual Studio. + (experimental. Thank Hank Anderson for the initial cmake porting work.) + + On Linux and Mac OSX, OpenBLAS cmake supports assembly kernels. + e.g. cmake . + make + make test (Optional) + + On Windows MS Visual Studio, OpenBLAS cmake only support C kernels. + (OpenBLAS uses AT&T style assembly, which is not supported by MSVC.) + e.g. cmake -G "Visual Studio 12 Win64" . + Open OpenBLAS.sln and build. + + * Enable MAX_STACK_ALLOC flags by default. + Improve ger and gemv for small matrices. + * Improve gemv parallel with small m and large n case. + * Improve ?imatcopy when lda==ldb (#633. Thanks, Martin Koehler) + * Add vecLib benchmarks (#565. Thanks, Andreas Noack.) + * Fix LAPACK lantr for row major matrices (#634. Thanks, Dan Kortschak) + * Fix LAPACKE lansy (#640. Thanks, Dan Kortschak) + * Import bug fixes for LAPACKE s/dormlq, c/zunmlq + * Raise the signal when pthread_create fails (#668. Thanks, James K. Lowden) + * Remove g77 from compiler list. + * Enable AppVeyor Windows CI. + +x86/x86-64: + * Support pure C generic kernels for x86/x86-64. + * Support Intel Boardwell and Skylake by Haswell kernels. + * Support AMD Excavator by Steamroller kernels. + * Optimize s/d/c/zdot for Intel SandyBridge and Haswell. + * Optimize s/d/c/zdot for AMD Piledriver and Steamroller. + * Optimize s/d/c/zapxy for Intel SandyBridge and Haswell. + * Optimize s/d/c/zapxy for AMD Piledriver and Steamroller. + * Optimize d/c/zscal for Intel Haswell, dscal for Intel SandyBridge. + * Optimize d/c/zscal for AMD Bulldozer, Piledriver and Steamroller. + * Optimize s/dger for Intel SandyBridge. + * Optimize s/dsymv for Intel SandyBridge. + * Optimize ssymv for Intel Haswell. + * Optimize dgemv for Intel Nehalem and Haswell. + * Optimize dtrmm for Intel Haswell. + +ARM: + * Support Android NDK armeabi-v7a-hard ABI (-mfloat-abi=hard) + e.g. make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7 + * Fix lock, rpcc bugs (#616, #617. Thanks, Grazvydas Ignotas) +POWER: + * Support ppc64le platform (ELF ABI v2. #612. Thanks, Matthew Brandyberry.) + * Support POWER7/8 by POWER6 kernels. (#612. Thanks, Fábio Perez.) + ==================================================================== Version 0.2.14 24-Mar-2015 diff --git a/Makefile.rule b/Makefile.rule index 22f222e3f..459f79c26 100644 --- a/Makefile.rule +++ b/Makefile.rule @@ -3,7 +3,7 @@ # # This library's version -VERSION = 0.2.14 +VERSION = 0.2.15 # If you set the suffix, the library name will be libopenblas_$(LIBNAMESUFFIX).a # and libopenblas_$(LIBNAMESUFFIX).so. Meanwhile, the soname in shared library diff --git a/appveyor.yml b/appveyor.yml index 4daf1bd3d..394e48854 100644 --- a/appveyor.yml +++ b/appveyor.yml @@ -1,4 +1,4 @@ -version: 0.2.14.{build} +version: 0.2.15.{build} #environment: