Update Changelog for 0.3.22 (#3964)
This commit is contained in:
parent
1c2a60e6cf
commit
c05da5960d
|
@ -1,4 +1,80 @@
|
|||
OpenBLAS ChangeLog
|
||||
====================================================================
|
||||
Version 0.3.22
|
||||
26-Mar-2023
|
||||
|
||||
general:
|
||||
- Updated the included LAPACK to Reference-LAPACK release 3.11.0
|
||||
plus post-release corrections and improvements
|
||||
- Added initial support for processing with the EMSCRIPTEN javascript
|
||||
converter (yielding a single-threaded build only)
|
||||
- Added a threshold for multithreading in SYMM, SYMV and SYR2K
|
||||
- Increased the threshold for multithreading in SYRK
|
||||
- OpenBLAS no longer decreases the global OMP_NUM_THREADS when it
|
||||
exceeds the maximum thread count the library was compiled for.
|
||||
- fixed ?GETF2 potentially returning NaN with tiny matrix elements
|
||||
- fixed openblas_set_num_threads to work in USE_OPENMP builds
|
||||
- fixed cpu core counting in USE_OPENMP builds returning the number
|
||||
of OMP "places" rather than cores
|
||||
- fixed interpretation of USE_PERL=0 in build scripts
|
||||
- fixed linking of the library with libm in CMAKE builds
|
||||
- fixed startup delays resulting from a wrong default setting of
|
||||
NO_WARMUP in CMAKE builds
|
||||
- fixed inconsistent defaults for overriding of LAPACK SPMV, SPR,
|
||||
SYMV, SYR functions in gmake and CMAKE builds
|
||||
- fixed stride calculation in the optimized small-matrix path of
|
||||
complex SYR
|
||||
- fixed compilation of ReLAPACK with CMAKE
|
||||
- fixed pkgconfig file contents for INTERFACE64 builds
|
||||
- fixed building of Reference-LAPACK with recent gfortran
|
||||
- fixed building with only a subset of precision types on Windows
|
||||
- added new environment variable OPENBLAS_DEFAULT_NUM_THREADS
|
||||
- added a GEMV-based implementation of GEMMT
|
||||
- added support for building under QNX
|
||||
- updated support for (cross-)building for ALPHA targets
|
||||
|
||||
x86_64:
|
||||
- added autodetection of Intel Raptor Lake cpu models
|
||||
- added SSCAL microkernels for Haswell and newer targets
|
||||
- improved the performance of the Haswell DSCAL microkernel
|
||||
- added CSCAL and ZSCAL microkernels for SkylakeX targets
|
||||
- fixed detection of gfortran and Cray CCE compilers
|
||||
- fixed detection of recent versions of the Intel Fortran compiler
|
||||
- fixed compilation with LLVM to no longer run out of AVX512 registers
|
||||
- fix cpu type option setting with recent NVIDIA HPC compiler versions
|
||||
- fixed compilation for/on AMD Ryzen 4 cpus
|
||||
- fixed compilation of AVX2-capable targets with Apple Clang
|
||||
- fixed runtime selection of COOPERLAKE in DYNAMIC_ARCH builds
|
||||
- worked around gcc/llvm using risky FMA operations in CSCAL/ZSCAL
|
||||
- worked around miscompilations of GEMV, SYMV and ZDOT kernels
|
||||
by gcc12's tree-vectorizer on OSX and Windows
|
||||
|
||||
ARM:
|
||||
- fixed cross-compilation to ARMV5 and ARMV6 targets with CMAKE
|
||||
|
||||
ARMV8:
|
||||
- fixed cross-compilation to CortexA53 with CMAKE
|
||||
- fixed compilation with CMAKE and "Arm Compiler for Linux 22.1"
|
||||
- added cpu autodetection for Cortex X3 and A715
|
||||
- fixed conditional compilation of SVE-capable targets in DYNAMIC_ARCH
|
||||
- sped up SVE kernels by removing unnecessary prefetches
|
||||
- improved the GEMM performance of Neoverse V1
|
||||
- added SVE kernels for SDOT and DDOT
|
||||
- added an SBGEMM kernel for Neoverse N2
|
||||
- improved cpu-specific compiler option selection for Neoverse cpus
|
||||
- added support for setting CONSISTENT_FPCSR
|
||||
|
||||
MIPS64:
|
||||
- improved MSA capability detection and handling
|
||||
- added a MIPS64_GENERIC build target
|
||||
- fixed corner cases in DNRM2
|
||||
|
||||
LOONGARCH64:
|
||||
- fixed handling of the INTERFACE64 option
|
||||
|
||||
RISCV:
|
||||
- fixed handling of the INTERFACE64 option
|
||||
|
||||
====================================================================
|
||||
Version 0.3.21
|
||||
07-Aug-2022
|
||||
|
|
Loading…
Reference in New Issue