Martin Kroeker
62f4c69708
Merge pull request #1717 from martin-frbg/issue1708
...
Add workaround for avx512 compilations on Cygwin
2018-08-06 22:05:47 +02:00
maamountki
453bfa7e71
[ZARCH] Restore detect() function
2018-08-06 20:03:49 +03:00
maamountki
23229011db
[ZARCH] Z14 support, BLAS 1/2 single precision implementations, Some missing double precision implementations, Gemv optimization
2018-08-06 18:20:40 +03:00
Martin Kroeker
73478664d4
Add workaround for avx512 compilations on Cygwin
...
fixes #1708
2018-08-06 16:40:32 +02:00
Martin Kroeker
ee955757f9
Merge pull request #1715 from stevengj/patch-1
...
fix blasabs for windows
2018-08-05 22:48:44 +02:00
Steven G. Johnson
48610a4524
fix blasabs for windows
...
Bugfix in #1713 for Windows (LLP64), where `blasabs` needs to be `llabs` rather than `labs` for the 64-bit API.
2018-08-05 08:18:51 -04:00
Martin Kroeker
4a553e8678
Merge pull request #1713 from martin-frbg/issue1710
...
Introduce blasabs macro and use it to switch between abs and labs for INTERFACE64
2018-08-04 23:51:31 +02:00
Martin Kroeker
e788102c10
Merge pull request #1709 from stevengj/patch-1
...
fabs -> fabsl
2018-08-04 23:51:10 +02:00
Martin Kroeker
165f00c159
fabs -> fabsl
2018-08-04 20:14:51 +02:00
Martin Kroeker
40c068a875
Introduce blasabs() to switch between abs() and labs() for INTERFACE64
2018-08-04 20:07:59 +02:00
Martin Kroeker
933896a1d0
Use blasabs to switch between abs and labs as needed for INTERFACE64
2018-08-04 20:06:49 +02:00
Steven G. Johnson
a4e321400b
fabs -> fabsl
...
Fixes two calls that were using `fabs` on a `long double` argument rather than `fabsl`, which looks like it is doing an unintentional truncation to `double` precision.
2018-08-03 13:00:10 -04:00
Martin Kroeker
9e65430504
Merge pull request #1703 from wsttiger/cmake_fix
...
Set EXPORT_NAME to match OpenBLASConfig.cmake
2018-08-02 23:48:42 +02:00
Martin Kroeker
2cfa86b406
Merge pull request #1707 from extrowerk/haiku_support
...
Haiku supporting patches
2018-08-02 22:27:00 +02:00
Scott Thornton
2a9a9389ef
Added target_include_directories()
2018-08-02 14:58:52 -05:00
Zoltán Mizsei
6463bffd59
Haiku supporting patches
2018-08-02 20:49:14 +02:00
Martin Kroeker
8ef7d4fb54
Merge pull request #1706 from oon3m0oo/develop
...
Fix #1705 where we incorrectly calculate page locations.
2018-08-02 18:53:34 +02:00
Craig Donner
6400868e55
Fix #1705 where we incorrectly calculate page locations.
...
Since we now use an allocation size that isn't a multiple of PAGESIZE, finding
the pages for run_bench wasn't terminating properly. Now we detect if we've
found enough pages for the allocation and terminate the loop.
2018-08-02 16:21:19 +01:00
Scott Thornton
8ebf541e97
Set EXPORT_NAME to match OpenBLASConfig.cmake
2018-07-30 15:18:29 -05:00
Martin Kroeker
b03ae3f4dc
Set version to 0.3.3.dev
2018-07-30 08:23:13 +02:00
Martin Kroeker
2cc8fb0ad2
Set version to 0.3.3.dev
2018-07-30 08:22:38 +02:00
Martin Kroeker
e8a68ef261
Merge pull request #1702 from xianyi/develop
...
Merge develop for 0.3.2
2018-07-30 07:25:01 +02:00
Martin Kroeker
64826a0d7d
Merge branch 'release-0.3.0' into develop
2018-07-29 22:37:09 +02:00
Martin Kroeker
25f2d25cfe
Merge pull request #1697 from martin-frbg/issue1696
...
Do not treat WIndows UWB builds as cross-compiling
2018-07-25 19:55:29 +02:00
Martin Kroeker
73131fa30a
Do not treat WIndows UWB builds as cross-compiling
2018-07-24 17:46:33 +02:00
Martin Kroeker
66fcdd5be8
Merge pull request #1695 from martin-frbg/issue1692
...
Unset memory table entry, not just the local pointer to it on shutdown
2018-07-22 16:34:09 +02:00
Martin Kroeker
43ac839c16
Unset memory table entry, not just the temporary pointer to it on shutdown
...
to fix crash with multiple instances of OpenBLAS, #1692
2018-07-22 09:19:19 +02:00
Martin Kroeker
7ba5936ecd
Merge pull request #1688 from martin-frbg/issue1673
...
Temporarily disable special handling of OPENMP thread memory allocation
2018-07-19 19:03:45 +02:00
Martin Kroeker
b14f44d2ad
Temporarily disable special handling of OPENMP thread memory allocation
...
for issue #1673
2018-07-19 08:57:56 +02:00
Martin Kroeker
e71d70ba87
Merge pull request #1681 from martin-frbg/issue1671
...
Add cpu identification via mfpvr call for the BSDs
2018-07-16 22:47:05 +02:00
Martin Kroeker
d671870f5f
Merge pull request #1684 from martin-frbg/issue1672
...
Work around utest failures in the MIPS64 SICORTEX target
2018-07-16 22:46:49 +02:00
Martin Kroeker
4e103c822c
typo fix
2018-07-16 12:56:39 +02:00
Martin Kroeker
d2142760e0
Fix precision problem in DSDOT
2018-07-15 17:11:40 +02:00
Martin Kroeker
2fbfc64da8
Use C kernels for default c/zAXPY, xROT, c/zSWAP
2018-07-15 17:09:55 +02:00
Martin Kroeker
8d5b33b6be
Add cpu identification via mfpvr call for the BSDs
...
fixes #1671
2018-07-12 23:39:00 +02:00
Martin Kroeker
36aea5ce2d
Merge pull request #1680 from martin-frbg/snprint
...
Fix wrong redefinitions of snprintf for older MSVC
2018-07-12 14:05:13 +02:00
Martin Kroeker
1309711e24
Fix declaration of snprintf for older MSVC
...
_snprintf_s takes an additional (size) argument, so is no direct replacement.
(Note that this code is currently unused - the two instances of snprintf here are within ifdef blocks that are not compiled for MSVC)
2018-07-12 11:47:52 +02:00
Martin Kroeker
571e9de2ac
Fix definition of snprintf for MSVC
...
MS _snprintf_s takes an additional argument for the size of the buffer, so is not a direct replacement (utest/ctest.h from which I copied was wrong)
2018-07-12 11:42:25 +02:00
Martin Kroeker
448ed15115
Merge pull request #1678 from martin-frbg/issue1677
...
Define snprintf for older versions of MSVC
2018-07-12 09:21:34 +02:00
Martin Kroeker
045fb5ea2c
Define snprintf for older versions of MSVC
...
for #1677
2018-07-12 07:30:58 +02:00
Martin Kroeker
4dd70d98d7
Merge pull request #1667 from xianyi/revert-1642-develop
...
Revert "Rewrite &= -> = and simplify the initial blocking phase."
2018-07-04 08:27:21 +02:00
Martin Kroeker
504310eeb9
Merge pull request #1665 from martin-frbg/cpuid-ryzen2
...
Add cpuid for AMD Ryzen 2
2018-07-04 08:19:40 +02:00
Martin Kroeker
ea1f39518f
Merge pull request #1663 from martin-frbg/issue1641
...
Double MAX_ALLOCATING_THREADS to fix segfaults with Go and Octave
2018-07-04 08:19:11 +02:00
Martin Kroeker
5f2a3c05cd
Revert "Rewrite &= -> = and simplify the initial blocking phase."
2018-07-03 21:42:28 +02:00
Martin Kroeker
d0ec4325cf
Add cpuid for AMD Ryzen 2
2018-07-03 21:03:24 +02:00
Martin Kroeker
3f73e8b8cf
Add cpuid for AMD Ryzen 2
...
for #1664
2018-07-03 21:01:35 +02:00
Martin Kroeker
a83f01e0ee
Merge pull request #1662 from martin-frbg/cmake-avx512
...
Add -march=skylake-avx512 to AVX512 compile check and suppress its ou…
2018-07-03 17:40:09 +02:00
Martin Kroeker
a49203b48c
Double MAX_ALLOCATING_THREADS to fix segfaults with Go and Octave
...
for #1641
2018-07-03 17:35:54 +02:00
Martin Kroeker
b74aef2816
Add -march=skylake-avx512 to AVX512 compile check and suppress its output
2018-07-03 14:41:44 +02:00
Martin Kroeker
a9fa805007
Merge pull request #1660 from martin-frbg/issue1659
...
Fix typo that broke compilation with DYNAMIC_ARCH and NO_AVX2
2018-07-02 17:48:19 +02:00