Commit Graph

  • d93cf1126e Merge pull request #1753 from dloghin/risc-v Martin Kroeker 2018-09-07 11:01:23 +02:00
  • 4cf7315a5d Adjust ARMV8 SGEMM unrolling when using the C fallback kernel_2x2 for IOS Martin Kroeker 2018-09-06 21:41:54 +02:00
  • b57af93792 just make CBLAS_LAYOUT an alias of the existing CBLAS_ORDER Martin Kroeker 2018-09-06 16:54:31 +02:00
  • 8aeab0601e Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT Martin Kroeker 2018-09-06 16:39:52 +02:00
  • a1bdc308b8 override ARCH (archiver) in lapack-netlib/make.inc Dumi Loghin 2018-09-06 13:13:36 +08:00
  • 0b7ccb9e38 Revert "replace ARCH with AR in lapack-netlib" Dumi Loghin 2018-09-06 13:08:30 +08:00
  • db17ce896f replace ARCH with AR in lapack-netlib Dumi Loghin 2018-09-05 12:49:37 +08:00
  • 1cb7b9015e Conditional compilation of assembly files that IOS does not like Martin Kroeker 2018-09-04 11:06:51 +02:00
  • a4bd41e9f2 Fix paths to C kernels for nrm2 Martin Kroeker 2018-09-04 10:51:19 +02:00
  • 9e2bb0c641 Update with the changes from 0.3.3 Martin Kroeker 2018-08-31 00:21:13 +02:00
  • dbfd7524cd Update version to 0.3.4.dev Martin Kroeker 2018-08-31 00:19:21 +02:00
  • 2982ce505d Update version to 0.3.4.dev Martin Kroeker 2018-08-31 00:18:37 +02:00
  • fd8d1868a1 Updates for 0.3.3 v0.3.3 Martin Kroeker 2018-08-31 00:07:48 +02:00
  • f0563f14ba Version 0.3.3 Martin Kroeker 2018-08-30 23:43:57 +02:00
  • 3197f86762 Version 0.3.3 Martin Kroeker 2018-08-30 23:43:14 +02:00
  • 422a8fa953 Merge pull request #1747 from xianyi/develop Martin Kroeker 2018-08-30 23:42:19 +02:00
  • 5bac15adbd Merge pull request #1746 from martin-frbg/issue1674 Martin Kroeker 2018-08-30 17:48:07 +02:00
  • e17f969fa0 Assume cross-compilation if host and target os differ Martin Kroeker 2018-08-30 13:28:46 +02:00
  • e11126b26a Merge pull request #1745 from martin-frbg/issue1743 Martin Kroeker 2018-08-29 07:43:58 +02:00
  • 74608e470d Merge pull request #1744 from martin-frbg/lapack272 Martin Kroeker 2018-08-28 22:58:58 +02:00
  • f3fd44a731 Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic Martin Kroeker 2018-08-28 21:34:07 +02:00
  • 9e917b16db Fix missing replacements of ILAENV by ILAENV_2STAGE (lapack PR 272) Martin Kroeker 2018-08-28 21:11:54 +02:00
  • 8440a4cb1a Merge pull request #1742 from martin-frbg/interim033 Martin Kroeker 2018-08-28 08:02:15 +02:00
  • b55690a659 typo fix Martin Kroeker 2018-08-26 11:31:07 +02:00
  • b902a40986 Rewrite glibc version check Martin Kroeker 2018-08-26 11:18:02 +02:00
  • 5991d1a6cd Update memory.c Martin Kroeker 2018-08-25 22:12:40 +02:00
  • b1b743f434 Merge branch 'develop' into interim033 Martin Kroeker 2018-08-25 19:45:19 +02:00
  • 2caa2210bb Add USE_TLS option to choose between old and new implementation of memory.c Martin Kroeker 2018-08-25 19:37:11 +02:00
  • 2a589c4b28 Add USE_TLS option to switch between old and new memory.c Martin Kroeker 2018-08-25 19:36:12 +02:00
  • fd42ca462d Combo of default pre-0.3.1 memory.c and band-aided version of PR1739 Martin Kroeker 2018-08-25 19:35:16 +02:00
  • 52d3f7af50 Merge pull request #1738 from sharkcz/s390x Martin Kroeker 2018-08-16 09:46:34 +02:00
  • 5c6e020f49 detect z14 arch on s390x Dan Horák 2018-08-14 12:30:38 +02:00
  • e6c0e39492 Optimize Zgemv maamountki 2018-08-13 12:23:40 +03:00
  • d4d3113adc Merge pull request #1731 from fenrus75/readme Martin Kroeker 2018-08-13 00:01:37 +02:00
  • 375dff54fc Merge pull request #1733 from fenrus75/dsymv Martin Kroeker 2018-08-12 18:18:36 +02:00
  • a5f165275a Merge pull request #1732 from fenrus75/dgemv Martin Kroeker 2018-08-12 18:17:42 +02:00
  • 8c13aa495a Merge pull request #1730 from fenrus75/fix-sdot Martin Kroeker 2018-08-12 18:17:01 +02:00
  • 1ee6d087c3 Merge pull request #1729 from fenrus75/dscal Martin Kroeker 2018-08-12 18:16:45 +02:00
  • a95a784ab2 Merge pull request #1723 from maamountki/develop Martin Kroeker 2018-08-11 21:08:45 +02:00
  • 9bec34cb67 Add an AVX512 enabled DSYMV (L) function Arjan van de Ven 2018-08-11 17:46:24 +00:00
  • 87bebdbd8a Add an AVX512 enabled DGEMV (n) function Arjan van de Ven 2018-08-11 17:38:12 +00:00
  • 9493f26309 add short blurb about avx512 and needed compiler to README Arjan van de Ven 2018-08-11 17:21:46 +00:00
  • 36add7570a Fix typo in sdot function Arjan van de Ven 2018-08-11 17:16:45 +00:00
  • cacacc8007 Add an AVX512 enabled DSCAL function Arjan van de Ven 2018-08-11 17:14:57 +00:00
  • 1a00ef3d27 Merge pull request #1725 from fenrus75/axpy Martin Kroeker 2018-08-11 11:01:20 +02:00
  • 4c0d832ec3 Merge pull request #1724 from fenrus75/sdot Martin Kroeker 2018-08-11 11:00:56 +02:00
  • fc33cbc7bb Merge pull request #1728 from martin-frbg/changelog Martin Kroeker 2018-08-10 13:24:36 +02:00
  • c52a831ae4 Add changes from the 0.3.x releases Martin Kroeker 2018-08-10 13:23:47 +02:00
  • 2e99873ff7 Add a AVX512 enabled SAXPY/DAXPY functions Arjan van de Ven 2018-08-10 02:58:32 +00:00
  • 00abaa865b Add an AVX512 enabled SDOT function Arjan van de Ven 2018-08-10 02:31:48 +00:00
  • 33043f563f Disable scal to benchmark zgemv separately by default maamountki 2018-08-10 01:54:18 +03:00
  • 66da7677bd Merge pull request #1721 from fenrus75/ddot2 Martin Kroeker 2018-08-09 15:39:06 +02:00
  • 7932ff3ea9 Add an AVX512 enabled DDOT function Arjan van de Ven 2018-08-08 02:59:11 +00:00
  • 62f4c69708 Merge pull request #1717 from martin-frbg/issue1708 Martin Kroeker 2018-08-06 22:05:47 +02:00
  • 453bfa7e71 [ZARCH] Restore detect() function maamountki 2018-08-06 20:03:49 +03:00
  • 23229011db [ZARCH] Z14 support, BLAS 1/2 single precision implementations, Some missing double precision implementations, Gemv optimization maamountki 2018-08-06 18:20:40 +03:00
  • 73478664d4 Add workaround for avx512 compilations on Cygwin Martin Kroeker 2018-08-06 16:40:32 +02:00
  • ee955757f9 Merge pull request #1715 from stevengj/patch-1 Martin Kroeker 2018-08-05 22:48:44 +02:00
  • 48610a4524 fix blasabs for windows Steven G. Johnson 2018-08-05 08:18:51 -04:00
  • 4a553e8678 Merge pull request #1713 from martin-frbg/issue1710 Martin Kroeker 2018-08-04 23:51:31 +02:00
  • e788102c10 Merge pull request #1709 from stevengj/patch-1 Martin Kroeker 2018-08-04 23:51:10 +02:00
  • 165f00c159 fabs -> fabsl Martin Kroeker 2018-08-04 20:14:51 +02:00
  • 40c068a875 Introduce blasabs() to switch between abs() and labs() for INTERFACE64 Martin Kroeker 2018-08-04 20:07:59 +02:00
  • 933896a1d0 Use blasabs to switch between abs and labs as needed for INTERFACE64 Martin Kroeker 2018-08-04 20:06:49 +02:00
  • a4e321400b fabs -> fabsl Steven G. Johnson 2018-08-03 13:00:10 -04:00
  • 9e65430504 Merge pull request #1703 from wsttiger/cmake_fix Martin Kroeker 2018-08-02 23:48:42 +02:00
  • 2cfa86b406 Merge pull request #1707 from extrowerk/haiku_support Martin Kroeker 2018-08-02 22:27:00 +02:00
  • 2a9a9389ef Added target_include_directories() Scott Thornton 2018-08-02 14:58:52 -05:00
  • 6463bffd59 Haiku supporting patches Zoltán Mizsei 2018-08-02 20:49:14 +02:00
  • 8ef7d4fb54 Merge pull request #1706 from oon3m0oo/develop Martin Kroeker 2018-08-02 18:53:34 +02:00
  • 6400868e55 Fix #1705 where we incorrectly calculate page locations. Craig Donner 2018-08-02 16:21:19 +01:00
  • 8ebf541e97 Set EXPORT_NAME to match OpenBLASConfig.cmake Scott Thornton 2018-07-30 15:18:29 -05:00
  • b03ae3f4dc Set version to 0.3.3.dev Martin Kroeker 2018-07-30 08:23:13 +02:00
  • 2cc8fb0ad2 Set version to 0.3.3.dev Martin Kroeker 2018-07-30 08:22:38 +02:00
  • e8a68ef261 Merge pull request #1702 from xianyi/develop v0.3.2 Martin Kroeker 2018-07-30 07:25:01 +02:00
  • 64826a0d7d Merge branch 'release-0.3.0' into develop Martin Kroeker 2018-07-29 22:37:09 +02:00
  • 25f2d25cfe Merge pull request #1697 from martin-frbg/issue1696 Martin Kroeker 2018-07-25 19:55:29 +02:00
  • 73131fa30a Do not treat WIndows UWB builds as cross-compiling Martin Kroeker 2018-07-24 17:46:33 +02:00
  • 66fcdd5be8 Merge pull request #1695 from martin-frbg/issue1692 Martin Kroeker 2018-07-22 16:34:09 +02:00
  • 43ac839c16 Unset memory table entry, not just the temporary pointer to it on shutdown Martin Kroeker 2018-07-22 09:19:19 +02:00
  • 7ba5936ecd Merge pull request #1688 from martin-frbg/issue1673 Martin Kroeker 2018-07-19 19:03:45 +02:00
  • b14f44d2ad Temporarily disable special handling of OPENMP thread memory allocation Martin Kroeker 2018-07-19 08:57:56 +02:00
  • e71d70ba87 Merge pull request #1681 from martin-frbg/issue1671 Martin Kroeker 2018-07-16 22:47:05 +02:00
  • d671870f5f Merge pull request #1684 from martin-frbg/issue1672 Martin Kroeker 2018-07-16 22:46:49 +02:00
  • 4e103c822c typo fix Martin Kroeker 2018-07-16 12:56:39 +02:00
  • d2142760e0 Fix precision problem in DSDOT Martin Kroeker 2018-07-15 17:11:40 +02:00
  • 2fbfc64da8 Use C kernels for default c/zAXPY, xROT, c/zSWAP Martin Kroeker 2018-07-15 17:09:55 +02:00
  • 8d5b33b6be Add cpu identification via mfpvr call for the BSDs Martin Kroeker 2018-07-12 23:39:00 +02:00
  • 36aea5ce2d Merge pull request #1680 from martin-frbg/snprint Martin Kroeker 2018-07-12 14:05:13 +02:00
  • 1309711e24 Fix declaration of snprintf for older MSVC Martin Kroeker 2018-07-12 11:47:52 +02:00
  • 571e9de2ac Fix definition of snprintf for MSVC Martin Kroeker 2018-07-12 11:42:25 +02:00
  • 448ed15115 Merge pull request #1678 from martin-frbg/issue1677 Martin Kroeker 2018-07-12 09:21:34 +02:00
  • 045fb5ea2c Define snprintf for older versions of MSVC Martin Kroeker 2018-07-12 07:30:58 +02:00
  • 4dd70d98d7 Merge pull request #1667 from xianyi/revert-1642-develop Martin Kroeker 2018-07-04 08:27:21 +02:00
  • 504310eeb9 Merge pull request #1665 from martin-frbg/cpuid-ryzen2 Martin Kroeker 2018-07-04 08:19:40 +02:00
  • ea1f39518f Merge pull request #1663 from martin-frbg/issue1641 Martin Kroeker 2018-07-04 08:19:11 +02:00
  • 5f2a3c05cd Revert "Rewrite &= -> = and simplify the initial blocking phase." revert-1642-develop Martin Kroeker 2018-07-03 21:42:28 +02:00
  • d0ec4325cf Add cpuid for AMD Ryzen 2 Martin Kroeker 2018-07-03 21:03:24 +02:00
  • 3f73e8b8cf Add cpuid for AMD Ryzen 2 Martin Kroeker 2018-07-03 21:01:35 +02:00
  • a83f01e0ee Merge pull request #1662 from martin-frbg/cmake-avx512 Martin Kroeker 2018-07-03 17:40:09 +02:00