Commit Graph

  • eb2eddf241 Merge pull request #2296 from kdunee/develop Martin Kroeker 2019-10-28 13:24:18 +01:00
  • 8691825944 Fixed a minor cmake problem, occuring when DYNAMIC_CORE=ON and CMAKE_C_FLAGS was empty k.dunikowski 2019-10-28 08:51:05 +01:00
  • 7dc8a76f60 Merge pull request #2293 from martin-frbg/pr2288 Martin Kroeker 2019-10-25 23:46:39 +02:00
  • df857551c0 Remove special parameter set for obsolete IOS/ARMV8 workaround Martin Kroeker 2019-10-25 23:07:00 +02:00
  • 85ccdce8c4 Remove the IOS fallbacks to generic C kernels Martin Kroeker 2019-10-25 23:02:37 +02:00
  • aeabe0a83f Fix regex to parse -R options with and without whitespace Martin Kroeker 2019-10-25 22:52:30 +02:00
  • 1b90989662 Add NetBSD to the xBSD conditionals Martin Kroeker 2019-10-25 12:52:49 +02:00
  • e3e8b5cdca Add NetBSD Martin Kroeker 2019-10-25 12:51:06 +02:00
  • 69b16a894d Merge pull request #2292 from martin-frbg/g95fixes Martin Kroeker 2019-10-25 10:35:17 +02:00
  • 6782e5767d Merge pull request #2291 from martin-frbg/gensymbol Martin Kroeker 2019-10-25 10:34:50 +02:00
  • 48f5a89f92 Merge pull request #2282 from martin-frbg/issue2281 Martin Kroeker 2019-10-25 09:56:30 +02:00
  • 4ae1610f37 Merge pull request #2290 from martin-frbg/cpuidfixes Martin Kroeker 2019-10-24 22:52:15 +02:00
  • 911c3e2f4b Improve support for g95 and non-GNU ld Martin Kroeker 2019-10-24 22:43:27 +02:00
  • fab49e49e5 Move most lapack 3.7/3.8 additions to the embedded_underscores list Martin Kroeker 2019-10-24 21:26:20 +02:00
  • b687fba5bc Disable direct clock register access on IOS and Android Martin Kroeker 2019-10-24 21:18:17 +02:00
  • 46a8c2519a Remove prototype of unused, unimplemented function (#2274) luzpaz 2019-10-24 12:56:53 -04:00
  • e9437eebd2 Restore Goldmont ID and improve QEMU support Martin Kroeker 2019-10-24 18:45:27 +02:00
  • 3a39062cfc Merge pull request #12 from xianyi/develop Martin Kroeker 2019-10-24 18:40:13 +02:00
  • eaa0be1313 Merge pull request #2286 from wjc404/develop Martin Kroeker 2019-10-20 12:44:19 +02:00
  • 6ff013bae0 native support for icopy_4 wjc404 2019-10-19 03:54:44 +08:00
  • 0d669e04bb Update dgemm_kernel_8x8_skylakex.c wjc404 2019-10-18 15:00:17 +08:00
  • 17cdd9f9e1 some correction wjc404 2019-10-18 14:58:07 +08:00
  • 6bcb06fcb1 make further changes to icopy_8 easier wjc404 2019-10-18 10:47:31 +08:00
  • b7315f8401 Add files via upload wjc404 2019-10-16 19:23:36 +08:00
  • 9b19e9e1b0 Update dgemm_kernel_8x8_skylakex.c wjc404 2019-10-16 10:14:51 +08:00
  • 6bd67ddbab Update dgemm_kernel_8x8_skylakex.c wjc404 2019-10-16 03:20:08 +08:00
  • 5da9484d93 Add files via upload wjc404 2019-10-16 02:01:13 +08:00
  • 844629af57 Add files via upload wjc404 2019-10-16 02:00:34 +08:00
  • 2beaa82c05 Merge pull request #2283 from martin-frbg/issue2176 Martin Kroeker 2019-10-09 22:06:09 +02:00
  • e8a2aed2b9 Support QEMU cpu calling itself 64bit AMD Athlon as well Martin Kroeker 2019-10-09 18:24:13 +02:00
  • f262031685 Support QEMU virtual cpu as CORE2 Martin Kroeker 2019-10-08 22:30:02 +02:00
  • 5f6206fa2d Simplify OSX/IOS cross-compilation and add a CI test for it (#2279) Martin Kroeker 2019-10-08 20:13:14 +02:00
  • f2cde2ccfb Update common_arm64.h Martin Kroeker 2019-10-08 20:12:08 +02:00
  • ba7838d2e1 Merge pull request #2280 from martin-frbg/iosfix Martin Kroeker 2019-10-08 10:25:25 +02:00
  • a448884a63 Remove automatic label postfixes from macro included only once Martin Kroeker 2019-10-08 08:37:50 +02:00
  • 17609f88f1 Merge pull request #11 from xianyi/develop Martin Kroeker 2019-10-08 08:32:52 +02:00
  • 3a2df19db6 Fix accidental duplication of jump instruction Martin Kroeker 2019-10-08 08:09:26 +02:00
  • d2093a40d3 Merge pull request #2277 from martin-frbg/issue2275 Martin Kroeker 2019-10-06 23:01:54 +02:00
  • aa04b0925e Merge pull request #2276 from xianyi/revert-2272-thread-sqrt-of-negative Martin Kroeker 2019-10-06 11:12:44 +02:00
  • 258ac56e0a Move 32bit OSX build back to xcode 8.3 but switch to gcc8 Martin Kroeker 2019-10-05 10:52:47 +02:00
  • 56837e9d92 Make local labels in macro compatible with the xcode assembler Martin Kroeker 2019-10-04 14:53:23 +02:00
  • bb5413863f Rewrite ARM64 PROLOGUE to make it compatible with xcode/ios Martin Kroeker 2019-10-04 14:50:03 +02:00
  • 32f5907fef Update 32bit macOS again to xcode 9.3 Martin Kroeker 2019-10-03 01:09:02 +02:00
  • ac10236cc8 Update the OSX BINARY=32 test to xcode9.2 Martin Kroeker 2019-10-02 22:35:34 +02:00
  • 8617d75548 Revert "Avoid taking root of negative number in symv_thread.c" revert-2272-thread-sqrt-of-negative Martin Kroeker 2019-10-01 23:50:41 +02:00
  • c07d78b9e9 Merge pull request #2272 from seberg/thread-sqrt-of-negative Martin Kroeker 2019-09-30 11:27:29 +02:00
  • 6355c25dde Avoid taking root of negative number in symv_thread.c Sebastian Berg 2019-09-29 22:03:12 -07:00
  • 5e244d80f2 Merge pull request #2271 from quickwritereader/strmm_fix Martin Kroeker 2019-09-29 13:53:45 +02:00
  • ede5efebab trmm fix AbdelRauf 2019-09-29 02:27:50 +00:00
  • 84908d60d2 Merge pull request #2269 from martin-frbg/ppc-fixes Martin Kroeker 2019-09-27 09:52:19 +02:00
  • 596a22325a Fix prologue of power9 assembly cdot(c) kernel to provide cdotc Martin Kroeker 2019-09-27 00:47:18 +02:00
  • 7f58f3ad0e Fix mis-edits in the gcc-derived power8 caxpy kernel Martin Kroeker 2019-09-27 00:44:26 +02:00
  • c0d570a357 Merge pull request #7 from xianyi/develop Martin Kroeker 2019-09-27 00:42:32 +02:00
  • 6b83079368 Count cpu cores on ARMV8 and use that to pick the GEMM_PQ parameters (#2267) Martin Kroeker 2019-09-25 23:13:24 +02:00
  • 673e5a0495 Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) Martin Kroeker 2019-09-22 22:35:22 +02:00
  • bfa2cc7d64 Restore ppc64 CI job and remove the travis_wait that caused the problem with it Martin Kroeker 2019-09-20 10:29:35 +02:00
  • e7c4d6705a Revert #2051 and replace with a better fix (#2261) Martin Kroeker 2019-09-17 18:56:04 +02:00
  • 71d488ffba Revert "Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1" revert-2051-issue2048 Martin Kroeker 2019-09-13 22:51:33 +02:00
  • 2a1911cc14 Merge pull request #6 from xianyi/develop Martin Kroeker 2019-09-13 14:00:23 +02:00
  • 9f7a9a32e3 Merge pull request #2252 from thrasibule/trtrs Martin Kroeker 2019-09-12 21:45:47 +02:00
  • 2463938879 fix error message Guillaume Horel 2019-09-11 10:33:35 -04:00
  • 5d6525c87c more bugfix Guillaume Horel 2019-09-10 17:30:57 -04:00
  • 6cb47ea3f0 fix Makefile Guillaume Horel 2019-09-10 17:11:01 -04:00
  • 459bb9291d fix error codes Guillaume Horel 2019-09-10 17:10:33 -04:00
  • 3f1077ce6f Merge pull request #2249 from brada4/gcc7minor Martin Kroeker 2019-09-10 08:27:32 +02:00
  • eb45eb6942 Fix C compiler handling and BINARY=32 mode in CMAKE builds (#2248) Martin Kroeker 2019-09-10 08:27:06 +02:00
  • f2becb777a fix Makefile Guillaume Horel 2019-09-09 11:36:50 -04:00
  • 5997b6b491 bugfix Guillaume Horel 2019-09-07 22:06:27 -04:00
  • 4b21b646ea turn on optimized code Guillaume Horel 2019-09-06 17:19:40 -04:00
  • 7ec7b999a5 add missing file Guillaume Horel 2019-09-06 16:49:27 -04:00
  • af9ac0898a fix Makefile Guillaume Horel 2019-09-06 16:49:12 -04:00
  • c7b5a459b6 add missing defines and headers Guillaume Horel 2019-09-06 16:48:18 -04:00
  • 9b2f0323d6 update Makefile Guillaume Horel 2019-09-06 16:01:55 -04:00
  • 9f6984fe4b add missing files Guillaume Horel 2019-09-03 14:45:43 -04:00
  • 42203dafdc add logic Guillaume Horel 2019-09-02 21:57:28 -04:00
  • a4f17a9297 add missing objects Guillaume Horel 2019-09-02 21:15:20 -04:00
  • 733d97b2df add files Guillaume Horel 2019-08-30 16:31:25 -04:00
  • ea747cf933 start working on ?trtrs Guillaume Horel 2019-08-30 15:06:38 -04:00
  • 4de545aa7d address minor warnings from gcc7 Andrew 2019-09-07 10:21:08 +03:00
  • 6e9a93ec19 init Andrew 2019-09-07 10:18:46 +03:00
  • fde8a8e6a0 Improve cmake build behaviour with non-host cpu targets (#2246) Martin Kroeker 2019-09-03 22:41:17 +02:00
  • 256fc15f5f Merge pull request #2 from xianyi/develop Martin Kroeker 2019-09-03 15:12:14 +02:00
  • ee498525e0 Merge pull request #2242 from martin-frbg/issue2235 Martin Kroeker 2019-09-02 22:06:29 +02:00
  • 1fec0570f6 Add cgemm and zgemm unroll factors for core2 Martin Kroeker 2019-09-02 15:03:45 +02:00
  • b5af7b9c78 Disable ppc64le test environment on Travis CI Martin Kroeker 2019-08-31 18:06:12 +02:00
  • f3c314550c Merge pull request #2243 from quickwritereader/develop Martin Kroeker 2019-08-30 23:06:23 +02:00
  • 847c20c9b7 fix uninitialized variables i AbdelRauf 2019-08-30 11:14:55 +00:00
  • 4c22828812 caxpy and cdot are using vec_vsx_ld AbdelRauf 2019-08-30 04:09:15 +00:00
  • e79712d969 cgemv using vec_vsx_ld instead of letting gcc to decide AbdelRauf 2019-08-30 02:52:04 +00:00
  • be09551cdf aligned AbdelRauf 2019-08-29 23:22:23 +00:00
  • ec1ef6aa9e Merge pull request #2241 from martin-frbg/zdotfix Martin Kroeker 2019-08-29 07:12:54 +02:00
  • 11c59acfb1 Keep both PGI/SUN and default code paths to avoid breaking Clang/WIndows Martin Kroeker 2019-08-28 18:07:44 +02:00
  • bf0d92a310 Add arch data for cross-compiling to CORE2 Martin Kroeker 2019-08-28 17:35:56 +02:00
  • db066151ee Merge pull request #2240 from martin-frbg/issue2237 Martin Kroeker 2019-08-28 15:30:53 +02:00
  • 3a55dca2dc Make x86_64 zdot compile with PGI and Sun C again Martin Kroeker 2019-08-28 11:35:31 +02:00
  • 7d380f7d79 Fix PGI build options (again) Martin Kroeker 2019-08-28 11:31:20 +02:00
  • 300f158d3b Merge pull request #2239 from martin-frbg/issue2231 Martin Kroeker 2019-08-28 07:54:57 +02:00
  • 3635fdbf2b Do not abuse the global ARCH variable as a local temporary Martin Kroeker 2019-08-27 22:52:17 +02:00
  • b6552b11eb Merge pull request #2 from xianyi/develop Martin Kroeker 2019-08-27 22:41:31 +02:00
  • 3dc6b26eff AIX changes for Power8 Kavana Bhat 2019-08-20 06:51:35 -05:00