Commit Graph

  • 0f105dd8a5 sgemm/strmm AbdelRauf 2019-04-13 13:56:19 +00:00
  • 9c4edd38f2 Merge pull request #2099 from martin-frbg/rela-gbtrf Martin Kroeker 2019-04-29 09:25:19 +02:00
  • 1036299da0 Disable repeated recursion on Ab_BR in ReLAPACK xGBTRF Martin Kroeker 2019-04-29 00:12:37 +02:00
  • 5b0398186e Merge pull request #2098 from martin-frbg/rela-malloc Martin Kroeker 2019-04-28 19:31:01 +02:00
  • 452859f4e1 Merge pull request #2097 from martin-frbg/rela-getrf Martin Kroeker 2019-04-28 19:28:57 +02:00
  • 2cd463eabd Disable reallocation of work array in xSYTRF Martin Kroeker 2019-04-28 10:02:28 +02:00
  • 11530b76f7 Correct INFO=4 condition Martin Kroeker 2019-04-28 09:58:56 +02:00
  • 91943b7325 Merge pull request #2096 from martin-frbg/eig-testing Martin Kroeker 2019-04-28 09:55:42 +02:00
  • 268c28db7d Merge pull request #2095 from martin-frbg/trsm Martin Kroeker 2019-04-28 09:55:25 +02:00
  • 2aad88d5b9 Avoid out-of-bounds accesses in LAPACK EIG tests Martin Kroeker 2019-04-27 23:01:49 +02:00
  • 0bd956fd21 Correct length of name string in xerbla call Martin Kroeker 2019-04-27 22:49:04 +02:00
  • bbd9d98664 Merge pull request #2094 from martin-frbg/issue2066 Martin Kroeker 2019-04-27 22:45:47 +02:00
  • 798c448b0c Add support for INTERFACE64 and fix XERBLA calls Martin Kroeker 2019-04-27 19:06:00 +02:00
  • 9a19616a28 Support INTERFACE64=1 Martin Kroeker 2019-04-27 18:55:47 +02:00
  • 6b41eb9c0c Merge pull request #2092 from jeffbaylor/snprintf_with_MSC_VER Martin Kroeker 2019-04-23 20:12:06 +02:00
  • ccfb7ead15 Merge pull request #2072 from martin-frbg/sum Martin Kroeker 2019-04-23 20:11:36 +02:00
  • 40e53e52d6 snprintf define consolidated to common.h Jeff Baylor 2019-04-22 17:01:34 -07:00
  • 744779d335 Merge pull request #2084 from RashmicaG/develop Martin Kroeker 2019-04-14 21:40:07 +02:00
  • bcdf1d4917 Add in runtime CPU detection for POWER. Rashmica Gupta 2019-04-09 14:13:24 +10:00
  • e06b8438b4 Merge pull request #2080 from martin-frbg/issue2075 Martin Kroeker 2019-04-02 21:40:58 +02:00
  • 9229d6859b Add -lm and disable EXPRECISION support on *BSD Martin Kroeker 2019-04-02 09:38:18 +02:00
  • 21d146a8de Add declarations for ?sum Martin Kroeker 2019-03-31 22:12:23 +02:00
  • 7f4e36d219 Merge pull request #2073 from martin-frbg/issue2056-2 Martin Kroeker 2019-03-31 13:56:08 +02:00
  • c04a729081 Add ?sum definitions for generic kernel Martin Kroeker 2019-03-31 13:55:49 +02:00
  • 100d94f94e Add ?sum Martin Kroeker 2019-03-31 13:55:05 +02:00
  • d17da6c6a4 Add cmake defaults for ?sum kernels Martin Kroeker 2019-03-31 11:57:01 +02:00
  • 1679de5e59 Detect 32bit environment on 64bit ARM hardware Martin Kroeker 2019-03-31 10:50:43 +02:00
  • 246ca29679 Add ZARCH implementation of ?sum Martin Kroeker 2019-03-30 22:49:05 +01:00
  • 9d717cb5ee Add x86_64 implementation of ?sum Martin Kroeker 2019-03-30 22:27:04 +01:00
  • e3bc83f2a8 Add x86 implementation of ?sum Martin Kroeker 2019-03-30 22:26:10 +01:00
  • 70f2a4e0d7 Add SPARC implementation of ?sum Martin Kroeker 2019-03-30 22:25:06 +01:00
  • 706dfe263b Add POWER implementation of ?sum Martin Kroeker 2019-03-30 22:23:42 +01:00
  • 688fa9201c Add MIPS64 implementation of ?sum Martin Kroeker 2019-03-30 22:22:15 +01:00
  • cdbe0f0235 Add MIPS implementation of ?sum Martin Kroeker 2019-03-30 22:20:14 +01:00
  • f8b82bc6dc Add ia64 implementation of ?sum Martin Kroeker 2019-03-30 22:18:03 +01:00
  • 3e3ccb9011 Add ARM64 implementations of ?sum Martin Kroeker 2019-03-30 22:13:36 +01:00
  • 94ab4e6fb2 Add ARM implementations of ?sum Martin Kroeker 2019-03-30 22:11:38 +01:00
  • c3cfc6986b Add implementations of ssum/dsum and csum/zsum Martin Kroeker 2019-03-30 22:05:11 +01:00
  • b9f4943a14 Add ?sum Martin Kroeker 2019-03-30 22:01:13 +01:00
  • 79cfc24a62 Add interface for ?sum (derived from ?asum) Martin Kroeker 2019-03-30 21:59:18 +01:00
  • 5c42287c4f Add declarations for ?sum and cblas_?sum Martin Kroeker 2019-03-30 21:58:03 +01:00
  • 32c7063cb0 Merge pull request #2061 from martin-frbg/martin-frbg-patch-1 Martin Kroeker 2019-03-30 21:21:38 +01:00
  • c19a449096 Merge pull request #2071 from martin-frbg/issue2068 Martin Kroeker 2019-03-30 14:54:28 +01:00
  • 3d1e36d4cb Build CBLAS interfaces for I?MIN and I?MAX Martin Kroeker 2019-03-30 12:38:41 +01:00
  • 4f9d3e4b28 Expose CBLAS interfaces for I?MIN and I?MAX Martin Kroeker 2019-03-30 12:37:13 +01:00
  • 4dec151d0b Merge pull request #2070 from quickwritereader/develop Martin Kroeker 2019-03-29 21:46:21 +01:00
  • 7c51cc8527 Merge branch 'develop' into develop Martin Kroeker 2019-03-29 19:36:29 +01:00
  • 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself AbdelRauf 2019-03-14 10:42:04 +00:00
  • 3ae122e2c7 Merge pull request #2069 from aixoss/aix-asm-change Martin Kroeker 2019-03-25 21:34:30 +01:00
  • b043a5962e AIX asm syntax changes needed for shared object creation Ayappan P 2019-03-25 18:53:25 +05:30
  • 8502030e5e Merge pull request #2064 from embray/cygwin/use-tls-thread-memory-cleanup Martin Kroeker 2019-03-19 22:12:51 +01:00
  • 8ba9e2a61a Also call CloseHandle on each thread, as well as on the event so as to not leak thread handles. Erik M. Bray 2019-03-19 10:22:02 +01:00
  • 4ad694eda1 Fix for #2063: The DllMain used in Cygwin did not run the thread memory pool cleanup upon THREAD_DETACH which is needed when compiled with USE_TLS=1. Erik M. Bray 2019-03-18 20:32:48 +01:00
  • dff4a197a5 Merge pull request #2058 from xsacha/patch-3 Martin Kroeker 2019-03-16 11:57:23 +01:00
  • a5425575b1 Merge pull request #2060 from embray/cygwin/readenv Martin Kroeker 2019-03-16 11:56:51 +01:00
  • 1006ff8a7b Use POSIX getenv on Cygwin Erik M. Bray 2019-03-15 15:06:30 +01:00
  • e608d4f7fe Disable the AVX512 DGEMM kernel (again) Martin Kroeker 2019-03-13 22:10:28 +01:00
  • 4fc17d0d75 Trivial typo fix Martin Kroeker 2019-03-13 19:20:23 +01:00
  • c3e30b2bc2 Change 64-bit detection as explained in #2056 Sacha 2019-03-13 23:21:54 +10:00
  • 03d7110900 Merge pull request #2042 from maomao194313/develop Martin Kroeker 2019-03-12 22:57:39 +01:00
  • 3ce28fb81a Merge pull request #2055 from martin-frbg/atomid Martin Kroeker 2019-03-12 22:57:07 +01:00
  • 04f2226ea6 Add Intel Denverton Martin Kroeker 2019-03-12 16:09:55 +01:00
  • b1393c7a97 Add Intel Denverton Martin Kroeker 2019-03-12 16:03:56 +01:00
  • 7e3eb9b25d make DYNAMIC_ARCH=1 package work on TSV110 maomao194313 2019-03-12 16:11:01 +08:00
  • f074d7d146 make DYNAMIC_ARCH=1 package work on TSV110. maomao194313 2019-03-12 16:05:19 +08:00
  • f18ab6c17b Merge pull request #2051 from martin-frbg/issue2048 Martin Kroeker 2019-03-09 16:39:35 +01:00
  • 946ec6c3b8 Merge pull request #2050 from kencu/PowerMacFix Martin Kroeker 2019-03-09 16:39:08 +01:00
  • 5b95534afc Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1 Martin Kroeker 2019-03-09 11:21:16 +01:00
  • f7a06463d9 common_power.h: force DCBT_ARG 0 on PPC970 Darwin ken-cunningham-webuse 2019-03-07 11:41:58 -08:00
  • b0c714ef60 param.h : enable defines for PPC970 on DarwinOS ken-cunningham-webuse 2019-03-07 11:36:35 -08:00
  • 8d3d29e4d7 Merge pull request #2049 from Celelibi/fix_crash_sgemm_sse_x64 Martin Kroeker 2019-03-07 19:28:06 +01:00
  • b7f59da42d Fix crash in sgemm SSE/nano kernel on x86_64 Celelibi 2019-03-07 16:39:41 +01:00
  • db3dc9e282 Merge pull request #2046 from kencu/powermac Martin Kroeker 2019-03-07 14:51:41 +01:00
  • 4290afdae2 ctest.c : add __POWERPC__ for PowerMac ken-cunningham-webuse 2019-03-06 20:55:06 -08:00
  • 4741ce803b Merge pull request #2045 from martin-frbg/2033-3 Martin Kroeker 2019-03-06 22:40:26 +01:00
  • 11cfd0bd75 Do not compile in AVX512 check if AVX support is disabled Martin Kroeker 2019-03-05 16:04:25 +01:00
  • 651ab01d2b Merge pull request #2044 from martin-frbg/issue2043 Martin Kroeker 2019-03-05 12:11:32 +01:00
  • d7b2c53c0b Merge pull request #2039 from brada4/meminit Martin Kroeker 2019-03-05 12:11:15 +01:00
  • e4864a8933 Fix module definition conflicts between LAPACK and ReLAPACK Martin Kroeker 2019-03-04 21:17:08 +01:00
  • 10d841d8b9 Merge pull request #2026 from martin-frbg/trmv_threads Martin Kroeker 2019-03-04 15:08:31 +01:00
  • 12f2b76748 Merge pull request #2038 from martin-frbg/issue2035 Martin Kroeker 2019-03-04 15:07:48 +01:00
  • 6c83b878f6 Merge pull request #2040 from martin-frbg/locks2002 Martin Kroeker 2019-03-04 15:07:14 +01:00
  • fb4dae7124 add TARGET support for HiSilicon tsv110 CPUs maomao194313 2019-03-04 16:48:49 +08:00
  • 760842dda1 add TARGET support for HiSilicon tsv110 CPUs maomao194313 2019-03-04 16:45:22 +08:00
  • 53f482ee72 add TARGET support for HiSilicon tsv110 CPUs maomao194313 2019-03-04 16:41:21 +08:00
  • 783ba8058f HiSilicon tsv110 CPUs optimization branch maomao194313 2019-03-04 16:30:50 +08:00
  • af480b02a4 Restore locking optimizations for OpenMP case Martin Kroeker 2019-03-03 14:17:07 +01:00
  • e4a79be6bb address warning introed with #1814 et al Andrew 2019-03-03 09:05:11 +02:00
  • e5c316c6b9 init Andrew 2019-03-03 08:59:27 +02:00
  • 25427926bc Improve handling of NO_STATIC and NO_SHARED Martin Kroeker 2019-03-02 23:36:36 +01:00
  • edb8143141 Merge pull request #2037 from martin-frbg/issue2033-2 Martin Kroeker 2019-03-01 11:45:02 +01:00
  • c4868d11c0 Make sure that AVX512 is disabled in 32bit builds Martin Kroeker 2019-03-01 09:23:03 +01:00
  • 4c321ae571 Merge pull request #2034 from martin-frbg/issue2033 Martin Kroeker 2019-02-28 22:10:12 +01:00
  • 2ffb727187 Keep xcode8.3 for osx BINARY=32 build Martin Kroeker 2019-02-28 10:51:54 +01:00
  • d66214c946 Make x86_32 imply NO_AVX2, NO_AVX512 in addition to NO_AVX Martin Kroeker 2019-02-28 09:58:25 +01:00
  • fd34820b99 Fix AVX512 test always returning false due to missing compiler option Martin Kroeker 2019-02-25 17:58:31 +01:00
  • 918a0cc4d1 Fix missing -c option in AVX512 test Martin Kroeker 2019-02-25 17:55:36 +01:00
  • 0db9c03e7e Merge pull request #2028 from brada4/mv Martin Kroeker 2019-02-24 19:50:23 +01:00
  • 6eee1beac5 move fix to right place Andrew 2019-02-24 20:41:02 +02:00
  • e5df5958cc init Andrew 2019-02-24 20:39:25 +02:00