Commit Graph

  • 5fdf9ad24f Merge pull request #2228 from martin-frbg/issue2227 Travisppc Martin Kroeker 2019-08-19 18:26:51 +02:00
  • 2fe967c542 Merge branch 'develop' into issue2227 Martin Kroeker 2019-08-19 14:20:39 +02:00
  • 6d8595351c Add Intel Goldmont Plus CPUID Martin Kroeker 2019-08-19 14:19:21 +02:00
  • f40200f559 Merge pull request #2223 from martin-frbg/getarch-pgi Martin Kroeker 2019-08-16 12:21:30 +02:00
  • a95a5e52b8 Fix PGI compiler detection for getarch Martin Kroeker 2019-08-16 09:00:11 +02:00
  • e3d846ab57 Do not use -march=native with the PGI compiler Martin Kroeker 2019-08-16 08:58:10 +02:00
  • 8506386d82 Merge pull request #1 from xianyi/develop Martin Kroeker 2019-08-16 08:56:15 +02:00
  • 9ef96b32a6 Add multithreading support to the x86_64 zdot kernel (#2222) Martin Kroeker 2019-08-15 22:09:12 +02:00
  • b48c025974 Merge pull request #2218 from martin-frbg/issue2215 Martin Kroeker 2019-08-14 07:32:31 +02:00
  • a1fce67743 Make the new DGEMM regression test properly depend on CBLAS and LAPACKE Martin Kroeker 2019-08-13 22:29:48 +02:00
  • 103b32fdb7 Merge pull request #2216 from martin-frbg/issue2214 Martin Kroeker 2019-08-13 13:59:33 +02:00
  • aef9804089 Fix unwanted case-sensitivity in x86 LSAME for (AMD) processors without CMOV Martin Kroeker 2019-08-13 10:19:10 +02:00
  • 303869f572 Update with changes from 0.3.7 Martin Kroeker 2019-08-11 23:31:36 +02:00
  • 02d9203981 Increment version to 0.3.8.dev Martin Kroeker 2019-08-11 23:28:47 +02:00
  • 7b6808b69c Increment version to 0.3.8.dev Martin Kroeker 2019-08-11 23:28:13 +02:00
  • 5f36f18148 Update with 0.3.7 changes v0.3.7 Martin Kroeker 2019-08-11 23:23:27 +02:00
  • d47fe78b0e Set version to 0.3.7 Martin Kroeker 2019-08-11 23:16:45 +02:00
  • ebe2f47a0f Set version to 0.3.7 Martin Kroeker 2019-08-11 23:16:11 +02:00
  • 20d417762f Merge pull request #2213 from xianyi/develop Martin Kroeker 2019-08-11 23:14:49 +02:00
  • 321288597c Merge pull request #2212 from martin-frbg/nofort-nolib Martin Kroeker 2019-08-11 20:26:34 +02:00
  • be147a9f28 Avoid adding a spurious dependency on the fortran runtime despite NOFORTRAN=1 Martin Kroeker 2019-08-11 16:24:39 +02:00
  • c275290ea6 Merge pull request #2211 from martin-frbg/arm64_gcc_trivial Martin Kroeker 2019-08-11 16:08:05 +02:00
  • b7bbb02447 Silence two nuisance warnings from gcc Martin Kroeker 2019-08-11 12:46:05 +02:00
  • bf1430f7d7 Merge pull request #2208 from martin-frbg/munmap-debug Martin Kroeker 2019-08-09 07:55:35 +02:00
  • dccff2e785 Merge pull request #2206 from martin-frbg/zen-dtrmm Martin Kroeker 2019-08-09 07:55:20 +02:00
  • 5c3458a6e7 Merge pull request #2199 from martin-frbg/zen-dtrsm Martin Kroeker 2019-08-09 07:55:02 +02:00
  • 1776ad82c0 Add files via upload Martin Kroeker 2019-08-09 00:08:11 +02:00
  • 4e2f81cfa1 Provide more information on mmap/munmap failure Martin Kroeker 2019-08-08 23:15:35 +02:00
  • acf6002ab2 Replace most vpermpd calls in the Haswell DTRSM_RN kernel Martin Kroeker 2019-08-03 12:40:13 +02:00
  • 96a794e9fd Merge pull request #2198 from martin-frbg/icelake Martin Kroeker 2019-08-02 08:36:14 +02:00
  • 3d36c45116 Add CPUID identification of Intel Ice Lake Martin Kroeker 2019-08-01 22:52:35 +02:00
  • 648491e1aa Autodetect Intel Ice Lake (as SKYLAKEX target) Martin Kroeker 2019-08-01 22:51:09 +02:00
  • 2dfb804cb9 Replace vpermpd with vpermilpd in the Haswell DTRMM kernel Martin Kroeker 2019-07-28 23:17:28 +02:00
  • 4c153ec9da Merge pull request #2196 from wjc404/develop Martin Kroeker 2019-07-28 23:11:40 +02:00
  • 7eecd8e39c Add files via upload wjc404 2019-07-28 07:39:09 +08:00
  • f0406a7708 Merge pull request #2112 from ffontaine/develop Martin Kroeker 2019-07-27 13:00:13 +02:00
  • 561f3fd995 Merge pull request #2193 from martin-frbg/makeutest Martin Kroeker 2019-07-24 20:19:21 +02:00
  • 30efed14d1 Unset special make variables in ctest Makefile as well Martin Kroeker 2019-07-24 15:26:09 +02:00
  • af2e7f28fc Override special make variables Martin Kroeker 2019-07-23 16:56:40 +02:00
  • 4250e6ed64 Merge pull request #2191 from tylerjereddy/conditional_updates Martin Kroeker 2019-07-23 16:20:39 +02:00
  • 7b0b7c11d2 Merge pull request #2190 from martin-frbg/zdot-zen Martin Kroeker 2019-07-23 16:15:08 +02:00
  • d14cf1ccf4 Merge pull request #2189 from wjc404/develop Martin Kroeker 2019-07-23 08:32:56 +02:00
  • 3f6ab1582a MAINT: remove legacy CMake endif() Tyler Reddy 2019-07-22 21:24:57 -06:00
  • 28e96458e5 Replace vpermpd with vpermilpd Martin Kroeker 2019-07-22 08:28:16 +02:00
  • 95fb98f556 Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-21 01:10:32 +08:00
  • 4801c6d36b Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-21 00:47:45 +08:00
  • 9440fa607d Add files via upload wjc404 2019-07-20 22:08:22 +08:00
  • 94db259e5b Add files via upload wjc404 2019-07-20 22:04:41 +08:00
  • f49f8047ac Add files via upload wjc404 2019-07-20 14:33:37 +08:00
  • 825777faab Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-19 23:58:24 +08:00
  • 9c89757562 Add files via upload wjc404 2019-07-19 23:47:58 +08:00
  • b0b7600bef Merge pull request #2186 from wjc404/develop Martin Kroeker 2019-07-18 16:04:44 +02:00
  • 9b04baeaee Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 23:50:03 +08:00
  • 8a074b3965 Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 23:47:30 +08:00
  • 211ab03b14 Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 22:39:15 +08:00
  • 1733f927e6 Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 21:27:41 +08:00
  • 182b06d6ad Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 17:02:35 +08:00
  • 7a9050d681 Update dgemm_kernel_4x8_haswell.S wjc404 2019-07-17 00:55:06 +08:00
  • 0ba29fd262 Update dgemm_kernel_4x8_haswell.S for zen2 wjc404 2019-07-17 00:46:51 +08:00
  • bafa021ed6 Merge pull request #2181 from isuruf/install_name Martin Kroeker 2019-07-09 20:08:52 +02:00
  • b89d9762a2 Change install_name on osx to match linux Isuru Fernando 2019-07-08 17:13:21 -05:00
  • 08dedf4c5e Merge pull request #2177 from martin-frbg/noaff Martin Kroeker 2019-07-07 18:28:21 +02:00
  • b89c781637 Fix surprising behaviour of NO_AFFINITY=0 Martin Kroeker 2019-07-07 16:04:45 +02:00
  • dd7ff77f4b Merge pull request #2175 from martin-frbg/cmake-mingw-fixes Martin Kroeker 2019-07-06 18:07:19 +02:00
  • 8fb76134bc Mingw32 needs leading underscore on object names Martin Kroeker 2019-07-06 15:07:15 +02:00
  • 04d671aae2 Make disabling DYNAMIC_ARCH on unsupported systems work Martin Kroeker 2019-07-06 15:05:04 +02:00
  • f69a0be712 Add getarch flags to disable AVX on x86 Martin Kroeker 2019-07-06 15:02:39 +02:00
  • ae9e8b131e Add mingw builds to Appveyor config Martin Kroeker 2019-07-06 14:30:33 +02:00
  • 9086543f50 Utest needs CBLAS but not necessarily FORTRAN Martin Kroeker 2019-07-06 14:29:47 +02:00
  • abea977ded Merge pull request #2162 from martin-frbg/pgi Martin Kroeker 2019-07-03 19:16:30 +02:00
  • 6b6c9b1441 Merge pull request #2172 from quickwritereader/develop Martin Kroeker 2019-07-01 21:06:02 +02:00
  • a97b301aaa cgemm/ctrmm power9 AbdelRauf 2019-06-18 15:55:56 +00:00
  • 2f13f04224 Merge pull request #2170 from pkubaj/patch-1 Martin Kroeker 2019-06-30 23:29:02 +02:00
  • 7c7505a778 Fix build for PPC970 on FreeBSD pt.2 pkubaj 2019-06-28 10:31:45 +00:00
  • 5a4f1a2118 Fix build for PPC970 on FreeBSD pt. 1 pkubaj 2019-06-28 10:29:44 +00:00
  • 3b761892df Merge pull request #2169 from pkubaj/develop Martin Kroeker 2019-06-25 12:56:33 +02:00
  • eebfeba768 Fix build on FreeBSD/powerpc64. Piotr Kubaj 2019-06-25 10:58:56 +02:00
  • 7684c4f8f8 PGI compiler does not like -march=native Martin Kroeker 2019-06-20 19:56:01 +02:00
  • 7faf42b7bb Merge pull request #2167 from kavanabhat/dtrmm_power8_segfault Martin Kroeker 2019-06-19 14:38:01 +02:00
  • a575f1e4c7 Update dtrmm_kernel_16x4_power8.S kavanabhat 2019-06-19 15:27:14 +05:30
  • cdbfb891da new sgemm 8x16 AbdelRauf 2019-06-17 15:33:38 +00:00
  • 280552b988 Fix mov syntax Martin Kroeker 2019-06-16 18:35:43 +02:00
  • bbd4bb0154 Zero ecx with a mov instruction Martin Kroeker 2019-06-16 15:04:10 +02:00
  • 6d3efb2b58 Update Makefile.x86_64 Martin Kroeker 2019-06-14 08:08:11 +02:00
  • d9ff2cd90d Do not force gcc options on non-gcc compilers Martin Kroeker 2019-06-13 23:01:35 +02:00
  • 2a43062de7 Merge pull request #2159 from martin-frbg/issue2149 Martin Kroeker 2019-06-10 19:12:45 +02:00
  • 4ea794a522 Avoid unintentional activation of TLS code via USE_TLS=0 Martin Kroeker 2019-06-10 17:24:15 +02:00
  • ece0bfb881 Merge pull request #2158 from martin-frbg/issue2143 Martin Kroeker 2019-06-10 14:08:11 +02:00
  • 1f4b6a5d5d Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds Martin Kroeker 2019-06-10 09:50:13 +02:00
  • be8f70d269 Merge pull request #2157 from martin-frbg/2154-2 Martin Kroeker 2019-06-09 12:19:08 +02:00
  • e674e1c735 Update fc.cmake Martin Kroeker 2019-06-09 09:31:13 +02:00
  • 6ca898b63b Add gfortran workaround for potential ABI violation Martin Kroeker 2019-06-08 23:17:03 +02:00
  • 26411acd56 Merge pull request #2148 from TiborGY/cpp_thread_test_2 Martin Kroeker 2019-06-07 13:23:07 +02:00
  • 0ab4076dd8 Merge pull request #2156 from martin-frbg/issue2154 Martin Kroeker 2019-06-06 13:43:12 +02:00
  • a0caa762b3 Add gfortran workaround for ABI violations Martin Kroeker 2019-06-06 10:24:16 +02:00
  • 900d5a3205 Add gfortran workaround for ABI violations in LAPACKE Martin Kroeker 2019-06-06 10:18:40 +02:00
  • a17cf36225 Merge pull request #2153 from quickwritereader/develop Martin Kroeker 2019-06-06 07:42:56 +02:00
  • 148c4cc5fd conflict resolve AbdelRauf 2019-06-05 20:50:50 +00:00
  • d0c3543c3f power9 zgemm ztrmm optimized AbdelRauf 2019-06-05 10:30:57 +00:00
  • 909ad04aef Merge pull request #2145 from martin-frbg/1912-3 Martin Kroeker 2019-06-05 20:27:45 +02:00