Commit Graph

  • a4ceb1ade9 Merge pull request #2737 from ashwinyes/add_thunderx3_target Martin Kroeker 2020-07-27 15:19:47 +02:00
  • 4e1be0e481 ARM64: Add THUNDERX3T110 Target Ashwin Sekhar T K 2020-06-11 04:12:49 -07:00
  • 49b83e00b7 Merge pull request #2735 from martin-frbg/move_potrf Martin Kroeker 2020-07-26 19:54:11 +02:00
  • 769ed9ffad Merge pull request #2734 from RajalakshmiSR/p10_fix Martin Kroeker 2020-07-25 09:02:32 +02:00
  • f194ad59e1 Use _Atomic instead of volatile where available (file moved from ../getrf) Martin Kroeker 2020-07-25 08:52:24 +02:00
  • 4fda217f99 Delete potrf_parallel.c (moving it to ../potrf) Martin Kroeker 2020-07-25 06:42:39 +00:00
  • 9be2688c78 Fix to store results in correct order for POWER10 GEMM kernels Rajalakshmi Srinivasaraghavan 2020-07-24 23:08:11 -05:00
  • 6a2a60038c Merge pull request #2720 from martin-frbg/issue2694 Martin Kroeker 2020-07-24 23:19:45 +02:00
  • 251a09ec90 Typo fix Martin Kroeker 2020-07-24 16:04:58 +00:00
  • 95d37e1575 Regroup the 32 and 64bit sections and restore 64bit CAXPY Martin Kroeker 2020-07-24 10:13:46 +00:00
  • 3523bb778e Merge pull request #2721 from martin-frbg/p8align Martin Kroeker 2020-07-24 11:06:20 +02:00
  • a50d0e29c8 Merge pull request #2731 from martin-frbg/pgippc Martin Kroeker 2020-07-24 11:05:16 +02:00
  • bf1f0734ff Use OPENBLAS_MAKE_COMPLEX_FLOAT on PPC only Martin Kroeker 2020-07-23 20:40:13 +00:00
  • ca3561cab9 Add ifdefs around call to altivec microkernel Martin Kroeker 2020-07-23 18:30:42 +00:00
  • 21072e502a Typo fix Martin Kroeker 2020-07-23 17:34:56 +00:00
  • 7c6e56b5df Rewrite assignment to complex for better portability Martin Kroeker 2020-07-23 17:10:59 +02:00
  • 661c6bfa5a Exclude altivec code paths if the compiler does not support them Martin Kroeker 2020-07-23 17:08:20 +02:00
  • 9796e552ea Avoid undefining NAME,CNAME etc for pgcc as it makes it ignore the new defininitions Martin Kroeker 2020-07-23 17:03:28 +02:00
  • d6b6e5ccd7 Merge pull request #73 from xianyi/develop Martin Kroeker 2020-07-23 16:59:06 +02:00
  • 349b722d8d Merge pull request #2729 from martin-frbg/issue2728 Martin Kroeker 2020-07-22 22:45:57 +02:00
  • 6c33764ca4 Unify BUFFER_SIZE settings for x86_64 again to fix potentially fatal mismatch in DYNAMIC_ARCH builds Martin Kroeker 2020-07-22 17:30:55 +00:00
  • d1b9613fd4 Merge pull request #2727 from wyphan/develop Martin Kroeker 2020-07-21 17:06:53 +02:00
  • 3cfc74b1a0 Merge pull request #2726 from martin-frbg/2725-2 Martin Kroeker 2020-07-21 16:42:06 +02:00
  • 9ae154ba89 Patch for building on Summit Wileam Phan 2020-07-20 23:30:28 -04:00
  • 9e21a100e3 Add trivial check for stdatomic.h Martin Kroeker 2020-07-20 22:52:09 +00:00
  • 31d30312dc Merge pull request #72 from xianyi/develop Martin Kroeker 2020-07-21 00:49:12 +02:00
  • fcfb7ffafb Merge pull request #2725 from martin-frbg/ccheck_c11 Martin Kroeker 2020-07-18 23:08:08 +02:00
  • bbe119ee3b Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:19:59 +00:00
  • f4f74941bd Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:14:50 +00:00
  • a36eb19ae0 Update conditional for C11 atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:13:24 +00:00
  • ce45af8151 Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:09:56 +00:00
  • 6f38de06d2 Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:09:01 +00:00
  • 09eb9d2584 Update conditional for atomics to HAVE_C11 Martin Kroeker 2020-07-18 17:07:38 +00:00
  • 791e046744 Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:05:59 +00:00
  • 94bab9d1f9 Update conditional for atomics to use HAVE_C11 Martin Kroeker 2020-07-18 17:03:31 +00:00
  • 97d6eb97b1 Report availability of C11 support Martin Kroeker 2020-07-18 16:59:33 +00:00
  • 4afd11dae5 Add a check for C11 atomics and stdatomic.h Martin Kroeker 2020-07-18 16:57:41 +00:00
  • 72ec6280c7 Merge pull request #2724 from martin-frbg/loongsonreadme Martin Kroeker 2020-07-18 18:08:40 +02:00
  • 26b7f24d16 Update cross-compiling example to reflect change in Loongson gcc Martin Kroeker 2020-07-18 12:51:37 +00:00
  • 0db4218fed Merge pull request #2722 from martin-frbg/cmakefcheck Martin Kroeker 2020-07-17 10:33:03 +02:00
  • 9d000ecaa2 include CheckLanguage module Martin Kroeker 2020-07-16 22:36:35 +00:00
  • a847d00366 handle missing lack of fortran compiler more gracefully Martin Kroeker 2020-07-16 22:17:39 +00:00
  • 0033f8be0d Use vec_vsx_ld/st to fix misaligned accesses flagged by asan Martin Kroeker 2020-07-16 23:32:54 +02:00
  • f308e741b2 remove debug output and revert changes to cdot and crot Martin Kroeker 2020-07-15 10:00:07 +02:00
  • 4f5d26bb02 Merge pull request #2716 from RajalakshmiSR/p10_ldflag Martin Kroeker 2020-07-15 01:20:54 +02:00
  • 417c4e8af8 Add new linker option for POWER10 Rajalakshmi Srinivasaraghavan 2020-07-14 11:54:04 -05:00
  • da17abec87 fix trailing whitespace Martin Kroeker 2020-07-14 18:20:03 +02:00
  • f8c2697701 Use POWER6 GEMM, TRMM and DTRSM on 32bit POWER8 Martin Kroeker 2020-07-14 18:11:19 +02:00
  • b144423f0f Do not define USE_TRMM for 32bit POWER8 Martin Kroeker 2020-07-14 18:10:12 +02:00
  • bd2498c886 Use POWER6 GEMM parameters on 32bit POWER8 Martin Kroeker 2020-07-14 18:07:58 +02:00
  • d8e2edfc20 Merge pull request #71 from xianyi/develop Martin Kroeker 2020-07-14 18:01:34 +02:00
  • 419b8686d1 Merge pull request #2682 from martin-frbg/aix Martin Kroeker 2020-07-13 14:43:24 +02:00
  • 3ab15ff34c Merge pull request #2651 from leezu/actionsflang Martin Kroeker 2020-07-13 13:00:39 +02:00
  • 8916c4ae2c Merge branch 'develop' into actionsflang Martin Kroeker 2020-07-12 20:37:29 +02:00
  • 4fa283de66 Merge pull request #2706 from jussienko/use-always-omp-threads Martin Kroeker 2020-07-12 20:17:11 +02:00
  • 5865c7d4d6 Make 32bit POWER8 use POWER6 kernels for now Martin Kroeker 2020-07-12 18:59:01 +02:00
  • ae3a90f78f merge overwritten part of power10 support Martin Kroeker 2020-07-12 18:51:58 +02:00
  • 009864edde Merge pull request #2710 from martin-frbg/cmake-lapacktest Martin Kroeker 2020-07-10 12:06:50 +02:00
  • 3de80b3f5a Merge pull request #2713 from RajalakshmiSR/p10-gcc10 Martin Kroeker 2020-07-10 10:43:33 +02:00
  • af1e140e35 Change minimum gcc version for POWER10 Rajalakshmi Srinivasaraghavan 2020-07-09 21:46:06 -05:00
  • d4a0299e16 Do not build lapack-test on MSVC for now (same as with BLAS test) Martin Kroeker 2020-07-09 13:57:27 +02:00
  • f766024749 enable fortran for cmake Martin Kroeker 2020-07-09 13:44:25 +02:00
  • c502760bef Modify for building with OpenBLAS Martin Kroeker 2020-07-09 13:13:16 +02:00
  • 29b5887d5f Modify for building with OpenBLAS Martin Kroeker 2020-07-09 13:12:35 +02:00
  • 60188a8c82 Append crude hack for enabling lapack tests in the OpenBLAS build Martin Kroeker 2020-07-09 11:44:31 +02:00
  • 1d63631afe Add lapack-test Martin Kroeker 2020-07-09 11:42:02 +02:00
  • e82bb953a7 Merge pull request #2708 from RajalakshmiSR/p10_future Martin Kroeker 2020-07-08 12:26:44 +02:00
  • ed7e155c35 Merge branch 'develop' into aix Martin Kroeker 2020-07-07 18:52:06 +02:00
  • 45d819ca82 Changing mcpu option as power10 Rajalakshmi Srinivasaraghavan 2020-07-07 11:25:20 -05:00
  • 8751a69271 Obtain actual cpu count on AIX and suppress spurious NO_AVX512 on non-x86 Martin Kroeker 2020-07-07 15:46:32 +02:00
  • 10a2923f64 fixes #2238 Jussi Enkovaara 2020-07-07 13:35:43 +03:00
  • 5ff83a4261 Merge pull request #2670 from mhillenibm/dumpfullversion_on_gcc7 Martin Kroeker 2020-07-07 00:12:28 +02:00
  • 5bc9680a86 Merge pull request #2703 from martin-frbg/issue2702 Martin Kroeker 2020-07-02 22:32:51 +02:00
  • 4ab3651591 Option -mavx2 requires at least gcc 4.7 Martin Kroeker 2020-07-02 17:00:15 +02:00
  • a83680b40b Merge pull request #69 from xianyi/develop Martin Kroeker 2020-07-02 16:56:00 +02:00
  • c3aa036e99 Merge pull request #2693 from EGuesnet/AIX-build-on-POWER8-32bits Martin Kroeker 2020-07-01 08:29:52 +02:00
  • 634e1305f9 Update cgemm_kernel_8x4_power8.S EGuesnet 2020-06-30 15:16:39 +02:00
  • c467516132 Merge pull request #2688 from martin-frbg/cometlake Martin Kroeker 2020-06-27 17:47:24 +02:00
  • 83f4746825 Add support for Comet Lake H and S Martin Kroeker 2020-06-27 14:41:24 +02:00
  • 584ef8d4ae Add support for Comet Lake H & S Martin Kroeker 2020-06-27 14:36:37 +02:00
  • 8dfda02e89 Merge pull request #68 from xianyi/develop Martin Kroeker 2020-06-27 14:29:29 +02:00
  • 28d69e0097 Merge pull request #2687 from martin-frbg/utfbom Martin Kroeker 2020-06-26 22:53:09 +02:00
  • c2467c9619 Merge pull request #2686 from RajalakshmiSR/p10_shgemm Martin Kroeker 2020-06-26 22:52:45 +02:00
  • f86e749df4 Merge pull request #2683 from mtreinish/add-comet-lake-support Martin Kroeker 2020-06-26 12:11:03 +02:00
  • d199c2787d Merge pull request #2680 from kavanabhat/aix_makefile_fix Martin Kroeker 2020-06-26 11:27:28 +02:00
  • e30ad0e521 Strip UTF8 byte order marker from source Martin Kroeker 2020-06-26 09:00:43 +02:00
  • d23419accc powerpc: Optimized SHGEMM kernel for POWER10 Rajalakshmi Srinivasaraghavan 2020-06-25 22:19:08 -05:00
  • 2f9c10810c Also set CPUTYPE in get_cpuname() Matthew Treinish 2020-06-25 15:53:56 -04:00
  • f37e941d52 Add support to driver/others/dynamic.c too Matthew Treinish 2020-06-25 11:56:49 -04:00
  • 2a91452bdd Add cpu detection support for comet lake U Matthew Treinish 2020-06-25 11:32:09 -04:00
  • c854ef5471 Fix variable names in conditional Martin Kroeker 2020-06-25 13:29:52 +02:00
  • c0afc11742 Fix POWERPC builds on AIX (gcc/gfortran 7) Martin Kroeker 2020-06-25 13:12:36 +02:00
  • c592f0f80a Fix utest build on AIX Martin Kroeker 2020-06-25 12:58:13 +02:00
  • 3f613b1301 Tentative changes for building on AIX Martin Kroeker 2020-06-25 12:57:00 +02:00
  • 72a0ec8e75 Fix reading of CPU name from prtconf output on AIX Martin Kroeker 2020-06-25 12:55:10 +02:00
  • 3446e58daf Fix handling of uname output on AIX Martin Kroeker 2020-06-25 12:31:35 +02:00
  • 4ca8becc4b Merge pull request #67 from xianyi/develop Martin Kroeker 2020-06-25 10:33:03 +02:00
  • dfe819f3bd Merge pull request #2679 from RajalakshmiSR/P10_GEMM Martin Kroeker 2020-06-25 08:31:38 +02:00
  • 4369e52555 Merge pull request #2677 from brada4/develop Martin Kroeker 2020-06-25 08:31:17 +02:00
  • bb2f52844b powerpc: Optimized ZGEMM kernel for POWER10 Gordon Fossum 2020-06-24 14:50:12 -05:00