Commit Graph

  • 93473174d6 Fix utest build with SolarisStudio compilers Martin Kroeker 2020-12-06 19:12:56 +01:00
  • b0b14f4e9b Change comments to C style for compatibility Martin Kroeker 2020-12-06 19:12:02 +01:00
  • 3a1b1b7c8c Fix complex ABI for 32bit SolarisStudio builds Martin Kroeker 2020-12-06 19:08:43 +01:00
  • da6d5d675c Fix hostarch detection for sparc Martin Kroeker 2020-12-06 19:07:45 +01:00
  • 04fa17322c Fix build options for SolarisStudio compilers Martin Kroeker 2020-12-06 19:05:27 +01:00
  • 3853014ea1 Merge pull request #1 from xianyi/develop Martin Kroeker 2020-12-06 18:52:51 +01:00
  • 65de6f5957 Fix test errors reported by cblas_cgemm & cblas_ctrmm Jin Bo 2020-12-05 15:06:12 +08:00
  • 213c0e7abb Added special unrolled vectorized versions of "Solve" for specific sizes, in DTRSM and STRSM, to improve performance in Power9 and Power10. Gordon Fossum 2020-12-04 17:07:06 -06:00
  • f21618684b Merge pull request #3018 from martin-frbg/issue3015 Martin Kroeker 2020-12-04 22:08:17 +01:00
  • 441c08c9ff Merge pull request #3016 from xiegengxin/complex-asum Martin Kroeker 2020-12-04 22:07:16 +01:00
  • 66302b3c06 Merge pull request #3013 from martin-frbg/gcc46 Martin Kroeker 2020-12-04 08:54:11 +01:00
  • 07e9a12349 Merge pull request #3011 from cyyever/fix_link Martin Kroeker 2020-12-04 08:50:59 +01:00
  • dd1adbdec4 Merge pull request #3019 from RajalakshmiSR/dgemm_param Martin Kroeker 2020-12-04 08:49:28 +01:00
  • a1eecccda2 Update f_check Martin Kroeker 2020-12-03 23:43:17 +01:00
  • 41fe6e864e POWER10: Update param.h Rajalakshmi Srinivasaraghavan 2020-12-03 14:40:11 -06:00
  • 74b5850581 Add libomp to the LAPACK(-test) dependencies in clang/gfortran builds Martin Kroeker 2020-12-03 21:28:10 +01:00
  • da0c94c76f Avoid linking both GNU libgomp and LLVM libomp in clang/gfortran builds Martin Kroeker 2020-12-03 21:25:57 +01:00
  • a6692dc129 use gfortran-10 with xcode 12 Martin Kroeker 2020-12-03 14:32:21 +01:00
  • 72a553f5bc Update .travis.yml Martin Kroeker 2020-12-03 09:17:27 +01:00
  • dcbb3b5ef1 fix misplaced lines Martin Kroeker 2020-12-02 23:13:13 +01:00
  • 57456c248b fix gfortran requirement in osx interface64 test Martin Kroeker 2020-12-02 15:56:21 +01:00
  • c361313564 Disable deprecated 32bit xcode Martin Kroeker 2020-12-02 07:49:43 +01:00
  • 0cb7a403b2 fix error declare function blas_level1_thread_with_return_value Gengxin Xie 2020-12-02 09:51:52 +08:00
  • 77a538d4ba Update an overlooked instance of xcode 10.0 as well Martin Kroeker 2020-12-01 22:05:35 +01:00
  • 9621062eba Update OSX xcode version to 11.5 Martin Kroeker 2020-12-01 12:23:30 +01:00
  • b766c1e9bb Improve the performance of zasum and casum with AVX512 intrinsic Gengxin Xie 2020-12-01 16:49:26 +08:00
  • 22574b474e Suppress -mfma as well for gcc 4.6 Martin Kroeker 2020-11-30 21:41:51 +01:00
  • f662022994 Move the version check to avoid overwriting unprocessed compiler data Martin Kroeker 2020-11-30 17:24:27 +01:00
  • 5e81e81478 Merge pull request #3014 from RajalakshmiSR/dgemvnp10 Martin Kroeker 2020-11-30 08:18:24 +01:00
  • 7d46e31de1 POWER10: Optimize dgemv_n Rajalakshmi Srinivasaraghavan 2020-11-29 15:28:28 -06:00
  • 62a2eb884f Add SSE flags for x86 Martin Kroeker 2020-11-29 15:33:07 +01:00
  • 2e99e2699b Add workaround for gcc 4.6 miscompiling assembly kernels with -mavx Martin Kroeker 2020-11-29 15:32:17 +01:00
  • 006b13299f Merge pull request #3012 from martin-frbg/restore-getarch Martin Kroeker 2020-11-29 13:27:47 +01:00
  • ca17d3dc3d Restore RISCV entries accidentally trashed by my PR 3005 Martin Kroeker 2020-11-29 13:19:51 +01:00
  • 52ed2741c5 Merge pull request #3010 from ggouaillardet/topic/fj_compilers Martin Kroeker 2020-11-29 11:36:43 +01:00
  • 3b4c016110 link math lib on FreeBSD cyy 2020-11-29 17:17:07 +08:00
  • 358100ec15 add Fujitsu compilers Gilles Gouaillardet 2020-11-29 13:57:57 +09:00
  • 3788b6d156 Merge pull request #3005 from martin-frbg/ssefix Martin Kroeker 2020-11-23 08:35:32 +01:00
  • bc5b1ddf0d Merge pull request #3004 from martin-frbg/bsd_getauxval Martin Kroeker 2020-11-23 08:35:12 +01:00
  • 2f42d23104 Merge pull request #3002 from martin-frbg/issue3000 Martin Kroeker 2020-11-22 22:51:26 +01:00
  • b72dd007dc Merge pull request #3001 from martin-frbg/issue2996 Martin Kroeker 2020-11-22 22:50:41 +01:00
  • 11ebe5fa25 Avoid redefinition warning Martin Kroeker 2020-11-22 21:16:07 +01:00
  • 01f01dae98 Add -msse if supported Martin Kroeker 2020-11-22 21:15:08 +01:00
  • e7bf8ced6c Build fix for systems that do not support getauxval Martin Kroeker 2020-11-22 20:20:28 +01:00
  • 0256294921 Fix syntax mixup Martin Kroeker 2020-11-22 17:41:44 +01:00
  • 2b114c3f30 Restore proper Makefile Martin Kroeker 2020-11-22 17:16:22 +01:00
  • 60e1fddca7 Ensure that the same (large) BUFFERSIZE is used for all cpus in DYNAMIC_ARCH builds Martin Kroeker 2020-11-22 16:48:22 +01:00
  • ebb8788696 Use ifneq instead of ifdef for CROSS option Martin Kroeker 2020-11-22 16:33:34 +01:00
  • 857afcc41d Use ifeq instead of ifdef for user-definable build options Martin Kroeker 2020-11-22 16:31:44 +01:00
  • 5fa305172a Use ifeq instead of ifdef for user-definable options Martin Kroeker 2020-11-22 16:29:56 +01:00
  • d3ff1f889f Convert ifndefs to ifneq Martin Kroeker 2020-11-22 16:27:17 +01:00
  • 65eb7afaf4 Change ifndef CROSS to ifneq Martin Kroeker 2020-11-22 16:25:36 +01:00
  • 8a6b17f97d Change ifndefs to ifneq Martin Kroeker 2020-11-22 16:19:31 +01:00
  • 0f863f96e4 Merge pull request #112 from xianyi/develop Martin Kroeker 2020-11-22 16:17:19 +01:00
  • 437702e0e1 Merge pull request #2965 from epsilon-0/develop Martin Kroeker 2020-11-22 12:25:33 +01:00
  • f1bf040b25 Merge pull request #2988 from xiegengxin/smp-asum Martin Kroeker 2020-11-22 12:24:13 +01:00
  • 613e3b2baf Merge pull request #2997 from Flamefire/reproduce_crash Martin Kroeker 2020-11-22 12:22:57 +01:00
  • 05a0ea2340 Merge branch 'risc-v' into develop Xianyi Zhang 2020-11-22 16:05:32 +08:00
  • 7037849498 Merge branch 'develop' into risc-v Xianyi Zhang 2020-11-22 16:04:50 +08:00
  • c6c9c24d1b Update doc for C910. Xianyi Zhang 2020-11-22 16:02:19 +08:00
  • 6dd71af0c3 Merge pull request #2995 from Flamefire/fix_thread_buffer_init Martin Kroeker 2020-11-20 09:42:10 +01:00
  • a05dc6e62b Add reproducer test for crash after fork Alexander Grund 2020-11-19 15:24:57 +01:00
  • 60005eb47b Don't overwrite blas_thread_buffer if already set Alexander Grund 2020-11-19 14:39:00 +01:00
  • 043f3d6faa POWER10: Use POWER9 as a fallback Anton Blanchard 2020-11-19 21:04:10 +11:00
  • fdf71d66b3 POWER10: Fix ld version detection Anton Blanchard 2020-11-19 20:50:42 +11:00
  • 7e9cb39a25 Merge pull request #2981 from Qiyu8/fix-sum Martin Kroeker 2020-11-16 08:40:46 +01:00
  • be075d53cf Merge pull request #2983 from Qiyu8/optimize-srot Martin Kroeker 2020-11-16 08:38:37 +01:00
  • b00a0de132 remove the -mfma flag in when the host has AVX. Qiyu8 2020-11-16 09:14:56 +08:00
  • d341a0fea0 Merge pull request #2989 from martin-frbg/cmake-fma Martin Kroeker 2020-11-13 12:35:09 +01:00
  • ec4d77c47c Add -mfma for HAVE_FMA3 in the non-DYNAMIC_ARCH case as well Martin Kroeker 2020-11-13 09:16:34 +01:00
  • 02699226d0 Merge pull request #111 from xianyi/develop Martin Kroeker 2020-11-13 09:14:23 +01:00
  • d6e7e05bb3 Improve the performance of dasum and sasum when SMP is defined Gengxin Xie 2020-11-13 14:20:52 +08:00
  • ae0b1dea19 modify system.cmake to enable fma flag Qiyu8 2020-11-13 10:20:24 +08:00
  • e0dac6b53b fix the CI failure of target specific option mismatch Qiyu8 2020-11-12 20:31:03 +08:00
  • e5c2ceb675 fix the CI failure of lack the head Qiyu8 2020-11-12 17:35:17 +08:00
  • a87e537b8c modify macro Qiyu8 2020-11-11 15:53:48 +08:00
  • 5bc0a7583f only FMA3 and vector larger than 128 have positive effects. Qiyu8 2020-11-11 15:18:01 +08:00
  • 8c0b206d4c Optimize the performance of rot by using universal intrinsics Qiyu8 2020-11-11 14:33:12 +08:00
  • c4c591ac5a fix sum optimize issues Qiyu8 2020-11-10 16:16:38 +08:00
  • 1ea6cfefdb Refs #2899. Merge branch 'damonyu1989-openblas-open-910' into risc-v Xianyi Zhang 2020-11-10 09:38:43 +08:00
  • fc35b72ae1 Refs #2899 Merge branch 'openblas-open-910' of git://github.com/damonyu1989/OpenBLAS into damonyu1989-openblas-open-910 Xianyi Zhang 2020-11-10 09:38:04 +08:00
  • 913cc9a4ca Merge branch 'develop' into risc-v Xianyi Zhang 2020-11-10 09:18:25 +08:00
  • ff16329cb7 Merge pull request #2972 from xiegengxin/rot-intrinsic Martin Kroeker 2020-11-08 22:43:00 +01:00
  • 433637ccd8 Merge pull request #2980 from martin-frbg/fixgetarch Martin Kroeker 2020-11-08 17:39:05 +01:00
  • ec088bf33a Fix missing AVX2 and FMA3 capabilities in FORCE_target mode Martin Kroeker 2020-11-08 13:15:40 +01:00
  • 110c7a6de0 Merge pull request #2979 from RajalakshmiSR/dot_power10 Martin Kroeker 2020-11-08 10:19:34 +01:00
  • d2faa1be4e Merge pull request #2978 from martin-frbg/fixdynfeatures Martin Kroeker 2020-11-08 10:19:17 +01:00
  • 1c4cfdc139 Stay compatible with old gmake that did not support undefine Martin Kroeker 2020-11-08 00:12:55 +01:00
  • f6a57d8f63 Update Makefile.system Martin Kroeker 2020-11-08 00:01:36 +01:00
  • f4b7ba12b7 Update Makefile.system Martin Kroeker 2020-11-07 23:37:21 +01:00
  • 6e364981a8 Optimize sdot/ddot for POWER10 Rajalakshmi Srinivasaraghavan 2020-11-07 15:21:58 -06:00
  • b976a0bf40 Remove previous workaround for compiler flags related to cpu capabilities in x86_64 DYNAMIC_ARCH builds Martin Kroeker 2020-11-07 20:39:56 +01:00
  • a04f532edf Reset cpu property flags between build cycles in DYNAMIC_ARCH mode Martin Kroeker 2020-11-07 20:37:03 +01:00
  • ccb9731c7b Fix propagation of cpu properties to compiler options Martin Kroeker 2020-11-07 20:30:15 +01:00
  • a29338aaa6 Remove extraneous quotes that caused a cmake policy warning Martin Kroeker 2020-11-07 20:27:42 +01:00
  • 438a8e5624 Fix placement of getarch call and spurious cpu property accumulation in DYNAMIC_ARCH builds Martin Kroeker 2020-11-07 20:26:12 +01:00
  • e5967810b7 Merge pull request #110 from xianyi/develop Martin Kroeker 2020-11-07 20:22:41 +01:00
  • ff74319ea5 Merge pull request #2977 from martin-frbg/issue2976 Martin Kroeker 2020-11-07 14:41:34 +01:00
  • 28d2dfe2b3 Fix macro name used in ifdef Martin Kroeker 2020-11-07 12:17:49 +01:00
  • 725ffbf041 fix typo Gengxin Xie 2020-11-05 16:25:17 +08:00