Commit Graph

  • 865829cfac Merge pull request #3121 from RajalakshmiSR/mmarename Martin Kroeker 2021-02-27 19:15:49 +01:00
  • 0571c3187b POWER10: Rename mma builtins Rajalakshmi Srinivasaraghavan 2021-02-26 20:56:34 -06:00
  • d12a2d0d04 Merge pull request #3120 from martin-frbg/3118-x Martin Kroeker 2021-02-26 11:50:47 +01:00
  • 2d369bd916 fix undefined CC variable Martin Kroeker 2021-02-26 09:09:43 +01:00
  • 93843c55b6 Merge pull request #15 from xianyi/develop Martin Kroeker 2021-02-26 09:06:25 +01:00
  • e3a6132e12 Merge pull request #3119 from xianyi/revert-3118-issue3018-2 Martin Kroeker 2021-02-26 04:18:33 +01:00
  • 736f0146c3 Revert "Fix undefined CC in f_check (again)" revert-3118-issue3018-2 Martin Kroeker 2021-02-26 04:18:04 +01:00
  • 897fc2b6ef Merge pull request #3118 from martin-frbg/issue3018-2 Martin Kroeker 2021-02-25 13:48:41 +01:00
  • 441c116105 fix undefined CC again Martin Kroeker 2021-02-25 13:47:34 +01:00
  • 8ecd80a34a Merge pull request #14 from xianyi/develop Martin Kroeker 2021-02-25 13:45:27 +01:00
  • 4ba53db0da Merge pull request #3117 from haampie/fix-perl Martin Kroeker 2021-02-24 18:39:28 +01:00
  • 6c365ff648 Merge pull request #3114 from martin-frbg/issue3113 Martin Kroeker 2021-02-24 18:38:25 +01:00
  • e33bcdbb7b Merge pull request #3115 from martin-frbg/issue2532 Martin Kroeker 2021-02-24 18:37:36 +01:00
  • ec6b354c32 use /usr/bin/env perl Harmen Stoppels 2021-02-24 14:07:20 +01:00
  • 292d1af1a0 Update omatcopy_rt.c Martin Kroeker 2021-02-24 09:34:14 +01:00
  • 325b398e3c Update omatcopy_rt.c Martin Kroeker 2021-02-24 09:13:12 +01:00
  • 6f5667b4d4 Enable optimized S/D OMATCOPY_RT Martin Kroeker 2021-02-24 09:03:41 +01:00
  • cceeee7806 Add optimized omatcopy_rt Martin Kroeker 2021-02-24 09:00:54 +01:00
  • 0a4546b742 Typo fix Martin Kroeker 2021-02-23 13:14:35 +01:00
  • b1eed27a54 Replace naive omatcopy_rt with 4x4 blocked implementation Martin Kroeker 2021-02-22 21:35:42 +01:00
  • 1a3ad4b670 Fix signatures of the TLS-mode dll_callback and p_process_term functions for Win64 Martin Kroeker 2021-02-22 19:40:36 +01:00
  • 86a5f98e4a Merge pull request #13 from xianyi/develop Martin Kroeker 2021-02-22 19:31:41 +01:00
  • 1caa44bea9 Merge pull request #3111 from hawkinsp/forkrace Martin Kroeker 2021-02-19 09:57:18 +01:00
  • dbbf92c1d1 Fix race in blas_thread_shutdown. Peter Hawkins 2021-02-18 13:46:50 -05:00
  • cb429d6b12 Merge pull request #3110 from martin-frbg/issue3108 Martin Kroeker 2021-02-18 15:45:25 +01:00
  • b0bded3f2f Fix get_num_procs() in the USE_TLS branch for non-glibc systems Martin Kroeker 2021-02-18 11:14:05 +01:00
  • f9aaf22fc3 Merge pull request #3105 from martin-frbg/tigerlake Martin Kroeker 2021-02-12 13:29:53 +01:00
  • 35ff3c731d Merge pull request #3106 from RajalakshmiSR/ppcbe Martin Kroeker 2021-02-12 13:29:23 +01:00
  • 63fa6c832e Fix build issue on POWER8 with DYNAMIC_ARCH Rajalakshmi Srinivasaraghavan 2021-02-11 21:28:03 -06:00
  • e4e5042e38 Recognize Intel Tiger Lake as SkylakeX Martin Kroeker 2021-02-11 20:17:11 +01:00
  • ae53e3e233 Recognize Intel Tiger Lake as SkylakeX Martin Kroeker 2021-02-11 20:16:27 +01:00
  • 074d9bff7f Merge pull request #3104 from martin-frbg/issue3103 Martin Kroeker 2021-02-11 15:42:47 +01:00
  • f36862603a Merge pull request #3101 from jake-arkinstall/issue-3100 Martin Kroeker 2021-02-11 15:42:18 +01:00
  • 47691c031f Use Haswell optimizations for Zen as well Martin Kroeker 2021-02-11 09:26:15 +01:00
  • ce7ddd8921 Use Haswell optimizations for Zen as well Martin Kroeker 2021-02-11 09:25:36 +01:00
  • 950c047b49 Use Haswell optimizations for Zen as well Martin Kroeker 2021-02-11 09:24:51 +01:00
  • 46509953a9 Use Haswell optimizations for Zen as well Martin Kroeker 2021-02-11 09:24:16 +01:00
  • db348dcff2 Enable optimized srot/drot kernels from Haswell Martin Kroeker 2021-02-11 09:23:05 +01:00
  • a33f471065 Merge pull request #3102 from martin-frbg/issue3099 Martin Kroeker 2021-02-11 08:56:46 +01:00
  • ece3ce581e Strip parenthesized (pkgversion) data from GCC version string to avoid misinterpretation Martin Kroeker 2021-02-10 14:22:59 +01:00
  • 8189a98d85 Merge pull request #12 from xianyi/develop Martin Kroeker 2021-02-10 14:17:24 +01:00
  • d7a77091a3 Addressed issue #3100, removing an unnecessary write to the include directory Jake Arkinstall 2021-02-10 12:11:17 +00:00
  • 3e1e74fca6 Merge pull request #3094 from xoviat/patch-1 Martin Kroeker 2021-02-02 13:36:17 +01:00
  • 33b5670122 Merge pull request #3096 from martin-frbg/fixclangcmake Martin Kroeker 2021-02-02 13:33:15 +01:00
  • 95e19e2e23 fix case in compiler name check Martin Kroeker 2021-02-02 10:53:46 +01:00
  • 99ac042702 remove spurious lines (probably editor malfunction) Martin Kroeker 2021-02-01 21:02:53 +01:00
  • 774b9f8653 handle AppleClang in Cooperlake support condition Martin Kroeker 2021-02-01 20:18:53 +01:00
  • eb1d2344f7 Fix compiler version check for Intel Cooperlake support (clang-cl does not accept -dumpversion) Martin Kroeker 2021-02-01 19:45:25 +01:00
  • 6fa9860dbe appveyor: cleanup and add openmp run xoviat 2021-01-30 21:28:12 -06:00
  • 0cc36770f1 Merge pull request #3073 from xoviat/embedded Martin Kroeker 2021-01-31 18:02:41 +01:00
  • 558cd543bf Merge pull request #3093 from martin-frbg/fix3064 Martin Kroeker 2021-01-30 22:21:28 +01:00
  • bd906e3410 fix copy-paste error in build rules for cblas_crotg and cblas_zrotg Martin Kroeker 2021-01-30 16:46:25 +01:00
  • 35086cb501 Merge pull request #3092 from RajalakshmiSR/cscal_p10 Martin Kroeker 2021-01-30 16:23:37 +01:00
  • 2056ffc227 Optimize cscal function for POWER10 Rajalakshmi Srinivasaraghavan 2021-01-29 13:51:43 -06:00
  • 7745439312 Merge pull request #3091 from martin-frbg/lapack477-2 Martin Kroeker 2021-01-29 13:37:23 +01:00
  • c4b5abbe43 fix data type Martin Kroeker 2021-01-29 10:45:36 +01:00
  • f87842483e fix calculation of non-exceptional shift (from Reference-LAPACK PR 477) Martin Kroeker 2021-01-29 09:56:12 +01:00
  • 3dbb32c734 Merge pull request #11 from xianyi/develop Martin Kroeker 2021-01-29 09:52:21 +01:00
  • 609ea80276 enable testing xoviat 2021-01-27 16:39:52 -06:00
  • 3dfecaaf7c require nofortran to be set on msvc xoviat 2021-01-27 16:39:15 -06:00
  • 3165c915b6 fix test helpers xoviat 2021-01-27 15:24:49 -06:00
  • 457ccc42c9 Merge branch 'develop' into msvc xoviat 2021-01-27 14:15:59 -06:00
  • 00880c720a Merge pull request #3087 from martin-frbg/lapack477 Martin Kroeker 2021-01-27 19:11:55 +01:00
  • 856bc36533 Add exceptional shift to fix rare convergence problems Martin Kroeker 2021-01-27 13:41:45 +01:00
  • fe71887b68 Merge pull request #10 from xianyi/develop Martin Kroeker 2021-01-27 13:39:26 +01:00
  • 10094bd885 Merge pull request #3076 from martin-frbg/dyn-thunderx Martin Kroeker 2021-01-27 13:25:45 +01:00
  • eea0c0f2ed Merge pull request #3085 from alexhenrie/memory_alloc Martin Kroeker 2021-01-26 20:11:42 +01:00
  • 85be43e0df Merge pull request #3083 from martin-frbg/develop Martin Kroeker 2021-01-26 15:13:35 +01:00
  • 0cb9e9fc8d Remove the VORTEX support bits again for now Martin Kroeker 2021-01-25 19:02:21 +01:00
  • cb61d3b46b Add DYNAMIC_LIST support for ARM64 Martin Kroeker 2021-01-25 13:13:20 +01:00
  • 113840da12 Fix null pointer check in blas_memory_alloc Alex Henrie 2021-01-24 22:20:44 -07:00
  • deb2e66bcc Add DYNAMIC_LIST support for ARM64 Martin Kroeker 2021-01-24 23:18:52 +01:00
  • 9b2d69aa80 Add DYNAMIC_LIST option for ARM64 Martin Kroeker 2021-01-24 23:18:01 +01:00
  • e3ff4cdd23 Merge pull request #9 from xianyi/develop Martin Kroeker 2021-01-24 23:14:45 +01:00
  • 0745ba43a4 Merge pull request #3082 from RajalakshmiSR/scalp10 Martin Kroeker 2021-01-24 19:03:40 +01:00
  • 3ede843d50 Optimize s/dscal function for POWER10 Rajalakshmi Srinivasaraghavan 2021-01-24 07:48:28 -06:00
  • 2e8d6e8690 add functions for embedded xoviat 2021-01-23 22:12:17 -06:00
  • 69a5558203 Merge pull request #3059 from Guobing-Chen/BF16_gemm Martin Kroeker 2021-01-23 19:08:05 +01:00
  • d6905403e3 Merge pull request #3068 from alexhenrie/scan-build Martin Kroeker 2021-01-23 19:06:29 +01:00
  • 411926b572 Merge pull request #3079 from RajalakshmiSR/rotp10 Martin Kroeker 2021-01-22 08:26:00 +01:00
  • 439b93f6d2 Optimize s/drot function for POWER10 Rajalakshmi Srinivasaraghavan 2021-01-21 13:24:45 -06:00
  • d6cf67778c Merge pull request #3075 from martin-frbg/issue3074 Martin Kroeker 2021-01-21 08:51:30 +01:00
  • b94dab5250 patch to support power10 in builtin_cpu_is was backported to gcc 10.2, so allow that as wel Martin Kroeker 2021-01-20 21:34:36 +01:00
  • 6178974cd9 Update .drone.yml Martin Kroeker 2021-01-20 20:21:27 +01:00
  • 0b9e4d1278 Add gcc10/arm64 DYNAMIC_ARCH build Martin Kroeker 2021-01-20 18:30:05 +01:00
  • 63fa3c3f8f Require gcc 11 for builtin_cpu_is(power10) Martin Kroeker 2021-01-20 15:41:04 +01:00
  • 3612d9a57a Merge pull request #8 from xianyi/develop Martin Kroeker 2021-01-20 15:38:30 +01:00
  • b60de4447a add cortex-m platform xoviat 2021-01-19 08:57:44 -06:00
  • 16dddb760e Merge pull request #3070 from RajalakshmiSR/cdot Martin Kroeker 2021-01-16 15:47:34 +01:00
  • eff7c9166e Optimize cdot function for POWER10 Rajalakshmi Srinivasaraghavan 2021-01-15 13:40:34 -06:00
  • f1bf2603e6 Remove dead assignment to dflag in rotmg functions Alex Henrie 2021-01-14 19:40:32 -07:00
  • 6f32991eae Don't define the mode variable when not needed in gemm functions Alex Henrie 2021-01-14 19:40:31 -07:00
  • 202fc9e8ed Fix uninitialized argument value in dasum_k Alex Henrie 2021-01-14 19:40:31 -07:00
  • e378b24487 Merge pull request #3067 from albertziegenhagel/fix-generic-cmake Martin Kroeker 2021-01-14 21:35:19 +01:00
  • 3628b22d49 Merge pull request #3064 from martin-frbg/issue3063 Martin Kroeker 2021-01-14 16:47:59 +01:00
  • af2b0d0205 Merge pull request #3066 from martin-frbg/buffsizefix Martin Kroeker 2021-01-14 16:00:38 +01:00
  • 4bf988959a Merge pull request #3062 from austinpagan/GemmPreferedSize3 Martin Kroeker 2021-01-14 15:59:53 +01:00
  • a0e4fb3a28 Merge pull request #3061 from martin-frbg/arm64-pgi Martin Kroeker 2021-01-14 15:59:21 +01:00
  • 2c445be8ba Merge pull request #3051 from martin-frbg/rocketlake Martin Kroeker 2021-01-14 15:56:25 +01:00
  • e3f4063683 Fix building "generic" TRMM kernel with CMake Albert Ziegenhagel 2021-01-14 10:00:49 +01:00