Commit Graph

  • 7779bb6fb1
    Make IWORK array larger to avoid overflow Martin Kroeker 2023-09-14 20:21:06 +0200
  • 0606102460
    Merge pull request #4229 from martin-frbg/issue4228 Martin Kroeker 2023-09-14 16:15:54 +0200
  • fb97cc4d5e
    Add la_constants.o to SCLAUX/DZLAUX Martin Kroeker 2023-09-14 10:46:23 +0200
  • 6a611db560 memory: show correct number of max threads Tiziano Müller 2023-09-10 08:44:07 +0200
  • 6bc079687f
    Merge pull request #4218 from XiWeiGu/loongarch64_sgemv Martin Kroeker 2023-09-08 13:35:35 +0200
  • cd36b8fff7
    Merge pull request #4214 from martin-frbg/issue4212 Martin Kroeker 2023-09-05 20:43:44 +0200
  • 09911f077e
    Disable SVE targets for DYNAMIC_ARCH when compiling with (homebrew)gcc on macOS/arm64 Martin Kroeker 2023-09-05 16:33:40 +0200
  • c3f2a3c0ca
    Update version to 0.3.24.dev Martin Kroeker 2023-09-04 08:40:25 +0200
  • 4867cf5dd7
    Update version to 0.3.24.dev Martin Kroeker 2023-09-04 08:39:40 +0200
  • f2cf929374 LoongArch64: Add sgemv kernel gxw 2023-08-31 16:59:37 +0800
  • f29a0d1a7d
    Merge pull request #4211 from xianyi/release-0.3.0 Martin Kroeker 2023-09-03 23:25:58 +0200
  • 9f815cf1bf
    Update version to 0.3.24 v0.3.24 Martin Kroeker 2023-09-03 22:58:32 +0200
  • 3c49711f1e
    Update version to 0.3.24 Martin Kroeker 2023-09-03 22:57:22 +0200
  • 2c68822cde
    Merge pull request #4210 from xianyi/develop Martin Kroeker 2023-09-03 22:55:22 +0200
  • 3c51bd0fbf
    Merge pull request #4209 from martin-frbg/changelog0324 Martin Kroeker 2023-09-03 22:51:03 +0200
  • 5d73041068
    Update Changelog for 0.3.24 Martin Kroeker 2023-09-03 19:05:53 +0200
  • 8e6d93359d
    Merge pull request #4196 from TiborGY/obsolete_inlines Martin Kroeker 2023-09-03 14:12:42 +0200
  • 33797c44fc
    Merge pull request #4143 from martin-frbg/issue4130 Martin Kroeker 2023-09-01 14:20:25 +0200
  • ee310e3533
    Merge pull request #4208 from XiWeiGu/loongarch64_toolchain Martin Kroeker 2023-09-01 10:50:01 +0200
  • 42909ce57d
    Merge branch 'xianyi:develop' into issue4130 Martin Kroeker 2023-09-01 09:05:58 +0200
  • a2a184572c
    update zrotg Martin Kroeker 2023-08-31 23:42:12 +0200
  • 394a1fd1bf LoongArch64: Compatible with early internal toolchain gxw 2023-08-31 15:44:22 +0800
  • 12d8f219d6
    Merge pull request #4207 from martin-frbg/issue4174-2 Martin Kroeker 2023-08-26 12:05:37 +0200
  • 9c4ae4d4fb
    Merge pull request #4206 from martin-frbg/issue4201-2 Martin Kroeker 2023-08-26 10:17:27 +0200
  • 3bb70b8ca4
    Merge pull request #4205 from martin-frbg/fixintmain Martin Kroeker 2023-08-26 08:38:38 +0200
  • 3b6050ac04
    clarify the comment on the out-of-bounds check from #723 Martin Kroeker 2023-08-26 02:00:00 +0200
  • 22a402bc2c
    clarify the comment on the out-of-bounds check from #723 Martin Kroeker 2023-08-26 01:58:08 +0200
  • 88435104c8
    Merge pull request #4204 from martin-frbg/llvm17-2 Martin Kroeker 2023-08-26 00:32:18 +0200
  • fc8894dd98
    Workaround miscompilation by NVIDIA nvc Martin Kroeker 2023-08-26 00:30:17 +0200
  • be57c595aa
    Merge pull request #4203 from martin-frbg/issue4201 Martin Kroeker 2023-08-25 22:55:38 +0200
  • 7a6203ffa1
    restore default Neoverse SVE build instructions for non-NVIDIA compilers Martin Kroeker 2023-08-25 18:25:51 +0200
  • 7f7d3896dd
    Fix missing type declaration for main Martin Kroeker 2023-08-25 18:07:47 +0200
  • 2c3034ff7f
    Disable the C/ZASUM AVX512 microkernels when compiling with LLVM17 as well Martin Kroeker 2023-08-25 17:22:51 +0200
  • 49689fbef7
    Add support for compiling SVE kernels with the NVIDIA HPC compiler Martin Kroeker 2023-08-25 17:11:04 +0200
  • 8794544b43
    Add support for compiling the Neoverse SVE kernels with the NVIDIA HPC compiler Martin Kroeker 2023-08-25 16:47:32 +0200
  • e9f1b2d26f
    Expand the SVE compatibility check for the NVIDIA HPC compiler Martin Kroeker 2023-08-25 16:45:56 +0200
  • d69f57c8c2
    Merge pull request #4200 from XiWeiGu/loongarch64_sgemm Martin Kroeker 2023-08-23 13:05:34 +0200
  • 553cc1372f LoongArch64: Add sgemm_kernel gxw 2023-08-18 17:39:44 +0800
  • 12ede72ab7
    Merge pull request #4192 from imciner2/im/clangfix Martin Kroeker 2023-08-21 15:46:35 +0200
  • 76d675bd55
    Add NaN tests Martin Kroeker 2023-08-20 14:57:31 +0200
  • 3d10fb003e
    Add NaN tests Martin Kroeker 2023-08-19 12:20:42 +0200
  • 8d9f701fbf
    Merge pull request #4195 from TiborGY/BF16_ignore Martin Kroeker 2023-08-19 12:16:44 +0200
  • 7f67ba9147
    Merge pull request #4198 from martin-frbg/issue4197 Martin Kroeker 2023-08-19 07:51:51 +0200
  • 214be14c1d
    Correct INFO returned for lda in non-CBLAS s/dgeadd Martin Kroeker 2023-08-18 22:48:30 +0200
  • 1b09f4b2bb
    Merge pull request #4193 from imciner2/im/ppcgnu Martin Kroeker 2023-08-17 22:56:08 +0200
  • 79c15db348 Fix power10 gcc intrinsic check Ian McInerney 2023-08-14 21:36:35 +0100
  • b5ba95a6c0 Modernize obsolete inline order TGY 2023-08-16 00:48:40 +0200
  • 6da9baa55a upload some buildtests TiborGY 2023-08-16 00:24:26 +0200
  • 0d30daa772
    Add junk from BF16 test to .gitignore TiborGY 2023-08-16 00:07:17 +0200
  • 2e68d922d5
    Add NaN tests Martin Kroeker 2023-08-14 23:14:32 +0200
  • f986829698
    Add NaN tests Martin Kroeker 2023-08-14 23:13:46 +0200
  • dfacb63b2d
    Add NaN tests Martin Kroeker 2023-08-14 23:13:02 +0200
  • 3e87ac9a49
    Add tests for IAMAX with NaN values Martin Kroeker 2023-08-14 22:28:02 +0200
  • 9402651ef5
    Add NaN tests Martin Kroeker 2023-08-14 22:26:33 +0200
  • 4f21cdf688
    Add NaN tests Martin Kroeker 2023-08-14 22:25:50 +0200
  • 9a8d090eaa
    Add NaN tests Martin Kroeker 2023-08-14 22:25:03 +0200
  • 43f5e4251b
    Add NaN tests Martin Kroeker 2023-08-14 17:45:35 +0200
  • 8a8a8479be Fix cooperlake and sapphire rapids march flags on clang Ian McInerney 2023-08-14 15:41:28 +0100
  • 82827762c0
    Merge branch 'xianyi:develop' into nanobench Christopher Sidebottom 2023-08-14 15:45:22 +0100
  • 95ce0b0c48
    Add NaN tests Martin Kroeker 2023-08-13 23:45:36 +0200
  • 562ef5fdca
    Merge pull request #4169 from felixonmars/patch-1 Martin Kroeker 2023-08-12 17:20:56 +0200
  • 0e5d56ae4a
    Merge pull request #4170 from felixonmars/patch-2 Martin Kroeker 2023-08-12 09:21:05 +0200
  • ebc157fcc9
    Merge pull request #4190 from martin-frbg/issue4186-2 Martin Kroeker 2023-08-10 23:12:59 +0200
  • 34da1a067d
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 17:01:50 +0200
  • 07e32c4cb8
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 17:00:18 +0200
  • c211da0688
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:58:57 +0200
  • a34a0a7abc
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:56:52 +0200
  • 54d3246fc6
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:55:17 +0200
  • 7dd441d5db
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:53:33 +0200
  • f692178792
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:52:09 +0200
  • d15ffb7fdf
    Allow negative INCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:50:44 +0200
  • a2d867f4d1
    Allow negative iNCX (API change from version 3.10 of the reference implementation) Martin Kroeker 2023-08-10 16:49:05 +0200
  • 9a0e9c8b69
    Merge pull request #4171 from boomanaiden154/clang-libomp-fixes Martin Kroeker 2023-08-10 16:32:33 +0200
  • 7af0f41762
    Merge pull request #4189 from martin-frbg/issue4186 Martin Kroeker 2023-08-10 14:11:12 +0200
  • 4cc804c754
    Prepare for INCX < 0 in new NRM2 implementation from BLAS 3.10 Martin Kroeker 2023-08-09 16:13:23 +0200
  • 4d0f000db6 MIPS: Enable MSA gxw 2023-08-07 16:55:59 +0800
  • afdc56a421
    Merge pull request #4158 from XiWeiGu/loongarch64_update_dgemm_kernel Martin Kroeker 2023-08-07 12:44:09 +0200
  • 91e5513f3b
    Merge pull request #4184 from XiWeiGu/dgemv Martin Kroeker 2023-08-07 08:47:19 +0200
  • e8b571d245 LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2 gxw 2023-07-11 10:01:12 +0800
  • 71fcee6eef LoongArch64: Update dgemm kernel gxw 2023-06-29 11:11:08 +0800
  • 0f521ece25
    Merge pull request #4183 from martin-frbg/issue4181 Martin Kroeker 2023-08-06 18:59:50 +0200
  • 232420bdf5
    Merge pull request #4182 from xianyi/revert-4153-dgemv Martin Kroeker 2023-08-06 16:00:32 +0200
  • 41c31bc1d4
    Revert "LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S" revert-4153-dgemv Martin Kroeker 2023-08-06 16:00:03 +0200
  • 61d803547a
    Apply USE_TRMM to MIPS64_GENERIC as to GENERIC Martin Kroeker 2023-08-06 15:17:38 +0200
  • f8ee309402
    Merge pull request #4153 from XiWeiGu/dgemv Martin Kroeker 2023-08-06 08:49:16 +0200
  • 12e98482e9
    Merge pull request #4179 from martin-frbg/jenkinsfix Martin Kroeker 2023-08-05 22:47:26 +0200
  • 51c218d17a
    Update Jenkinsfile Martin Kroeker 2023-08-05 18:33:15 +0200
  • df978c90cd
    Update Jenkinsfile.pwr Martin Kroeker 2023-08-05 18:32:41 +0200
  • ef4a7e3fca
    Merge pull request #4127 from XiWeiGu/LoongArch64-CI Martin Kroeker 2023-08-05 18:19:47 +0200
  • b63e4581a3
    Merge pull request #4016 from mmuetzel/ci-msys2 Martin Kroeker 2023-08-05 15:59:34 +0200
  • 53378296c8 CI: Build with NO_AVX512 for the runners that use Flang 16. Markus Mützel 2023-08-05 13:47:38 +0200
  • 1c3fcaaf42 CI (MSYS2): Re-run failed tests verbosely. Markus Mützel 2023-04-24 18:32:03 +0200
  • f334bd9041 CI (MSYS2): Use LLVM Flang on CLANG64 runners. Add CLANG32 runner. Markus Mützel 2023-04-21 10:36:21 +0200
  • 57256623f4 fc.cmake: Add support for LLVM Flang. Markus Mützel 2023-04-21 10:20:54 +0200
  • ec1e96aac8 LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S gxw 2023-07-11 10:01:12 +0800
  • 96bf226bca gh-actions: Add loongarch64 CI gxw 2023-06-30 17:34:08 +0800
  • db9a42f8c3 LoongArch64: using getauxval to do runtime check gxw 2023-06-30 16:31:47 +0800
  • d46772e037 LoongArch64: Add compiler feature checks gxw 2023-06-30 16:19:38 +0800
  • 8a171350db
    Merge pull request #4178 from martin-frbg/llvm17 Martin Kroeker 2023-08-04 20:56:00 +0200
  • ef23240ab8
    Merge pull request #4177 from martin-frbg/issue4176 Martin Kroeker 2023-08-04 20:55:22 +0200