Commit Graph

  • c8f53b85ce Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV Chip Kerchner 2024-10-11 11:10:20 -0500
  • bd6f77e3ce deploy: 18a23c23f7 martin-frbg 2024-10-11 06:54:36 +0000
  • 18a23c23f7
    Merge pull request #4929 from martin-frbg/issue4905 Martin Kroeker 2024-10-11 08:54:02 +0200
  • 5a79446bdb
    Merge pull request #4918 from HaoZeke/testFixes Martin Kroeker 2024-10-10 21:53:18 +0200
  • 7ba6591ff2
    Merge branch 'OpenMathLib:develop' into issue4905 Martin Kroeker 2024-10-10 21:50:38 +0200
  • 550bc77832
    Fix expectation values for CblasRowMajor order Martin Kroeker 2024-10-10 20:39:29 +0200
  • 48ad5bea0f deploy: e0ad20f72b martin-frbg 2024-10-10 14:18:38 +0000
  • e0ad20f72b
    Merge pull request #4932 from martin-frbg/cirrusosxndk Martin Kroeker 2024-10-10 16:18:07 +0200
  • e4bc5e4718
    remove stray quote Martin Kroeker 2024-10-10 11:02:56 +0200
  • b89fb9632f
    Update Android NDK install path for M1/armv7 crossbuild Martin Kroeker 2024-10-10 10:19:11 +0200
  • d9518a082d
    Update .cirrus.yml Martin Kroeker 2024-10-10 09:32:09 +0200
  • 298af5ecab
    Update .cirrus.yml Martin Kroeker 2024-10-09 23:08:19 +0200
  • 5a89a0b5e9
    Update .cirrus.yml Martin Kroeker 2024-10-09 22:48:50 +0200
  • 54e8dc92f7
    Update .cirrus.yml Martin Kroeker 2024-10-09 22:14:00 +0200
  • 9f7412595b
    Update .cirrus.yml Martin Kroeker 2024-10-09 22:08:31 +0200
  • 4a59b54ad3
    Update .cirrus.yml Martin Kroeker 2024-10-09 21:47:17 +0200
  • ba1d40483a
    Update .cirrus.yml Martin Kroeker 2024-10-09 21:20:07 +0200
  • e52d9b4cf1
    Merge pull request #4928 from austinpagan/czgemm_in_c Martin Kroeker 2024-10-09 20:26:21 +0200
  • dbd83762f9
    Merge pull request #4926 from NickelWenzel/fix_arm64_windows_and_uwp Martin Kroeker 2024-10-09 19:48:16 +0200
  • 9762464718
    Fix CBLAS interface filling in the wrong triangle for Row-Major Martin Kroeker 2024-10-09 18:06:39 +0200
  • d71c9b8bf5
    Exclude the WhereAmI function when compiling with MSVC Martin Kroeker 2024-10-09 17:21:52 +0200
  • 0b7fb5c791 CGEMM & ZGEMM using C code. Gordon Fossum 2024-10-09 09:42:23 -0500
  • bee123e8e3
    fix: add missing NO_AFFINITY checks NickelWenzel 2024-10-09 16:36:40 +0200
  • 1d2af7ab1a deploy: 7ac5b9011f martin-frbg 2024-10-09 14:19:22 +0000
  • 7ac5b9011f
    Merge pull request #4923 from martin-frbg/zen5 Martin Kroeker 2024-10-09 16:18:47 +0200
  • 3ab8b1408e LoongArch64: Update README.md gxw 2024-10-08 21:08:09 +0800
  • 2c3b87a082
    Add preliminary cpu autodetection for Zen5/5c Martin Kroeker 2024-10-08 23:07:42 +0200
  • 73c1882129
    Merge pull request #4922 from martin-frbg/issue4904-2 Martin Kroeker 2024-10-07 13:24:14 +0200
  • 522ee5f0ea deploy: bc0691a556 martin-frbg 2024-10-07 06:26:38 +0000
  • bc0691a556
    Merge pull request #4920 from martin-frbg/issue4917 Martin Kroeker 2024-10-07 08:26:03 +0200
  • b0346e72f4
    update names of loongarch64 targets for cross-compilation Martin Kroeker 2024-10-06 22:48:33 +0200
  • 9c707dc6b9
    Update dynamic arch list to new target scheme Martin Kroeker 2024-10-06 22:46:03 +0200
  • 9783dd07ab
    Rename KERNEL.LOONGSONGENERIC to KERNEL.LA64_GENERIC Martin Kroeker 2024-10-06 22:43:11 +0200
  • dda8b0427a deploy: 0dfe42d62a martin-frbg 2024-10-06 20:29:58 +0000
  • 0dfe42d62a
    Merge pull request #4919 from martin-frbg/issue4916-2 Martin Kroeker 2024-10-06 22:29:28 +0200
  • d6bb8dcfd1 Common code. Chip Kerchner 2024-10-06 14:13:43 -0500
  • 8a1710dd0d
    don't apply switch_ratio to tail of loop Martin Kroeker 2024-10-06 20:03:32 +0200
  • c9e92348a6
    Handle inf/nan if dummy2 flag is set Martin Kroeker 2024-10-06 19:57:17 +0200
  • d9f368dfe6
    TST: Signal abort for ctest failures correctly Rohit Goswami 2024-07-29 03:51:21 +0000
  • 722e4ae07a
    MAINT: Explicitly replace instead of unknown Rohit Goswami 2024-07-30 15:24:23 +0000
  • a6b7751881
    BUG: Allow tests to be run multiple times Rohit Goswami 2024-07-30 15:14:05 +0000
  • 9ac0fb0111 Merge branch 'develop' into vectorizeBF16GEMV Chip Kerchner 2024-10-04 06:49:53 -0500
  • 624e9d110e
    Merge pull request #4916 from martin-frbg/issue4901 Martin Kroeker 2024-10-03 23:25:45 +0200
  • d714013ab9
    change sgemm kernel to 4x4 as the 16x4 altivec goes out of bounds Martin Kroeker 2024-10-03 22:04:20 +0200
  • 7c4f3638fd
    switch PPCG4 SGEMM kernel to 4x4 Martin Kroeker 2024-10-03 22:00:15 +0200
  • 915a6d6e44 Add casting. Chip Kerchner 2024-10-03 14:08:21 -0500
  • 7ec3c16d82 Remove beta from optimized functions. Chip Kerchner 2024-10-03 13:27:33 -0500
  • 54afc24e4d
    Merge pull request #4906 from XiWeiGu/arm64_cmake_small_matrix_opt Martin Kroeker 2024-10-03 20:05:11 +0200
  • b4495a8fb8
    Merge branch 'develop' into arm64_cmake_small_matrix_opt Martin Kroeker 2024-10-03 20:04:52 +0200
  • 68eefe60b9
    Merge pull request #4915 from martin-frbg/issue4907 Martin Kroeker 2024-10-03 18:29:29 +0200
  • 4f00f02567
    Do not add -mabi flags for Loongson when the compiler is flang Martin Kroeker 2024-10-03 16:06:33 +0200
  • f817f26062
    Add simpler EPILOGUE for clang Martin Kroeker 2024-10-03 16:01:10 +0200
  • a492181665
    filter out Loongarch -mabi options for flang-new Martin Kroeker 2024-10-03 15:58:47 +0200
  • f9249718ce deploy: de421b7764 martin-frbg 2024-10-03 13:54:31 +0000
  • de421b7764
    Merge pull request #4904 from XiWeiGu/la64_cross_cmake Martin Kroeker 2024-10-03 15:53:57 +0200
  • af451529c1
    Update CONTRIBUTORS.md garadeaniket 2024-10-03 18:05:05 +0530
  • a56f7305a8
    Update CONTRIBUTORS.md garadeaniket 2024-10-03 17:51:56 +0530
  • 3dd427e9db
    Update KERNEL.ARMV8SVE garadeaniket 2024-10-03 17:40:18 +0530
  • 971d395a5d
    NEW SVE BLAS FILES ADDED garadeaniket 2024-10-03 17:36:45 +0530
  • c0e44259ee deploy: edaf5933c4 martin-frbg 2024-10-02 21:37:44 +0000
  • edaf5933c4
    Merge pull request #4913 from martin-frbg/issue4912 Martin Kroeker 2024-10-02 23:37:15 +0200
  • 71131406ae
    Declare the input array in CBLAS_?GEADD as const Martin Kroeker 2024-10-02 18:32:48 +0200
  • 7cc00f68c9 Remove more duplicate. Chip Kerchner 2024-10-01 11:23:32 -0500
  • e238a68c03 Remove duplicate. Chip Kerchner 2024-10-01 11:06:23 -0500
  • f10d47c4bb
    Merge pull request #4910 from martin-frbg/issue4908 Martin Kroeker 2024-10-01 17:49:12 +0200
  • 32095b0cbb Remove parameter. Chip Kerchner 2024-10-01 09:32:42 -0500
  • dd3cb940bd deploy: a1073f5eed martin-frbg 2024-10-01 13:29:53 +0000
  • a1073f5eed
    Merge pull request #4900 from XiWeiGu/la64_core_rename Martin Kroeker 2024-10-01 15:29:16 +0200
  • fa77561396
    add openmp option to pkgconfig template Martin Kroeker 2024-10-01 13:32:45 +0200
  • 176107d23a
    Add -fopenmp to cflags in pkgconfig file if set Martin Kroeker 2024-10-01 13:31:14 +0200
  • 0228d36211
    move -fopenmp to CFLAGS Martin Kroeker 2024-09-30 21:38:05 +0200
  • 7087b0a7d0 ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake gxw 2024-09-29 10:31:26 +0800
  • 30af9278dc LoongArch64: Enable cmake cross-compilation gxw 2024-09-26 16:55:06 +0800
  • 48698b2b1d LoongArch64: Rename core gxw 2024-09-18 17:20:43 +0800
  • e8d27ffc5a
    Merge f74805609e into 92f7a2dc3e TiborGY 2024-09-29 02:56:42 +0530
  • a2c46dae26
    Merge 8847cbdf3a into 92f7a2dc3e Griffin Downs 2024-09-29 02:51:12 +0530
  • c8788208c8 Fixing block issue with transpose version. Chip Kerchner 2024-09-27 13:27:03 -0500
  • d7c0d87cd1 Small changes. Chip Kerchner 2024-09-26 15:21:29 -0500
  • eb6f3a05ef Common MMA code. Chip Kerchner 2024-09-26 09:28:56 -0500
  • fb287d17fc Common code. Chip Kerchner 2024-09-25 16:31:36 -0500
  • 8ab6245771 Small change. Chip Kerchner 2024-09-24 16:50:21 -0500
  • df19375560 Almost final code for MMA. Chip Kerchner 2024-09-24 16:30:01 -0500
  • 05aa63e738 More MMA BF16 GEMV code. Chip Kerchner 2024-09-24 12:54:02 -0500
  • c9ce37d527 Force vector pairs in clang. Chip Kerchner 2024-09-23 08:43:58 -0500
  • 89a12fa083 MMA BF16 GEMV code. Chip Kerchner 2024-09-23 06:32:14 -0500
  • e9824ae798 deploy: 92f7a2dc3e martin-frbg 2024-09-19 12:15:38 +0000
  • 92f7a2dc3e
    Merge pull request #4899 from martin-frbg/flangmtune Martin Kroeker 2024-09-19 14:15:06 +0200
  • 969bb949b1
    Strip any mtune option from FFLAGS is the compiler is flang-new Martin Kroeker 2024-09-19 11:10:28 +0200
  • 30733e7d6c deploy: fca86e359c martin-frbg 2024-09-16 09:17:50 +0000
  • fca86e359c
    Merge pull request #4887 from goplanid/develop Martin Kroeker 2024-09-16 11:17:19 +0200
  • 7947970f9d Move common code. Chip Kerchner 2024-09-13 06:22:13 -0500
  • 60c1519e01
    Merge pull request #4896 from martin-frbg/update_azure_mac_hpc Martin Kroeker 2024-09-12 21:09:28 +0200
  • c8313d9d80
    Merge pull request #4895 from martin-frbg/update_homebrewjob Martin Kroeker 2024-09-12 21:09:10 +0200
  • b588e922a1
    Update oneAPI download location for Mac to final Martin Kroeker 2024-09-12 18:13:46 +0200
  • 4178905fa7
    Update version of upload-artifacts following deprecation Martin Kroeker 2024-09-12 16:39:20 +0200
  • 70ea109d67 deploy: 5f70e245a2 martin-frbg 2024-09-12 13:10:29 +0000
  • 5f70e245a2
    Merge pull request #4894 from martin-frbg/issue4893 Martin Kroeker 2024-09-12 15:09:54 +0200
  • 383e0b133e
    remove suppression of gcc14's incompatible pointer error Martin Kroeker 2024-09-11 22:21:09 +0200
  • 869a169c57
    Fix ZAXPYTEST prototype Martin Kroeker 2024-09-11 22:18:14 +0200
  • 72216d28c2 Fix bug with inc_y adding results twice. Chip Kerchner 2024-09-11 08:47:32 -0500