Commit Graph

  • 044f622140
    Merge fdabfa5d5c into 453b9e4886 Martin Kroeker 2024-11-02 12:55:05 +0000
  • cb5e6197c4
    Merge c47f57bac5 into 453b9e4886 Martin Kroeker 2024-11-02 12:55:05 +0000
  • d9eb0522b2
    Merge 8450c13fb1 into 453b9e4886 Sacha 2024-11-02 12:55:04 +0000
  • 918f90160d
    Merge c1a5a71d1c into 453b9e4886 Markus Mützel 2024-11-02 12:55:04 +0000
  • c3bf021ff8
    Merge 82827762c0 into 453b9e4886 Christopher Sidebottom 2024-11-02 12:55:04 +0000
  • ca3c09b058
    Merge 6d05b63bce into 453b9e4886 Peter Edwards 2024-11-02 12:55:04 +0000
  • ef5c339013
    Merge 6e674b549a into 453b9e4886 Martin Kroeker 2024-11-02 12:55:04 +0000
  • b0a40d1e98
    Merge 9094edb854 into 453b9e4886 Martin Kroeker 2024-11-02 12:55:04 +0000
  • 625fe3418e
    Merge 4da1a0b1da into 453b9e4886 Markus Mützel 2024-11-02 12:55:04 +0000
  • 742a7abe1a
    Merge b3eaac3497 into 453b9e4886 Martin Kroeker 2024-11-02 12:55:04 +0000
  • 33ac70f470
    Merge d67a534b9e into 453b9e4886 Martin Kroeker 2024-11-02 12:55:04 +0000
  • 7f44d27bcc
    Merge 5f8744d4e4 into 453b9e4886 Christopher Sidebottom 2024-11-02 12:55:04 +0000
  • 820df5f4a8
    Merge 3b2421cba0 into 453b9e4886 CDAC-SSDG 2024-11-01 01:06:52 +0800
  • 919c221cec deploy: 453b9e4886 gh-pages martin-frbg 2024-10-31 16:47:30 +0000
  • 453b9e4886
    Merge pull request #4961 from h-vetinari/flang develop Martin Kroeker 2024-10-31 17:46:55 +0100
  • d3272e51eb explicitly link to OpenMP H. Vetinari 2024-07-01 09:05:00 +1100
  • c634114c8a
    Merge pull request #4960 from martin-frbg/gemmtr Martin Kroeker 2024-10-30 18:37:26 +0100
  • 598bd21991
    Merge pull request #4958 from XiWeiGu/x86_64_opt_somatcopy_ct_avx Martin Kroeker 2024-10-30 16:04:00 +0100
  • c9d1a3b912
    Merge pull request #4954 from XiWeiGu/la64_update_gh_actions Martin Kroeker 2024-10-30 14:35:57 +0100
  • c3e7d08fb4
    Copy GEMMT to its new name GEMMTR Martin Kroeker 2024-10-30 12:56:16 +0100
  • 0cf656fd3e
    Add copies of GEMMT under its new name GEMMTR Martin Kroeker 2024-10-30 12:55:14 +0100
  • 2edf548203 deploy: 24b5ccaf4b martin-frbg 2024-10-30 11:49:09 +0000
  • 24b5ccaf4b
    Merge pull request #4202 from TiborGY/inlines_pt2 Martin Kroeker 2024-10-30 12:48:35 +0100
  • 3b2421cba0
    Add files via upload Juliya32 2024-10-30 14:23:42 +0530
  • 012fe4da36
    Delete kernel/arm64/rot_kernel_sve.c Juliya32 2024-10-30 14:23:15 +0530
  • d90ee00f85
    Delete kernel/arm64/rot_kernel_c.c Juliya32 2024-10-30 14:22:51 +0530
  • 668e28adc4
    Delete kernel/arm64/rot.c Juliya32 2024-10-30 14:22:31 +0530
  • fa880ab1cf
    Update KERNEL.ARMV8SVE SushilPratap04 2024-10-30 14:09:37 +0530
  • 7822ae9617
    Added sve kernels for rot routine. SushilPratap04 2024-10-30 14:05:21 +0530
  • b8bc2a752e
    Added sve optimized kernels for swap routine SushilPratap04 2024-10-30 14:02:57 +0530
  • 0667cf6c92
    Added optimized scal routine files CDAC-SSDG 2024-10-30 14:01:09 +0530
  • 2718b37fed
    Update CONTRIBUTORS.md CDAC-SSDG 2024-10-30 13:57:13 +0530
  • 73c6a28073 x86_64: opt somatcopy_ct with AVX gxw 2024-10-29 06:31:58 +0000
  • f1855d5c91
    Add GEMMTR Martin Kroeker 2024-10-28 21:15:03 +0100
  • 75dab46765
    Duplicate GEMMT functions using the new name GEMMTR recently coined by the Reference BLAS Martin Kroeker 2024-10-28 21:14:14 +0100
  • f66e6d32c2
    Merge pull request #4953 from NickelWenzel/fix_trtrs_return_types Martin Kroeker 2024-10-25 23:29:24 +0200
  • a8bb105ed6
    Merge pull request #4848 from haampie/fix/cmake-min-version Martin Kroeker 2024-10-25 20:59:13 +0200
  • 0e6a2cc93c
    bump the minimum_required version instead Martin Kroeker 2024-10-25 16:47:52 +0200
  • 815cb24944 remove unused INLINE macro definitions TGY 2023-08-16 06:06:00 +0200
  • 2c65e504bd deploy: ac736820d7 martin-frbg 2024-10-25 11:44:27 +0000
  • ac736820d7
    Merge pull request #4955 from cdaley/optimize_gemv_forwarding Martin Kroeker 2024-10-25 13:43:54 +0200
  • 8f595382c4 gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04 gxw 2024-10-25 03:12:15 +0000
  • cb48505251 optimize gemv forwarding on ARM64 systems Chris Daley 2024-10-24 21:05:26 -0700
  • 79f4bbd4cd fix: return types of *trtrs routines nickel 2024-10-24 11:20:02 +0200
  • 6405318ea8 deploy: 72461f1c8c martin-frbg 2024-10-23 14:40:35 +0000
  • 72461f1c8c
    Merge pull request #4950 from ayappanec/fix-aix-build Martin Kroeker 2024-10-23 16:40:02 +0200
  • 020cce1068 Fix build issues with gcc compiler as well Ayappan Perumal 2024-10-23 04:24:06 -0500
  • b6ec73e77c Fix AIX build Ayappan Perumal 2024-10-21 07:38:03 -0500
  • 97749d4d8a deploy: 8a0cd5fcef martin-frbg 2024-10-20 19:53:28 +0000
  • 8a0cd5fcef
    Merge pull request #4949 from martin-frbg/mingw32-14.2 Martin Kroeker 2024-10-20 21:52:57 +0200
  • 4dba6ce6ea
    work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests Martin Kroeker 2024-10-20 20:25:06 +0200
  • a93ec74e95
    Merge pull request #4948 from martin-frbg/fixhavesve Martin Kroeker 2024-10-18 20:00:42 +0200
  • c4bb4e74fc
    NeoverseN2 has SVE too Martin Kroeker 2024-10-18 14:50:55 +0200
  • 86720778ef
    write HAVE_SVE to config where applicable Martin Kroeker 2024-10-18 14:14:43 +0200
  • 286161c23b deploy: 016bdb9b0b martin-frbg 2024-10-18 12:03:36 +0000
  • 016bdb9b0b
    Merge pull request #4946 from XiWeiGu/la64_omatcopy_lasx Martin Kroeker 2024-10-18 14:03:06 +0200
  • ffaa5765a4 Bench: Add omatcopy gxw 2024-10-17 12:32:54 +0000
  • a93897276b
    Merge pull request #4943 from martin-frbg/update_readme Martin Kroeker 2024-10-17 21:13:48 +0200
  • 3fc1225dd6
    Merge branch 'OpenMathLib:develop' into update_readme Martin Kroeker 2024-10-17 21:08:58 +0200
  • 33078d11e4
    stress importance of TARGET setting in DYNAMIC_ARCH builds Martin Kroeker 2024-10-17 21:07:49 +0200
  • 0cb3240a11 deploy: 15a57598f5 martin-frbg 2024-10-17 17:21:41 +0000
  • 15a57598f5
    Merge pull request #4944 from ChipKerchner/vectorizeBF16GEMV Martin Kroeker 2024-10-17 19:21:07 +0200
  • ab71a1edf2 Better VSX. Chip Kerchner 2024-10-17 08:25:02 -0500
  • bb31bbef52 LoongArch64: Opt somatcopy_ct with LASX gxw 2024-10-17 11:45:13 +0000
  • b37129341b LoongArch64: Opt somatcopy_cn with LASX gxw 2024-10-17 11:27:55 +0000
  • acf6cab304 LoongArch64: Opt somatcopy_rn with LASX gxw 2024-10-17 09:50:02 +0000
  • 15edb441bf LoongArch64: Opt somatcopy_rt with LASX gxw 2024-10-14 17:36:56 +0800
  • 457d1c6972
    remove unused CI badges, wiki->docs, xianyi->OpenMathLib Martin Kroeker 2024-10-17 10:33:08 +0200
  • 24b39a8af6
    Update CMakeLists.txt Martin Kroeker 2024-10-16 15:21:39 +0200
  • a23de0a334 deploy: 6a60eb1a02 martin-frbg 2024-10-16 07:39:01 +0000
  • 6a60eb1a02
    Merge pull request #4924 from XiWeiGu/la64_readme Martin Kroeker 2024-10-16 09:38:18 +0200
  • 692015bc76
    svg spaette 2024-10-15 19:22:38 -0500
  • a03b7247d2
    Update CONTRIBUTORS.md garadeaniket 2024-10-15 12:16:48 +0530
  • d78f88f99d
    Update CONTRIBUTORS.md garadeaniket 2024-10-15 12:15:41 +0530
  • c04aca1f35
    Update KERNEL.ARMV8SVE garadeaniket 2024-10-15 11:46:58 +0530
  • 0a9ccb8cb6
    Delete kernel/arm64/gemv_n.c garadeaniket 2024-10-15 11:05:02 +0530
  • dc32e753d8
    Update CONTRIBUTORS.md garadeaniket 2024-10-15 10:46:21 +0530
  • 8483a71169
    Merge pull request #4937 from martin-frbg/lapack1064 Martin Kroeker 2024-10-14 21:52:41 +0200
  • 22628f1a69
    Fix leading dimension for B (Reference-LAPACK PR 1064) Martin Kroeker 2024-10-14 18:59:03 +0200
  • 27ed6da331
    Fix leading dimension for B (Reference-LAPACK PR 1064) Martin Kroeker 2024-10-14 18:57:50 +0200
  • 7018c1b001
    Fix leading dimension for B (Reference-LAPACK PR 1064) Martin Kroeker 2024-10-14 18:56:44 +0200
  • a659f40fe1
    Fix leading dimension for B (Reference-LAPACK PR 1064) Martin Kroeker 2024-10-14 18:53:30 +0200
  • 191a33a916 deploy: c979c1d948 martin-frbg 2024-10-14 06:13:58 +0000
  • c979c1d948
    Merge pull request #4936 from martin-frbg/fixmips64generic Martin Kroeker 2024-10-14 08:13:27 +0200
  • a47b3c8867
    Fix unroll parameter selection for MIPS64_GENERIC Martin Kroeker 2024-10-13 22:54:34 +0200
  • 2391dc1c0f Merge branch 'vectorizeBF16GEMV' of github.ibm.com:PowerAppLibs/OpenBLAS into vectorizeBF16GEMV Chip Kerchner 2024-10-13 13:48:33 -0500
  • 36bd3eeddf Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). Chip Kerchner 2024-10-13 13:46:11 -0500
  • f8e113f27b Replace types with include file. Chip Kerchner 2024-10-13 10:55:03 -0500
  • fd23aae405
    Update prebuild.cmake Martin Kroeker 2024-10-13 13:20:24 +0200
  • 270abe35b4
    Update prebuild.cmake Martin Kroeker 2024-10-13 12:22:44 +0200
  • d992808da9
    Update dynamic_arch.yml Martin Kroeker 2024-10-13 11:26:56 +0200
  • a53a197934 Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV Chip Kerchner 2024-10-12 15:15:17 -0500
  • 4318ef2eed
    Update prebuild.cmake Martin Kroeker 2024-10-12 21:19:10 +0200
  • 38b70e156b
    Update prebuild.cmake Martin Kroeker 2024-10-12 21:16:39 +0200
  • 491e5d9646
    Update param.h Martin Kroeker 2024-10-12 18:39:13 +0200
  • 2302328a78 deploy: 3184b7f209 martin-frbg 2024-10-12 15:20:14 +0000
  • 3184b7f209
    Merge pull request #4933 from ChipKerchner/thread_sbgemv Martin Kroeker 2024-10-12 17:19:41 +0200
  • e5721d4f1f
    Update Makefile Martin Kroeker 2024-10-12 12:24:02 +0200
  • 0082240044 Merge branch 'thread_sbgemv' into vectorizeBF16GEMV Chip Kerchner 2024-10-11 16:13:59 -0500
  • 1d51ca5798 Change multi-threading logic for SBGEMV to be the same as SGEMV. Chip Kerchner 2024-10-11 16:08:48 -0500