Commit Graph

  • 13c764eaaa
    Merge pull request #4434 from martin-frbg/issue4433 Martin Kroeker 2024-01-15 23:36:07 +0100
  • 025a1b2c7b
    Only use mtune=native when not cross-compiling Martin Kroeker 2024-01-15 22:40:21 +0100
  • 2527afaaa2
    Merge pull request #4429 from martin-frbg/issue4428 Martin Kroeker 2024-01-15 11:26:12 +0100
  • 0d2e486edf
    Handle NAN and INF Martin Kroeker 2024-01-15 11:18:59 +0100
  • a782103b9c
    Merge pull request #4425 from martin-frbg/issue2392 Martin Kroeker 2024-01-14 21:57:57 +0100
  • 152a6c43b6
    Add blas_omp_threads_local Martin Kroeker 2024-01-14 19:59:55 +0100
  • 8a9d492af7
    Add default for blas_omp_threads_local Martin Kroeker 2024-01-14 19:58:49 +0100
  • b3341527ad
    Merge pull request #4426 from martin-frbg/issue4415 Martin Kroeker 2024-01-13 23:27:13 +0100
  • 9fab60d32f
    Remove matrix dimension 6 from SGS to avoid spurious errors from FMA Martin Kroeker 2024-01-13 20:39:05 +0100
  • bf66af3dc0
    remove matrix dimension 6 from DGS to avoid spurious errors from FMA Martin Kroeker 2024-01-13 20:37:36 +0100
  • 87d31af2ae
    Add openblas_set_num_threads_local() Martin Kroeker 2024-01-13 20:06:24 +0100
  • 2e2e538b7c
    Add openblas_set_num_threads_local() and use of blas_omp_threads_local in OMP parallel regions Martin Kroeker 2024-01-13 20:02:43 +0100
  • f9b2d7f225
    Merge pull request #3253 from wi24rd/patch-1 Martin Kroeker 2024-01-13 19:55:01 +0100
  • 5f5b7c4f45
    Merge pull request #4423 from martin-frbg/issue4422 Martin Kroeker 2024-01-12 16:30:50 +0100
  • f31bea07dd
    Merge pull request #4419 from martin-frbg/issue4413 Martin Kroeker 2024-01-12 14:27:08 +0100
  • 20413ee6ec
    Update zscal.c Martin Kroeker 2024-01-12 13:11:13 +0100
  • b57627c27f
    Handle NAN and INF Martin Kroeker 2024-01-12 12:03:08 +0100
  • d1ead06bd8
    define NAN and INFINITY if needed Martin Kroeker 2024-01-12 09:29:13 +0100
  • 995a990e24
    Make AVX512 BFLOAT16 kernels conditional on compiler capability Martin Kroeker 2024-01-12 00:12:46 +0100
  • 1dada6d65d
    Add compiler test and flag for AVX512BF16 capability Martin Kroeker 2024-01-12 00:10:56 +0100
  • 7df363e1e2
    temporarily disable the MSA C/ZSCAL kernels Martin Kroeker 2024-01-12 00:08:52 +0100
  • 3599f2de8b
    Merge pull request #4421 from ChipKerchner/power10Copies_DGEMM Martin Kroeker 2024-01-10 07:49:00 +0100
  • 5c5c1a1220 Merge remote-tracking branch 'origin/develop' into power10Copies_DGEMM Chip-Kerchner 2024-01-09 07:18:41 -0600
  • 058dd2a4cb Replace two vector loads with one vector pair load and fix endianness of stores - DGEMM versions. Chip-Kerchner 2024-01-08 14:16:09 -0600
  • 1c31f56e5a
    Handle NAN Martin Kroeker 2024-01-08 16:11:25 +0100
  • 7ee1ee38e2
    Handle NaN in input Martin Kroeker 2024-01-08 14:20:07 +0100
  • f637e12713
    Handle INF and NAN Martin Kroeker 2024-01-08 09:52:38 +0100
  • 25b0c48082
    Update zscal.c Martin Kroeker 2024-01-08 09:49:18 +0100
  • 5e7f714e93
    Update zscal.c Martin Kroeker 2024-01-08 08:17:40 +0100
  • cf8b03ae8b
    Use NAN rather than SNAN for portability Martin Kroeker 2024-01-07 23:09:57 +0100
  • 7a6a24647d
    Merge pull request #4420 from martin-frbg/revertstuff Martin Kroeker 2024-01-07 23:04:22 +0100
  • f0808d856b
    Handle NAN in input Martin Kroeker 2024-01-07 20:27:29 +0100
  • acf17a825d
    Handle NAN in input Martin Kroeker 2024-01-07 20:26:16 +0100
  • f052bd4705
    revert accidental direct commit to develop Martin Kroeker 2024-01-07 20:21:48 +0100
  • 91bbde7f64
    revert accidental direct commit to develop Martin Kroeker 2024-01-07 20:21:05 +0100
  • 2173356d5b
    Update zscal_vector.c Martin Kroeker 2024-01-07 19:45:28 +0100
  • b08a208365
    Update zscal_vector.c Martin Kroeker 2024-01-07 19:14:41 +0100
  • 0c33b57f5f
    Handle NAN in input Martin Kroeker 2024-01-07 18:40:19 +0100
  • 903589f84b
    Update zscal.c Martin Kroeker 2024-01-07 18:37:00 +0100
  • 711433fcf0
    Update zscal.c Martin Kroeker 2024-01-07 18:01:58 +0100
  • d3d99c34f2
    Fix handling of NAN and INF Martin Kroeker 2024-01-07 17:56:51 +0100
  • c9df62e883
    Fix handling of NAN Martin Kroeker 2024-01-07 17:49:40 +0100
  • def4996170
    Fix handling of NAN and INF arguments Martin Kroeker 2024-01-07 15:29:42 +0100
  • e48627c999
    Add tests for ZSCAL with NaN and Inf arguments Martin Kroeker 2024-01-06 23:55:52 +0100
  • 1412d2deeb
    Update version to 0.3.26.dev Martin Kroeker 2024-01-02 22:33:01 +0100
  • 4f5da84e2f
    Update version to 0.3.26.dev Martin Kroeker 2024-01-02 22:32:27 +0100
  • 1ad742844b
    Merge pull request #4409 from OpenMathLib/release-0.3.0 Martin Kroeker 2024-01-02 22:31:38 +0100
  • 6c77e5e314
    Update Makefile.rule v0.3.26 Martin Kroeker 2024-01-02 22:25:05 +0100
  • fde8bb9903
    Update version to 0.3.26 Martin Kroeker 2024-01-02 22:24:33 +0100
  • 8fe7f80271
    Merge pull request #4408 from OpenMathLib/develop Martin Kroeker 2024-01-02 22:23:31 +0100
  • cddd35fae1
    Merge pull request #4407 from martin-frbg/changelog0326 Martin Kroeker 2024-01-02 22:21:16 +0100
  • 03713bc464
    Update Changelog for 0.3.26 Martin Kroeker 2024-01-02 22:08:49 +0100
  • cdff44e4d3
    Merge pull request #4406 from martin-frbg/issue3291 Martin Kroeker 2024-01-02 22:02:56 +0100
  • 8278d0d093
    Merge pull request #4353 from erikbs/feature/fix-xerbla-linking-on-older-mac-versions Martin Kroeker 2024-01-02 19:55:05 +0100
  • 504f9b0c5e
    Increase S/D GEMM PQ to match typical L2 size as for NeoverseV1 Martin Kroeker 2024-01-02 18:46:21 +0100
  • 2762270f84
    Update .cirrus.yml Martin Kroeker 2024-01-02 09:11:48 +0100
  • c7edc3d18e
    Update Makefile Martin Kroeker 2024-01-01 23:54:02 +0100
  • ba31f9292a
    benchmark neoverse dgemm Martin Kroeker 2024-01-01 21:47:24 +0100
  • 534de14a02
    Merge pull request #4402 from martin-frbg/lapack967 Martin Kroeker 2023-12-31 16:31:28 +0100
  • 4a15d72420
    AzureCI: Update alpine-chroot-install (#4403) Martin Kroeker 2023-12-31 16:30:57 +0100
  • c5e6740704
    fix formatting Martin Kroeker 2023-12-31 15:34:54 +0100
  • e017ef8f37
    Update alpine-chroot-install Martin Kroeker 2023-12-31 15:06:21 +0100
  • 0c43c6fa99
    Merge pull request #4341 from catap/openblas.pc.in Martin Kroeker 2023-12-31 13:25:06 +0100
  • 00d7476b4b
    Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) Martin Kroeker 2023-12-31 12:39:21 +0100
  • 1b668479de
    Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) Martin Kroeker 2023-12-31 12:37:52 +0100
  • bd787c8a1a
    Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) Martin Kroeker 2023-12-31 12:36:47 +0100
  • d3451af03f
    Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) Martin Kroeker 2023-12-31 12:35:37 +0100
  • 5a20bc5e02
    Merge pull request #4401 from martin-frbg/fix4398 Martin Kroeker 2023-12-31 10:15:59 +0100
  • 2802478449
    revert change to Loongson2k1000 zgemm Martin Kroeker 2023-12-30 23:35:51 +0100
  • 910ab7f698
    Merge branch 'OpenMathLib:develop' into fix4398 Martin Kroeker 2023-12-30 22:51:31 +0100
  • 44b5b9e39f
    Update C/ZGEMM MN for Loongson2k1000 Martin Kroeker 2023-12-30 22:50:40 +0100
  • c1d1c2748f
    Merge branch 'OpenMathLib:develop' into fix4398 Martin Kroeker 2023-12-30 20:51:26 +0100
  • 9d89bcfbf0
    Merge pull request #4399 from martin-frbg/fixloongsonci Martin Kroeker 2023-12-30 20:50:55 +0100
  • d2b7f0f9c6
    define CGEMM INCOPY/ITCOPY kernels Martin Kroeker 2023-12-30 20:49:53 +0100
  • 2327b13b3f
    define CGEMM INCOPY/ITCOPY kernels Martin Kroeker 2023-12-30 20:48:40 +0100
  • 0f648ebcd1
    use alternate download for the CLFS cross-compiler package Martin Kroeker 2023-12-30 20:31:32 +0100
  • 519b40fad9
    Merge pull request #4398 from yinshiyou/la-dev Martin Kroeker 2023-12-30 19:51:08 +0100
  • a5d0d21378 loongarch64: Add zgemm and cgemm optimization pengxu 2023-12-29 15:10:01 +0800
  • 546f13558c loongarch64: Add {c/z}swap and {c/z}sum optimization gxw 2023-12-29 11:03:53 +0800
  • edabb93668 loongarch64: Refine axpby optimization functions. Hao Chen 2023-12-29 15:08:10 +0800
  • 1ec5dded43 loongarch64: Add c/zrot optimization functions. Hao Chen 2023-12-28 21:23:59 +0800
  • 3c53ded315 loongarch64: Add c/znrm2 optimization functions. Hao Chen 2023-12-28 20:26:01 +0800
  • fbd612f8c4 loongarch64: Add ic/zamin optimization functions. Hao Chen 2023-12-28 20:07:58 +0800
  • d97272cb35 loongarch64: Add c/zdot optimization functions. Hao Chen 2023-12-28 19:09:18 +0800
  • 65a0aeb128 loongarch64: Add c/zcopy optimization functions. Hao Chen 2023-12-28 17:45:17 +0800
  • 2a34fb4b80 loongarch64: Add and refine scal optimization functions. Hao Chen 2023-12-27 18:17:51 +0800
  • 8785e948b5 loongarch64: Add camin optimization function. Hao Chen 2023-12-27 17:04:46 +0800
  • 0753848e03 loongarch64: Refine and add axpy optimization functions. Hao Chen 2023-12-27 16:54:01 +0800
  • 06fd5b5995 loongarch64: Add and Refine asum optimization functions. Hao Chen 2023-12-27 10:44:02 +0800
  • e771be185e Optimize copy functions with lsx. guxiwei 2023-12-21 14:28:06 +0800
  • 179ed51d3b Add dgemm_kernel_8x4.S file. Hao Chen 2023-12-21 14:18:39 +0800
  • 173a65d4e6 loongarch64: Add and refine iamax optimization functions. Hao Chen 2023-12-25 15:11:04 +0800
  • ea70e165c7 loongarch64: Refine rot optimization. zhoupeng 2023-12-28 20:07:59 +0800
  • 116aee7527 loongarch64: Refine imin optimization. zhoupeng 2023-12-28 15:17:28 +0800
  • 8be2654193 loongarch64: Refine imax optimization. zhoupeng 2023-12-28 10:24:24 +0800
  • 154baad454 loongarch64: Refine iamin optimization. zhoupeng 2023-12-27 16:04:33 +0800
  • 36c12c4971 loongarch64: Refine copy,swap,nrm2,sum optimization. Shiyou Yin 2023-12-27 11:30:17 +0800
  • c6996a80e9 loongarch64: Refine amax,amin,max,min optimization. Shiyou Yin 2023-12-08 16:06:17 +0800
  • 21564bde2c
    Merge pull request #4394 from martin-frbg/dyn_vortex Martin Kroeker 2023-12-28 13:35:55 +0100
  • 75fe9c21e5 Scale P and Q with L2 cache size for SVE Chris Sidebottom 2023-12-27 17:52:19 +0000