Commit Graph

7761 Commits

Author SHA1 Message Date
Martin Kroeker 6c77e5e314
Update Makefile.rule 2024-01-02 22:25:05 +01:00
Martin Kroeker fde8bb9903
Update version to 0.3.26 2024-01-02 22:24:33 +01:00
Martin Kroeker 8fe7f80271
Merge pull request #4408 from OpenMathLib/develop
merge develop for 0.3.26 release
2024-01-02 22:23:31 +01:00
Martin Kroeker cddd35fae1
Merge pull request #4407 from martin-frbg/changelog0326
Update Changelog for 0.3.26
2024-01-02 22:21:16 +01:00
Martin Kroeker 03713bc464
Update Changelog for 0.3.26 2024-01-02 22:08:49 +01:00
Martin Kroeker cdff44e4d3
Merge pull request #4406 from martin-frbg/issue3291
Increase S/D GEMM PQ parameters for Neoverse N1
2024-01-02 22:02:56 +01:00
Martin Kroeker 8278d0d093
Merge pull request #4353 from erikbs/feature/fix-xerbla-linking-on-older-mac-versions
Fix _xerbla_ linking on older Mac versions
2024-01-02 19:55:05 +01:00
Martin Kroeker 504f9b0c5e
Increase S/D GEMM PQ to match typical L2 size as forNeoverseV1 2024-01-02 18:46:21 +01:00
Martin Kroeker 534de14a02
Merge pull request #4402 from martin-frbg/lapack967
Fix an uninitialized read (use of wrong variable) in ?BBCSD (Reference-LAPACK PR 967)
2023-12-31 16:31:28 +01:00
Martin Kroeker 4a15d72420
AzureCI: Update alpine-chroot-install (#4403)
* Update alpine-chroot-install
2023-12-31 16:30:57 +01:00
Martin Kroeker 0c43c6fa99
Merge pull request #4341 from catap/openblas.pc.in
cmake/openblas.pc.in: fixed version and URL
2023-12-31 13:25:06 +01:00
Martin Kroeker 00d7476b4b
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:39:21 +01:00
Martin Kroeker 1b668479de
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:37:52 +01:00
Martin Kroeker bd787c8a1a
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:36:47 +01:00
Martin Kroeker d3451af03f
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:35:37 +01:00
Martin Kroeker 5a20bc5e02
Merge pull request #4401 from martin-frbg/fix4398
Update CGEMM MN for Loongson2K1000
2023-12-31 10:15:59 +01:00
Martin Kroeker 2802478449
revert change to Loongson2k1000 zgemm 2023-12-30 23:35:51 +01:00
Martin Kroeker 910ab7f698
Merge branch 'OpenMathLib:develop' into fix4398 2023-12-30 22:51:31 +01:00
Martin Kroeker 44b5b9e39f
Update C/ZGEMM MN for Loongson2k1000 2023-12-30 22:50:40 +01:00
Martin Kroeker 9d89bcfbf0
Merge pull request #4399 from martin-frbg/fixloongsonci
CI, Loongarch64: use alternate download for the cross-compiler package
2023-12-30 20:50:55 +01:00
Martin Kroeker 0f648ebcd1
use alternate download for the CLFS cross-compiler package 2023-12-30 20:31:32 +01:00
Martin Kroeker 519b40fad9
Merge pull request #4398 from yinshiyou/la-dev
Add Optimizations for LoongArch.
2023-12-30 19:51:08 +01:00
pengxu a5d0d21378 loongarch64: Add zgemm and cgemm optimization 2023-12-29 18:06:26 +08:00
gxw 546f13558c loongarch64: Add {c/z}swap and {c/z}sum optimization 2023-12-29 17:30:57 +08:00
Hao Chen edabb93668 loongarch64: Refine axpby optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen 1ec5dded43 loongarch64: Add c/zrot optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen 3c53ded315 loongarch64: Add c/znrm2 optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen fbd612f8c4 loongarch64: Add ic/zamin optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen d97272cb35 loongarch64: Add c/zdot optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen 65a0aeb128 loongarch64: Add c/zcopy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen 2a34fb4b80 loongarch64: Add and refine scal optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen 8785e948b5 loongarch64: Add camin optimization function. 2023-12-29 17:30:57 +08:00
Hao Chen 0753848e03 loongarch64: Refine and add axpy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen 06fd5b5995 loongarch64: Add and Refine asum optimization functions. 2023-12-29 17:30:57 +08:00
guxiwei e771be185e Optimize copy functions with lsx.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen 179ed51d3b Add dgemm_kernel_8x4.S file. 2023-12-29 17:30:57 +08:00
Hao Chen 173a65d4e6 loongarch64: Add and refine iamax optimization functions. 2023-12-29 17:30:57 +08:00
zhoupeng ea70e165c7 loongarch64: Refine rot optimization. 2023-12-29 17:30:57 +08:00
zhoupeng 116aee7527 loongarch64: Refine imin optimization. 2023-12-29 17:30:57 +08:00
zhoupeng 8be2654193 loongarch64: Refine imax optimization. 2023-12-29 17:30:57 +08:00
zhoupeng 154baad454 loongarch64: Refine iamin optimization. 2023-12-29 17:30:57 +08:00
Shiyou Yin 36c12c4971 loongarch64: Refine copy,swap,nrm2,sum optimization. 2023-12-29 17:30:57 +08:00
Shiyou Yin c6996a80e9 loongarch64: Refine amax,amin,max,min optimization. 2023-12-29 17:30:57 +08:00
Martin Kroeker 21564bde2c
Merge pull request #4394 from martin-frbg/dyn_vortex
Add Apple M as NeoverseN1 in ARM64 DYNAMIC_ARCH runtime detection
2023-12-28 13:35:55 +01:00
Martin Kroeker e9c32ed165
Merge pull request #4384 from yetist/develop
Fix: build failed on LoongArch
2023-12-27 14:05:01 +01:00
Martin Kroeker e7a895e714
Add Apple M as NeoverseN1 2023-12-25 12:36:05 +01:00
Martin Kroeker 474ce0ace9
Merge pull request #4393 from martin-frbg/pr4389-2
Remove redundant targets from the default ARM64 DYNAMIC_ARCH list
2023-12-25 12:30:56 +01:00
Martin Kroeker 1106460bb3
remove redundant targets from the default ARM64 DYNAMIC_ARCH list 2023-12-25 12:29:56 +01:00
Martin Kroeker 236acee706
Merge pull request #4389 from Mousius/reduce-dynamic-targets
Use functionally equivalent dynamic targets
2023-12-25 12:27:42 +01:00
Xiaotian Wu d2f4f1b28a CI: update toolchains for LoongArch64 2023-12-25 16:04:43 +08:00