Commit Graph

7772 Commits

Author SHA1 Message Date
Martin Kroeker
25b0c48082 Update zscal.c 2024-01-08 09:49:18 +01:00
Martin Kroeker
5e7f714e93 Update zscal.c 2024-01-08 08:17:40 +01:00
Martin Kroeker
cf8b03ae8b Use NAN rather than SNAN for portability 2024-01-07 23:09:57 +01:00
Martin Kroeker
f0808d856b Handle NAN in input 2024-01-07 20:27:29 +01:00
Martin Kroeker
acf17a825d Handle NAN in input 2024-01-07 20:26:16 +01:00
Martin Kroeker
c9df62e883 Fix handling of NAN 2024-01-07 17:49:40 +01:00
Martin Kroeker
def4996170 Fix handling of NAN and INF arguments 2024-01-07 15:29:42 +01:00
Martin Kroeker
e48627c999 Add tests for ZSCAL with NaN and Inf arguments 2024-01-06 23:55:52 +01:00
Martin Kroeker
1412d2deeb Update version to 0.3.26.dev 2024-01-02 22:33:01 +01:00
Martin Kroeker
4f5da84e2f Update version to 0.3.26.dev 2024-01-02 22:32:27 +01:00
Martin Kroeker
1ad742844b Merge pull request #4409 from OpenMathLib/release-0.3.0
merge release branch back into develop to copy tag
2024-01-02 22:31:38 +01:00
Martin Kroeker
6c77e5e314 Update Makefile.rule v0.3.26 2024-01-02 22:25:05 +01:00
Martin Kroeker
fde8bb9903 Update version to 0.3.26 2024-01-02 22:24:33 +01:00
Martin Kroeker
8fe7f80271 Merge pull request #4408 from OpenMathLib/develop
merge develop for 0.3.26 release
2024-01-02 22:23:31 +01:00
Martin Kroeker
cddd35fae1 Merge pull request #4407 from martin-frbg/changelog0326
Update Changelog for 0.3.26
2024-01-02 22:21:16 +01:00
Martin Kroeker
03713bc464 Update Changelog for 0.3.26 2024-01-02 22:08:49 +01:00
Martin Kroeker
cdff44e4d3 Merge pull request #4406 from martin-frbg/issue3291
Increase S/D GEMM PQ parameters for Neoverse N1
2024-01-02 22:02:56 +01:00
Martin Kroeker
8278d0d093 Merge pull request #4353 from erikbs/feature/fix-xerbla-linking-on-older-mac-versions
Fix _xerbla_ linking on older Mac versions
2024-01-02 19:55:05 +01:00
Martin Kroeker
504f9b0c5e Increase S/D GEMM PQ to match typical L2 size as forNeoverseV1 2024-01-02 18:46:21 +01:00
Martin Kroeker
534de14a02 Merge pull request #4402 from martin-frbg/lapack967
Fix an uninitialized read (use of wrong variable) in ?BBCSD (Reference-LAPACK PR 967)
2023-12-31 16:31:28 +01:00
Martin Kroeker
4a15d72420 AzureCI: Update alpine-chroot-install (#4403)
* Update alpine-chroot-install
2023-12-31 16:30:57 +01:00
Martin Kroeker
0c43c6fa99 Merge pull request #4341 from catap/openblas.pc.in
cmake/openblas.pc.in: fixed version and URL
2023-12-31 13:25:06 +01:00
Martin Kroeker
00d7476b4b Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:39:21 +01:00
Martin Kroeker
1b668479de Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:37:52 +01:00
Martin Kroeker
bd787c8a1a Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:36:47 +01:00
Martin Kroeker
d3451af03f Fix uninitialized read/wrong variable (Reference-LAPACK PR 967) 2023-12-31 12:35:37 +01:00
Martin Kroeker
5a20bc5e02 Merge pull request #4401 from martin-frbg/fix4398
Update CGEMM MN for Loongson2K1000
2023-12-31 10:15:59 +01:00
Martin Kroeker
2802478449 revert change to Loongson2k1000 zgemm 2023-12-30 23:35:51 +01:00
Martin Kroeker
910ab7f698 Merge branch 'OpenMathLib:develop' into fix4398 2023-12-30 22:51:31 +01:00
Martin Kroeker
44b5b9e39f Update C/ZGEMM MN for Loongson2k1000 2023-12-30 22:50:40 +01:00
Martin Kroeker
9d89bcfbf0 Merge pull request #4399 from martin-frbg/fixloongsonci
CI, Loongarch64: use alternate download for the cross-compiler package
2023-12-30 20:50:55 +01:00
Martin Kroeker
0f648ebcd1 use alternate download for the CLFS cross-compiler package 2023-12-30 20:31:32 +01:00
Martin Kroeker
519b40fad9 Merge pull request #4398 from yinshiyou/la-dev
Add Optimizations for LoongArch.
2023-12-30 19:51:08 +01:00
pengxu
a5d0d21378 loongarch64: Add zgemm and cgemm optimization 2023-12-29 18:06:26 +08:00
gxw
546f13558c loongarch64: Add {c/z}swap and {c/z}sum optimization 2023-12-29 17:30:57 +08:00
Hao Chen
edabb93668 loongarch64: Refine axpby optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen
1ec5dded43 loongarch64: Add c/zrot optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
3c53ded315 loongarch64: Add c/znrm2 optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen
fbd612f8c4 loongarch64: Add ic/zamin optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen
d97272cb35 loongarch64: Add c/zdot optimization functions. 2023-12-29 17:30:57 +08:00
Hao Chen
65a0aeb128 loongarch64: Add c/zcopy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
2a34fb4b80 loongarch64: Add and refine scal optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
8785e948b5 loongarch64: Add camin optimization function. 2023-12-29 17:30:57 +08:00
Hao Chen
0753848e03 loongarch64: Refine and add axpy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
06fd5b5995 loongarch64: Add and Refine asum optimization functions. 2023-12-29 17:30:57 +08:00
guxiwei
e771be185e Optimize copy functions with lsx.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
179ed51d3b Add dgemm_kernel_8x4.S file. 2023-12-29 17:30:57 +08:00
Hao Chen
173a65d4e6 loongarch64: Add and refine iamax optimization functions. 2023-12-29 17:30:57 +08:00
zhoupeng
ea70e165c7 loongarch64: Refine rot optimization. 2023-12-29 17:30:57 +08:00
zhoupeng
116aee7527 loongarch64: Refine imin optimization. 2023-12-29 17:30:57 +08:00