Martin Kroeker
|
0c43c6fa99
|
Merge pull request #4341 from catap/openblas.pc.in
cmake/openblas.pc.in: fixed version and URL
|
2023-12-31 13:25:06 +01:00 |
Martin Kroeker
|
00d7476b4b
|
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967)
|
2023-12-31 12:39:21 +01:00 |
Martin Kroeker
|
1b668479de
|
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967)
|
2023-12-31 12:37:52 +01:00 |
Martin Kroeker
|
bd787c8a1a
|
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967)
|
2023-12-31 12:36:47 +01:00 |
Martin Kroeker
|
d3451af03f
|
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967)
|
2023-12-31 12:35:37 +01:00 |
Martin Kroeker
|
5a20bc5e02
|
Merge pull request #4401 from martin-frbg/fix4398
Update CGEMM MN for Loongson2K1000
|
2023-12-31 10:15:59 +01:00 |
Martin Kroeker
|
2802478449
|
revert change to Loongson2k1000 zgemm
|
2023-12-30 23:35:51 +01:00 |
Martin Kroeker
|
910ab7f698
|
Merge branch 'OpenMathLib:develop' into fix4398
|
2023-12-30 22:51:31 +01:00 |
Martin Kroeker
|
44b5b9e39f
|
Update C/ZGEMM MN for Loongson2k1000
|
2023-12-30 22:50:40 +01:00 |
Martin Kroeker
|
9d89bcfbf0
|
Merge pull request #4399 from martin-frbg/fixloongsonci
CI, Loongarch64: use alternate download for the cross-compiler package
|
2023-12-30 20:50:55 +01:00 |
Martin Kroeker
|
0f648ebcd1
|
use alternate download for the CLFS cross-compiler package
|
2023-12-30 20:31:32 +01:00 |
Martin Kroeker
|
519b40fad9
|
Merge pull request #4398 from yinshiyou/la-dev
Add Optimizations for LoongArch.
|
2023-12-30 19:51:08 +01:00 |
pengxu
|
a5d0d21378
|
loongarch64: Add zgemm and cgemm optimization
|
2023-12-29 18:06:26 +08:00 |
gxw
|
546f13558c
|
loongarch64: Add {c/z}swap and {c/z}sum optimization
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
edabb93668
|
loongarch64: Refine axpby optimization functions.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
1ec5dded43
|
loongarch64: Add c/zrot optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
3c53ded315
|
loongarch64: Add c/znrm2 optimization functions.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
fbd612f8c4
|
loongarch64: Add ic/zamin optimization functions.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
d97272cb35
|
loongarch64: Add c/zdot optimization functions.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
65a0aeb128
|
loongarch64: Add c/zcopy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
2a34fb4b80
|
loongarch64: Add and refine scal optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
8785e948b5
|
loongarch64: Add camin optimization function.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
0753848e03
|
loongarch64: Refine and add axpy optimization functions.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
06fd5b5995
|
loongarch64: Add and Refine asum optimization functions.
|
2023-12-29 17:30:57 +08:00 |
guxiwei
|
e771be185e
|
Optimize copy functions with lsx.
Signed-off-by: Hao Chen <chenhao@loongson.cn>
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
179ed51d3b
|
Add dgemm_kernel_8x4.S file.
|
2023-12-29 17:30:57 +08:00 |
Hao Chen
|
173a65d4e6
|
loongarch64: Add and refine iamax optimization functions.
|
2023-12-29 17:30:57 +08:00 |
zhoupeng
|
ea70e165c7
|
loongarch64: Refine rot optimization.
|
2023-12-29 17:30:57 +08:00 |
zhoupeng
|
116aee7527
|
loongarch64: Refine imin optimization.
|
2023-12-29 17:30:57 +08:00 |
zhoupeng
|
8be2654193
|
loongarch64: Refine imax optimization.
|
2023-12-29 17:30:57 +08:00 |
zhoupeng
|
154baad454
|
loongarch64: Refine iamin optimization.
|
2023-12-29 17:30:57 +08:00 |
Shiyou Yin
|
36c12c4971
|
loongarch64: Refine copy,swap,nrm2,sum optimization.
|
2023-12-29 17:30:57 +08:00 |
Shiyou Yin
|
c6996a80e9
|
loongarch64: Refine amax,amin,max,min optimization.
|
2023-12-29 17:30:57 +08:00 |
Martin Kroeker
|
21564bde2c
|
Merge pull request #4394 from martin-frbg/dyn_vortex
Add Apple M as NeoverseN1 in ARM64 DYNAMIC_ARCH runtime detection
|
2023-12-28 13:35:55 +01:00 |
Martin Kroeker
|
e9c32ed165
|
Merge pull request #4384 from yetist/develop
Fix: build failed on LoongArch
|
2023-12-27 14:05:01 +01:00 |
Martin Kroeker
|
e7a895e714
|
Add Apple M as NeoverseN1
|
2023-12-25 12:36:05 +01:00 |
Martin Kroeker
|
474ce0ace9
|
Merge pull request #4393 from martin-frbg/pr4389-2
Remove redundant targets from the default ARM64 DYNAMIC_ARCH list
|
2023-12-25 12:30:56 +01:00 |
Martin Kroeker
|
1106460bb3
|
remove redundant targets from the default ARM64 DYNAMIC_ARCH list
|
2023-12-25 12:29:56 +01:00 |
Martin Kroeker
|
236acee706
|
Merge pull request #4389 from Mousius/reduce-dynamic-targets
Use functionally equivalent dynamic targets
|
2023-12-25 12:27:42 +01:00 |
Xiaotian Wu
|
d2f4f1b28a
|
CI: update toolchains for LoongArch64
|
2023-12-25 16:04:43 +08:00 |
Wu Xiaotian
|
0baf462dbc
|
Fix: build failed on LoongArch
According to the documentation at https://github.com/loongson/la-abi-specs/blob/release/lapcs.adoc#the-base-abi-variants, valid -mabi parameters are lp64s, lp64f, lp64d, ilp32s, ilp32f and ilp32d.
|
2023-12-25 16:04:43 +08:00 |
Martin Kroeker
|
63a83939a1
|
Merge pull request #4390 from Mousius/reduce-kernel-duplication
Reduce duplication in kernel definitions
|
2023-12-24 18:04:26 +01:00 |
Martin Kroeker
|
dba404055d
|
Merge pull request #4392 from martin-frbg/lapack959
Fix issues related to the ?GEDMD functions (Reference-LAPACK PR 959)
|
2023-12-24 10:44:15 +01:00 |
Martin Kroeker
|
c6fa921027
|
Add tests for ?GEDMD (Reference-LAPACK PR 959)
|
2023-12-23 23:39:53 +01:00 |
Martin Kroeker
|
283713e4c5
|
Add tests for ?GEDMD (Reference-LAPACK PR 959)
|
2023-12-23 23:32:45 +01:00 |
Martin Kroeker
|
201f22f49a
|
Fix issues related to ?GEDMD (Reference-LAPACK PR 959)
|
2023-12-23 23:27:38 +01:00 |
Martin Kroeker
|
05dde8ef04
|
Merge pull request #4391 from martin-frbg/lapack942
Handle corner cases of LWORK (Reference-LAPACK PR 942)
|
2023-12-23 23:11:46 +01:00 |
Martin Kroeker
|
45ef0d7361
|
Handle corner cases of LWORK (Reference-LAPACK PR 942)
|
2023-12-23 20:16:33 +01:00 |
Martin Kroeker
|
c082669ad4
|
Handle corner cases of LWORK (Reference-LAPACK PR 942)
|
2023-12-23 20:05:03 +01:00 |
Martin Kroeker
|
29d6024ec5
|
Handle corner cases of LWORK (Reference-LAPACK PR 942)
|
2023-12-23 19:44:11 +01:00 |