Martin Kroeker
d3451af03f
Fix uninitialized read/wrong variable (Reference-LAPACK PR 967)
2023-12-31 12:35:37 +01:00
Martin Kroeker
5a20bc5e02
Merge pull request #4401 from martin-frbg/fix4398
...
Update CGEMM MN for Loongson2K1000
2023-12-31 10:15:59 +01:00
Martin Kroeker
2802478449
revert change to Loongson2k1000 zgemm
2023-12-30 23:35:51 +01:00
Martin Kroeker
910ab7f698
Merge branch 'OpenMathLib:develop' into fix4398
2023-12-30 22:51:31 +01:00
Martin Kroeker
44b5b9e39f
Update C/ZGEMM MN for Loongson2k1000
2023-12-30 22:50:40 +01:00
Martin Kroeker
9d89bcfbf0
Merge pull request #4399 from martin-frbg/fixloongsonci
...
CI, Loongarch64: use alternate download for the cross-compiler package
2023-12-30 20:50:55 +01:00
Martin Kroeker
0f648ebcd1
use alternate download for the CLFS cross-compiler package
2023-12-30 20:31:32 +01:00
Martin Kroeker
519b40fad9
Merge pull request #4398 from yinshiyou/la-dev
...
Add Optimizations for LoongArch.
2023-12-30 19:51:08 +01:00
pengxu
a5d0d21378
loongarch64: Add zgemm and cgemm optimization
2023-12-29 18:06:26 +08:00
gxw
546f13558c
loongarch64: Add {c/z}swap and {c/z}sum optimization
2023-12-29 17:30:57 +08:00
Hao Chen
edabb93668
loongarch64: Refine axpby optimization functions.
2023-12-29 17:30:57 +08:00
Hao Chen
1ec5dded43
loongarch64: Add c/zrot optimization functions.
...
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
3c53ded315
loongarch64: Add c/znrm2 optimization functions.
2023-12-29 17:30:57 +08:00
Hao Chen
fbd612f8c4
loongarch64: Add ic/zamin optimization functions.
2023-12-29 17:30:57 +08:00
Hao Chen
d97272cb35
loongarch64: Add c/zdot optimization functions.
2023-12-29 17:30:57 +08:00
Hao Chen
65a0aeb128
loongarch64: Add c/zcopy optimization functions.
...
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
2a34fb4b80
loongarch64: Add and refine scal optimization functions.
...
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
8785e948b5
loongarch64: Add camin optimization function.
2023-12-29 17:30:57 +08:00
Hao Chen
0753848e03
loongarch64: Refine and add axpy optimization functions.
...
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
06fd5b5995
loongarch64: Add and Refine asum optimization functions.
2023-12-29 17:30:57 +08:00
guxiwei
e771be185e
Optimize copy functions with lsx.
...
Signed-off-by: Hao Chen <chenhao@loongson.cn>
2023-12-29 17:30:57 +08:00
Hao Chen
179ed51d3b
Add dgemm_kernel_8x4.S file.
2023-12-29 17:30:57 +08:00
Hao Chen
173a65d4e6
loongarch64: Add and refine iamax optimization functions.
2023-12-29 17:30:57 +08:00
zhoupeng
ea70e165c7
loongarch64: Refine rot optimization.
2023-12-29 17:30:57 +08:00
zhoupeng
116aee7527
loongarch64: Refine imin optimization.
2023-12-29 17:30:57 +08:00
zhoupeng
8be2654193
loongarch64: Refine imax optimization.
2023-12-29 17:30:57 +08:00
zhoupeng
154baad454
loongarch64: Refine iamin optimization.
2023-12-29 17:30:57 +08:00
Shiyou Yin
36c12c4971
loongarch64: Refine copy,swap,nrm2,sum optimization.
2023-12-29 17:30:57 +08:00
Shiyou Yin
c6996a80e9
loongarch64: Refine amax,amin,max,min optimization.
2023-12-29 17:30:57 +08:00
Martin Kroeker
21564bde2c
Merge pull request #4394 from martin-frbg/dyn_vortex
...
Add Apple M as NeoverseN1 in ARM64 DYNAMIC_ARCH runtime detection
2023-12-28 13:35:55 +01:00
Martin Kroeker
e9c32ed165
Merge pull request #4384 from yetist/develop
...
Fix: build failed on LoongArch
2023-12-27 14:05:01 +01:00
Martin Kroeker
e7a895e714
Add Apple M as NeoverseN1
2023-12-25 12:36:05 +01:00
Martin Kroeker
474ce0ace9
Merge pull request #4393 from martin-frbg/pr4389-2
...
Remove redundant targets from the default ARM64 DYNAMIC_ARCH list
2023-12-25 12:30:56 +01:00
Martin Kroeker
1106460bb3
remove redundant targets from the default ARM64 DYNAMIC_ARCH list
2023-12-25 12:29:56 +01:00
Martin Kroeker
236acee706
Merge pull request #4389 from Mousius/reduce-dynamic-targets
...
Use functionally equivalent dynamic targets
2023-12-25 12:27:42 +01:00
Xiaotian Wu
d2f4f1b28a
CI: update toolchains for LoongArch64
2023-12-25 16:04:43 +08:00
Wu Xiaotian
0baf462dbc
Fix: build failed on LoongArch
...
According to the documentation at https://github.com/loongson/la-abi-specs/blob/release/lapcs.adoc#the-base-abi-variants , valid -mabi parameters are lp64s, lp64f, lp64d, ilp32s, ilp32f and ilp32d.
2023-12-25 16:04:43 +08:00
Martin Kroeker
63a83939a1
Merge pull request #4390 from Mousius/reduce-kernel-duplication
...
Reduce duplication in kernel definitions
2023-12-24 18:04:26 +01:00
Martin Kroeker
dba404055d
Merge pull request #4392 from martin-frbg/lapack959
...
Fix issues related to the ?GEDMD functions (Reference-LAPACK PR 959)
2023-12-24 10:44:15 +01:00
Martin Kroeker
c6fa921027
Add tests for ?GEDMD (Reference-LAPACK PR 959)
2023-12-23 23:39:53 +01:00
Martin Kroeker
283713e4c5
Add tests for ?GEDMD (Reference-LAPACK PR 959)
2023-12-23 23:32:45 +01:00
Martin Kroeker
201f22f49a
Fix issues related to ?GEDMD (Reference-LAPACK PR 959)
2023-12-23 23:27:38 +01:00
Martin Kroeker
05dde8ef04
Merge pull request #4391 from martin-frbg/lapack942
...
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 23:11:46 +01:00
Martin Kroeker
45ef0d7361
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 20:16:33 +01:00
Martin Kroeker
c082669ad4
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 20:05:03 +01:00
Martin Kroeker
29d6024ec5
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 19:44:11 +01:00
Martin Kroeker
0814491d96
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 19:37:03 +01:00
Martin Kroeker
5c11b2ff41
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 19:27:20 +01:00
Martin Kroeker
8ce44c18a0
Handle corner cases of LWORK (Reference-LAPACK PR 942)
2023-12-23 19:24:10 +01:00
Chris Sidebottom
dc20a78188
Use functionally equivalent dynamic targets
...
Similar to `drivers/other/dynamic.c`, I've looked for functionally
equivalent targets and mapped them in the default DYNAMIC_ARCH build.
Users can still build specific cores using DYNAMIC_LIST.
2023-12-23 12:45:27 +00:00