Ashwin Sekhar T K
97d671eb61
arm: add softfp support in zgemm/ztrmm vfp kernels
2017-07-02 02:54:32 +05:30
Ashwin Sekhar T K
305cd2e8b4
arm: add softfp support in cgemm/ctrmm vfp kernels
2017-07-02 02:42:32 +05:30
Ashwin Sekhar T K
09bc6ebe5b
arm: add softfp support in dgemm/dtrmm vfp kernels
2017-07-02 02:24:38 +05:30
Ashwin Sekhar T K
872a11a2bf
arm: add softfp support in sgemm/strmm vfp kernels
2017-07-02 02:23:48 +05:30
Ashwin Sekhar T K
8f83d3f961
arm: add softfp support in vfp gemv kernels
2017-07-02 01:03:31 +05:30
Ashwin Sekhar T K
e25f4c01d6
arm: add softfp support in kernel/arm/nrm2_vfp*.S
2017-07-01 19:57:28 +05:30
Ashwin Sekhar T K
89924b3d5b
arm: Use assembly implementations based on the ARM abi
...
In case of softfp abi, assembly implementations of only those APIs are
used which doesnt have a floating point argument or return value.
In case of hard abi, all assembly implementations are used.
2017-06-30 18:21:05 +05:30
Werner Saar
63a7d7fb24
updated gemv_n_vfpv3.S for armv7
2016-01-25 15:00:13 +01:00
Werner Saar
de3e2d4349
updated trmm kernels for armv7
2016-01-25 11:08:56 +01:00
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
2014-07-09 12:21:39 +02:00
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
...
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
2014-06-27 12:05:18 -07:00
wernsaar
c38379c9dd
bugfixes for ARM regarding lapack tests
2014-05-14 13:03:45 +02:00
wernsaar
f9daebba0a
checked in bugfixes for ARM
2014-02-16 11:45:47 +01:00
wernsaar
2d3c884294
added complex gemv kernels for ARMV6 and ARMV7
2013-11-29 17:06:33 +01:00
wernsaar
d2b20c5c51
add optimized axpy kernel
2013-11-25 12:25:58 +01:00
wernsaar
7f210587f0
renamed some ncopy and tcopy files
2013-11-22 00:20:25 +01:00
wernsaar
19cd5c64a2
renamed swap_vfpv3.S to swap_vfp.S
2013-11-21 23:19:32 +01:00
wernsaar
9adf87495e
renamed some dot kernels
2013-11-21 23:07:51 +01:00
wernsaar
cd93cae5a7
renamed rot_vfpv3.S to rot_vfp.S
2013-11-21 22:49:28 +01:00
wernsaar
8565afb3c2
renamed asum_vfpv3.S to asum_vfp.S
2013-11-21 22:26:27 +01:00
wernsaar
5bf7cf8d67
renamed scal_vfpv3.S to scal_vfp.S
2013-11-21 22:03:36 +01:00
wernsaar
29a005c635
renamed iamax assembler kernel
2013-11-21 21:12:33 +01:00
wernsaar
f1be3a168a
renamed some BLAS kernels, which are compatible to ARMV6
2013-11-21 20:48:57 +01:00
wernsaar
bf04544902
added gemv_n kernel for single and double precision
2013-11-19 15:07:20 +01:00
wernsaar
86283c0be1
added gemv_t kernel for single and double precision
2013-11-19 09:55:54 +01:00
wernsaar
f27cabfd08
added nrm2 kernel for all precisions
2013-11-16 16:17:17 +01:00
wernsaar
23dd474cd0
added rot kernel for all precisions
2013-11-15 14:08:57 +01:00
wernsaar
f1b452e160
added scal kernel for all precisions
2013-11-15 11:56:43 +01:00
wernsaar
3dabd7e6e6
added swap-kernel for all precisions
2013-11-14 19:06:19 +01:00
wernsaar
6f4a0ebe38
added max- und min-kernels for all precisions
2013-11-14 13:52:47 +01:00
wernsaar
00f33c0134
added asum_kernel for all precisions and complex
2013-11-11 14:20:59 +01:00
wernsaar
5b36cc0f47
added blas level1 dot kernels for complex and double complex
2013-11-08 09:08:11 +01:00
wernsaar
c8f1aeb154
added optimized blas level1 dot kernels for single and double precision
2013-11-07 17:22:03 +01:00
wernsaar
80a2e901b1
added dgemm_tcopy_4_vfpv3.S and sgemm_tcopy_4_vfpv3.S
2013-11-06 20:01:18 +01:00
wernsaar
ac50bccbd2
added cgemm_ncopy_2_vfpv3.S and made assembler labels unique
2013-11-05 20:21:35 +01:00
wernsaar
82015beaef
added zgemm_ncopy_2_vfpv3.S and made assembler labels unique
2013-11-05 19:31:22 +01:00
wernsaar
02bc36ac79
added sgemm_ncopy routine and made some improvements on cgemm_kernel for ARMV7
2013-11-01 18:22:27 +01:00
wernsaar
85484a42df
added kernels for cgemm, ctrmm, zgemm and ztrmm
2013-10-16 18:00:41 +02:00
wernsaar
e0b968c3a7
Changed kernels for dgemm and dtrmm
2013-10-05 12:59:44 +02:00
wernsaar
70411af888
initial checkin of kernel/arm
2013-09-28 19:02:25 +02:00