Werner Saar
|
ceaa931e48
|
updated gemv kernel for armv6
|
2016-01-24 16:31:19 +01:00 |
Werner Saar
|
eaa63165df
|
updated cgemv and zgemv kernels for armv6
|
2016-01-24 14:42:38 +01:00 |
Werner Saar
|
c65357c566
|
updated trmm_kernels for armv6
|
2016-01-24 13:03:33 +01:00 |
Werner Saar
|
e63e9f9f26
|
updated gemm_kernels for armv6
|
2016-01-24 11:55:50 +01:00 |
Werner Saar
|
aafd3ab60e
|
updated cdot and zdot on arm
|
2016-01-24 10:56:49 +01:00 |
Werner Saar
|
d2f84c9c8a
|
Ref #740: updated nrm2_vfp.S
|
2016-01-23 17:47:58 +01:00 |
Werner Saar
|
ca32253f32
|
Ref #740: updated asum_vfp.S and iamax_vfp.S
|
2016-01-23 14:44:34 +01:00 |
Werner Saar
|
9066d1f982
|
Ref #750 and Ref #740 : bugfix for sdot, dsdot and ddot on arm
|
2016-01-23 11:59:51 +01:00 |
Werner Saar
|
692d9c881c
|
Ref #740: simple solution to clear floating point register on arm
|
2016-01-17 15:37:12 +01:00 |
Zhang Xianyi
|
3602a2cd1f
|
#736 Revert #733 patch to fix bus error on ARM.
|
2016-01-12 22:19:58 +00:00 |
Yichao Yu
|
594b9f4c73
|
Do not use vsub to clear the register values since it doesn't work with non-normal numbers.
|
2016-01-05 16:54:05 +00:00 |
Ralph Campbell
|
c053559ed9
|
Minor C code fixes in kernel/arm
|
2015-11-09 14:15:49 +05:30 |
Zhang Xianyi
|
f874465bb8
|
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
Disable CBLAS and LAPACK.
|
2015-08-10 14:10:44 -05:00 |
Zhang Xianyi
|
ab0a0a75fc
|
Merge branch 'develop' into cmake
|
2015-08-03 23:59:01 -05:00 |
Zhang Xianyi
|
1cf2b10224
|
Use pure C generic target on x86 and x86_64.
make TARGET=GENERIC
?gemm3m is unimplemented on generic target.
|
2015-08-03 23:55:56 -05:00 |
Zhang Xianyi
|
dcd5ba4443
|
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
|
2015-07-22 04:06:39 +08:00 |
Hank Anderson
|
84d90d6ed8
|
Fixed some compiler errors/warnings for clang.
|
2015-02-25 11:52:25 -06:00 |
Zhang Xianyi
|
ea7f9dacf4
|
Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1.
|
2015-02-26 01:47:11 +08:00 |
Martin Koehler
|
39cc6b21d3
|
Add ATLAS-style ?geadd function
|
2015-02-16 13:46:20 +01:00 |
Zhang Xianyi
|
229ce2ccd1
|
Add cortex-a9 and cortex-a15 targets.
|
2015-01-12 08:55:29 +00:00 |
wernsaar
|
101dd08173
|
add reference in C for symv_U
|
2014-08-16 13:52:50 +02:00 |
wernsaar
|
493d4fe7e5
|
added reference in C for symv_L
|
2014-08-16 11:36:48 +02:00 |
wernsaar
|
c4a423a642
|
bugfixes for lapack on ARM Platform
|
2014-07-09 12:21:39 +02:00 |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
2014-06-27 12:05:18 -07:00 |
wernsaar
|
ff9cfca24c
|
Ref #385: added missing return instruction
|
2014-06-12 15:52:14 +02:00 |
wernsaar
|
cee257f384
|
Ref #51: added blas extensions zomatcopy and comatcopy
|
2014-06-10 10:34:54 +02:00 |
wernsaar
|
8c8f596238
|
Ref #51: added blas extension domatcopy as non-optimized reference
|
2014-06-09 17:11:07 +02:00 |
wernsaar
|
faf3ac0aad
|
Ref #285: added axpby kernels
|
2014-06-08 11:54:24 +02:00 |
Zhang Xianyi
|
406f5bd22b
|
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/arm/KERNEL.ARMV6
|
2014-05-21 11:24:39 +08:00 |
wernsaar
|
aaddb05411
|
bugfix for ARMV6
|
2014-05-17 13:00:36 +02:00 |
wernsaar
|
e826a5a6af
|
some modifications regarding lapack test
|
2014-05-16 20:37:41 +02:00 |
wernsaar
|
c38379c9dd
|
bugfixes for ARM regarding lapack tests
|
2014-05-14 13:03:45 +02:00 |
wernsaar
|
a0b07c1440
|
bugfixes for ARM regarding lapack tests
|
2014-05-14 12:59:20 +02:00 |
wernsaar
|
43fbdb7a5a
|
added ARMV5 as reference platform
|
2014-05-13 17:25:19 +02:00 |
wernsaar
|
777cebc8c7
|
added ZERO check to zscal.c because of a bug in lapack-testing
|
2014-05-13 16:31:00 +02:00 |
wernsaar
|
aa5c73e20f
|
added ZERO check to zscal.c because of a bug in lapack-test
|
2014-05-13 16:25:21 +02:00 |
wernsaar
|
5e5ef28ca0
|
added ZERO check because of a bug in lapack-test
|
2014-05-13 15:36:03 +02:00 |
wernsaar
|
650ed34336
|
added ZERO check because of a bug in lapack-test
|
2014-05-13 15:31:36 +02:00 |
wernsaar
|
f9daebba0a
|
checked in bugfixes for ARM
|
2014-02-16 11:45:47 +01:00 |
wernsaar
|
2d3c884294
|
added complex gemv kernels for ARMV6 and ARMV7
|
2013-11-29 17:06:33 +01:00 |
wernsaar
|
d54a061713
|
optimized gemv_n_vfp.S
|
2013-11-28 17:40:21 +01:00 |
wernsaar
|
86afb47e83
|
added optimized ctrmm kernel for ARMV6
|
2013-11-28 14:35:07 +01:00 |
wernsaar
|
42a4dff056
|
added optimized ztrmm kernel for ARMV6
|
2013-11-28 13:41:06 +01:00 |
wernsaar
|
5bc322a66c
|
optimized strmm kernel for ARMV6
|
2013-11-28 12:45:38 +01:00 |
wernsaar
|
dec7ad0dfd
|
optimized dtrmm kernel for ARMV7
|
2013-11-28 12:32:12 +01:00 |
wernsaar
|
274304bd03
|
add optimized cgemm kernel for ARMV6
|
2013-11-28 11:54:38 +01:00 |
wernsaar
|
5007a534c4
|
optimized zgemm kernel for ARMV6
|
2013-11-28 10:04:43 +01:00 |
wernsaar
|
a537d7d8d7
|
optimized zgemm_kernel_2x2_vfp.S
|
2013-11-28 08:33:44 +01:00 |
wernsaar
|
b42145834f
|
optimized sgemm kernel for ARMV6
|
2013-11-28 08:08:08 +01:00 |
wernsaar
|
3d5e792c72
|
optimized sgemm kernel for ARMV6
|
2013-11-27 18:38:32 +01:00 |
wernsaar
|
a9bd12da2c
|
optimized dgemm kernel for ARMV6
|
2013-11-27 17:37:38 +01:00 |
wernsaar
|
697e198e8a
|
added zgemm_kernel for ARMV6
|
2013-11-27 16:15:06 +01:00 |
wernsaar
|
36b0f7fe1d
|
added optimized gemv_t kernel for ARMV6
|
2013-11-25 19:31:27 +01:00 |
wernsaar
|
d2b20c5c51
|
add optimized axpy kernel
|
2013-11-25 12:25:58 +01:00 |
wernsaar
|
25c6050593
|
add single and double precision gemv_n kernel for ARMV6
|
2013-11-24 12:03:28 +01:00 |
wernsaar
|
12e02a00e0
|
added ncopy kernels for ARMV6
|
2013-11-24 08:46:47 +01:00 |
wernsaar
|
29a3196f56
|
added optimized sgemm and strmm kernel for ARMV6
|
2013-11-23 18:09:41 +01:00 |
wernsaar
|
8776a73773
|
added optimized dgemm and dtrmm kernel for ARMV6
|
2013-11-23 16:24:52 +01:00 |
wernsaar
|
7e84acd3e8
|
fixed bug in SAVE macros that was not found by any test routine
|
2013-11-23 14:35:19 +01:00 |
wernsaar
|
33d3ab6e09
|
small optimizations for zgemv kernels
|
2013-11-23 12:35:31 +01:00 |
wernsaar
|
9a0f978929
|
added nrm2 kernel for ARMV6
|
2013-11-22 17:21:10 +01:00 |
wernsaar
|
7f210587f0
|
renamed some ncopy and tcopy files
|
2013-11-22 00:20:25 +01:00 |
wernsaar
|
9f0a3a35b3
|
removed obsolete file sdot_vfpv3.S
|
2013-11-21 23:42:54 +01:00 |
wernsaar
|
dbae93110b
|
added sdot_vfp.S
|
2013-11-21 23:34:51 +01:00 |
wernsaar
|
19cd5c64a2
|
renamed swap_vfpv3.S to swap_vfp.S
|
2013-11-21 23:19:32 +01:00 |
wernsaar
|
9adf87495e
|
renamed some dot kernels
|
2013-11-21 23:07:51 +01:00 |
wernsaar
|
440db4cdda
|
delete rot_vfpv3.S
|
2013-11-21 22:52:24 +01:00 |
wernsaar
|
cd93cae5a7
|
renamed rot_vfpv3.S to rot_vfp.S
|
2013-11-21 22:49:28 +01:00 |
wernsaar
|
8565afb3c2
|
renamed asum_vfpv3.S to asum_vfp.S
|
2013-11-21 22:26:27 +01:00 |
wernsaar
|
5bf7cf8d67
|
renamed scal_vfpv3.S to scal_vfp.S
|
2013-11-21 22:03:36 +01:00 |
wernsaar
|
29a005c635
|
renamed iamax assembler kernel
|
2013-11-21 21:12:33 +01:00 |
wernsaar
|
f1be3a168a
|
renamed some BLAS kernels that are compatible with ARMV6
|
2013-11-21 20:48:57 +01:00 |
wernsaar
|
410afda9b4
|
added cpu detection and target ARMV6, used in Raspberry Pi
|
2013-11-21 20:18:51 +01:00 |
wernsaar
|
bf04544902
|
added gemv_n kernel for single and double precision
|
2013-11-19 15:07:20 +01:00 |
wernsaar
|
86283c0be1
|
added gemv_t kernel for single and double precision
|
2013-11-19 09:55:54 +01:00 |
wernsaar
|
f27cabfd08
|
added nrm2 kernel for all precisions
|
2013-11-16 16:17:17 +01:00 |
wernsaar
|
23dd474cd0
|
added rot kernel for all precisions
|
2013-11-15 14:08:57 +01:00 |
wernsaar
|
f1b452e160
|
added scal kernel for all precisions
|
2013-11-15 11:56:43 +01:00 |
wernsaar
|
3dabd7e6e6
|
added swap-kernel for all precisions
|
2013-11-14 19:06:19 +01:00 |
wernsaar
|
6f4a0ebe38
|
added max- and min-kernels for all precisions
|
2013-11-14 13:52:47 +01:00 |
wernsaar
|
f750103336
|
small optimizations on dot-kernels
|
2013-11-11 15:47:56 +01:00 |
wernsaar
|
00f33c0134
|
added asum_kernel for all precisions and complex
|
2013-11-11 14:20:59 +01:00 |
wernsaar
|
5b36cc0f47
|
added blas level1 dot kernels for complex and double complex
|
2013-11-08 09:08:11 +01:00 |
wernsaar
|
c8f1aeb154
|
added optimized blas level1 dot kernels for single and double precision
|
2013-11-07 17:22:03 +01:00 |
wernsaar
|
8fa93be06e
|
added optimized blas level1 copy kernels
|
2013-11-07 17:18:56 +01:00 |
wernsaar
|
1e8128f41c
|
added cgemm_tcopy_2_vfpv3.S and zgemm_tcopy_2_vfpv3.S
|
2013-11-07 17:15:50 +01:00 |
wernsaar
|
80a2e901b1
|
added dgemm_tcopy_4_vfpv3.S and sgemm_tcopy_4_vfpv3.S
|
2013-11-06 20:01:18 +01:00 |
wernsaar
|
ac50bccbd2
|
added cgemm_ncopy_2_vfpv3.S and made assembler labels unique
|
2013-11-05 20:21:35 +01:00 |
wernsaar
|
82015beaef
|
added zgemm_ncopy_2_vfpv3.S and made assembler labels unique
|
2013-11-05 19:31:22 +01:00 |
wernsaar
|
370e3834a9
|
added missing file kernel/arm/Makefile
|
2013-11-03 11:54:39 +01:00 |
wernsaar
|
e31186efd4
|
deleted obsolete dgemm_kernel and dtrmm_kernel
|
2013-11-02 13:12:21 +01:00 |
wernsaar
|
2b801a00a5
|
small optimizations on sgemm_kernel for ARMV7
|
2013-11-02 13:06:11 +01:00 |
wernsaar
|
b3eab8fcb7
|
minor optimizations on zgemm_kernel for ARMV7
|
2013-11-02 09:43:53 +01:00 |
wernsaar
|
02bc36ac79
|
added sgemm_ncopy routine and made some improvements on cgemm_kernel for ARMV7
|
2013-11-01 18:22:27 +01:00 |
wernsaar
|
85484a42df
|
added kernels for cgemm, ctrmm, zgemm and ztrmm
|
2013-10-16 18:00:41 +02:00 |
wernsaar
|
3983011f0b
|
added sgemm- and strmm_kernel
|
2013-10-14 08:22:27 +02:00 |
wernsaar
|
2a1515c9dd
|
added dgemm_ncopy_4_vfpv3.S
|
2013-10-12 16:48:29 +02:00 |
wernsaar
|
31f51e78bc
|
minor optimizations on dgemm_kernel
|
2013-10-12 09:42:18 +02:00 |
wernsaar
|
e0b968c3a7
|
Changed kernels for dgemm and dtrmm
|
2013-10-05 12:59:44 +02:00 |
wernsaar
|
1c63180bb6
|
updated dgemm_kernel_8x2_vfpv3.S
|
2013-09-30 17:31:23 +02:00 |