Commit Graph

664 Commits

Author SHA1 Message Date
wernsaar
cd93cae5a7 renamed rot_vfpv3.S to rot_vfp.S 2013-11-21 22:49:28 +01:00
wernsaar
8565afb3c2 renamed asum_vfpv3.S to asum_vfp.S 2013-11-21 22:26:27 +01:00
wernsaar
5bf7cf8d67 renamed scal_vfpv3.S to scal_vfp.S 2013-11-21 22:03:36 +01:00
wernsaar
29a005c635 renamed iamax assembler kernel 2013-11-21 21:12:33 +01:00
wernsaar
f1be3a168a renamed some BLAS kernels, which are compatible with ARMV6 2013-11-21 20:48:57 +01:00
wernsaar
410afda9b4 added cpu detection and target ARMV6, used in raspberry pi 2013-11-21 20:18:51 +01:00
wernsaar
bf04544902 added gemv_n kernel for single and double precision 2013-11-19 15:07:20 +01:00
wernsaar
86283c0be1 added gemv_t kernel for single and double precision 2013-11-19 09:55:54 +01:00
wernsaar
f27cabfd08 added nrm2 kernel for all precisions 2013-11-16 16:17:17 +01:00
wernsaar
23dd474cd0 added rot kernel for all precisions 2013-11-15 14:08:57 +01:00
wernsaar
f1b452e160 added scal kernel for all precisions 2013-11-15 11:56:43 +01:00
wernsaar
3dabd7e6e6 added swap-kernel for all precisions 2013-11-14 19:06:19 +01:00
wernsaar
6f4a0ebe38 added max- and min-kernels for all precisions 2013-11-14 13:52:47 +01:00
wernsaar
f750103336 small optimizations on dot-kernels 2013-11-11 15:47:56 +01:00
wernsaar
00f33c0134 added asum_kernel for all precisions and complex 2013-11-11 14:20:59 +01:00
wernsaar
5b36cc0f47 added blas level1 dot kernels for complex and double complex 2013-11-08 09:08:11 +01:00
wernsaar
c8f1aeb154 added optimized blas level1 dot kernels for single and double precision 2013-11-07 17:22:03 +01:00
wernsaar
8fa93be06e added optimized blas level1 copy kernels 2013-11-07 17:18:56 +01:00
wernsaar
1e8128f41c added cgemm_tcopy_2_vfpv3.S and zgemm_tcopy_2_vfpv3.S 2013-11-07 17:15:50 +01:00
wernsaar
80a2e901b1 added dgemm_tcopy_4_vfpv3.S and sgemm_tcopy_4_vfpv3.S 2013-11-06 20:01:18 +01:00
wernsaar
ac50bccbd2 added cgemm_ncopy_2_vfpv3.S and made assembler labels unique 2013-11-05 20:21:35 +01:00
wernsaar
82015beaef added zgemm_ncopy_2_vfpv3.S and made assembler labels unique 2013-11-05 19:31:22 +01:00
wernsaar
370e3834a9 added missing file kernel/arm/Makefile 2013-11-03 11:54:39 +01:00
wernsaar
95aedfa0ff added missing file arm/Makefile in lapack/laswp 2013-11-03 11:19:32 +01:00
wernsaar
cba97daf3c added missing file cblas_noconst.h to the armv7 branch 2013-11-03 11:04:16 +01:00
wernsaar
5400a9f4e4 redefined functions for TIMING and YIELDING for ARMV7 processor 2013-11-03 10:34:04 +01:00
wernsaar
e31186efd4 deleted obsolete dgemm_kernel and dtrmm_kernel 2013-11-02 13:12:21 +01:00
wernsaar
2b801a00a5 small optimizations on sgemm_kernel for ARMV7 2013-11-02 13:06:11 +01:00
wernsaar
b3eab8fcb7 minor optimizations on zgemm_kernel for ARMV7 2013-11-02 09:43:53 +01:00
wernsaar
02bc36ac79 added sgemm_ncopy routine and made some improvements on cgemm_kernel for ARMV7 2013-11-01 18:22:27 +01:00
wernsaar
2d49db2f5b moved compiler flags from Makefile.rule to Makefile.arm 2013-10-16 19:04:42 +02:00
wernsaar
04391e6d9c optimized param.h 2013-10-16 18:04:34 +02:00
wernsaar
85484a42df added kernels for cgemm, ctrmm, zgemm and ztrmm 2013-10-16 18:00:41 +02:00
wernsaar
3983011f0b added sgemm- and strmm_kernel 2013-10-14 08:22:27 +02:00
wernsaar
2a1515c9dd added dgemm_ncopy_4_vfpv3.S 2013-10-12 16:48:29 +02:00
wernsaar
31f51e78bc minor optimizations on dgemm_kernel 2013-10-12 09:42:18 +02:00
wernsaar
e0b968c3a7 Changed kernels for dgemm and dtrmm 2013-10-05 12:59:44 +02:00
wernsaar
93f1074dd4 changed some values for arm 2013-09-30 18:03:56 +02:00
wernsaar
1c63180bb6 updated dgemm_kernel_8x2_vfpv3.S 2013-09-30 17:31:23 +02:00
wernsaar
22a8fcc4b7 add modified c_check perl program 2013-09-29 19:42:33 +02:00
wernsaar
9965d48005 added Makefile.arm 2013-09-29 18:55:21 +02:00
wernsaar
4a474ea7dc changed dgemm_kernel to use fused multiply add 2013-09-29 17:46:23 +02:00
wernsaar
69ce737cc5 modified Makefile.L3 for ARM 2013-09-28 19:13:47 +02:00
wernsaar
d13788d1b4 common files modified for ARM 2013-09-28 19:10:32 +02:00
wernsaar
70411af888 initial checkin of kernel/arm 2013-09-28 19:02:25 +02:00
Zhang Xianyi
a746724e84 Added backers. 2013-09-05 15:39:45 +08:00
Lars Buitinck
3f7b0cd994 Merge pull request #290 from larsmans/missing-threshold
check if GEMM_MULTITHREAD_THRESHOLD defined in gemm.c
Set a fallback value.
2013-08-29 00:33:55 +08:00
Zhang Xianyi
cc6db2ecfe Merge pull request #291 from larsmans/fix-makefile-prefix
fix default prefix handling in makefiles
2013-08-28 09:26:16 -07:00
Zhang Xianyi
3175be4b3d Merge pull request #289 from larsmans/no-noconst
get rid of the generated cblas_noconst.h file
2013-08-28 09:25:23 -07:00
Lars Buitinck
a29e6592da fix default prefix handling in makefiles
PREFIX wasn't communicated to Makefile.install (where it matters)
by Makefile. The result is that the default PREFIX is empty and
OpenBLAS was being installed in /lib.
2013-08-28 17:41:31 +02:00
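
The fallback described in the pull request #290 entry above ("check if GEMM_MULTITHREAD_THRESHOLD defined in gemm.c", "Set a fallback value.") follows a common C preprocessor pattern. The sketch below is only an illustration of that pattern, not the code from the commit: the default value 4 and the helper gemm_use_threads are assumptions made for this example.

```c
#include <stdbool.h>

/* Fallback definition: applies only when the build system did not
   pass -DGEMM_MULTITHREAD_THRESHOLD=... on the command line.
   The value 4 is an illustrative assumption. */
#ifndef GEMM_MULTITHREAD_THRESHOLD
#define GEMM_MULTITHREAD_THRESHOLD 4
#endif

/* Hypothetical helper (not from the source): report whether an
   m x n x k matrix product is large enough to justify threading.
   The product is compared against the cube of the threshold. */
static bool gemm_use_threads(long m, long n, long k)
{
    double work  = (double)m * (double)n * (double)k;
    double limit = (double)GEMM_MULTITHREAD_THRESHOLD
                 * (double)GEMM_MULTITHREAD_THRESHOLD
                 * (double)GEMM_MULTITHREAD_THRESHOLD;
    return work > limit;
}
```

With the guard in place the file compiles whether or not the macro is supplied by the makefiles, and a value passed on the compiler command line still takes precedence over the fallback.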