wernsaar
|
1a352b24e6
|
updated KERNEL.HASWELL
|
2014-09-13 12:23:27 +02:00 |
wernsaar
|
5194818d4b
|
updated zgemv_t_4.c
|
2014-09-13 09:48:34 +02:00 |
wernsaar
|
8a39cdb1c1
|
added optimized zgemv_t kernel for haswell
|
2014-09-13 09:47:07 +02:00 |
wernsaar
|
fd2478c9e2
|
optimized interface/zgemv.c for multithreading
|
2014-09-12 19:18:23 +02:00 |
wernsaar
|
0a1390f2d8
|
enabled optimized zgemv_t kernel for bulldozer
|
2014-09-12 17:43:47 +02:00 |
wernsaar
|
a8b0812feb
|
optimized zgemv_t for bulldozer
|
2014-09-12 17:42:25 +02:00 |
wernsaar
|
a0fb68ab42
|
added optimized zgemv_t kernel for bulldozer
|
2014-09-12 17:04:22 +02:00 |
wernsaar
|
44c11165d5
|
bugfix in cgemv_t_4.c
|
2014-09-12 14:12:24 +02:00 |
wernsaar
|
564be4eb72
|
added optimized cgemv_t kernel
|
2014-09-12 13:38:01 +02:00 |
wernsaar
|
107c3ea7d5
|
added optimized zgemv_t routine
|
2014-09-12 12:35:20 +02:00 |
wernsaar
|
bb8d698335
|
optimized zgemv_n_microk_haswell-4.c for small size
|
2014-09-11 13:44:55 +02:00 |
wernsaar
|
e0192a6914
|
bugfix in zgemv_n_4.c
|
2014-09-11 13:18:00 +02:00 |
wernsaar
|
bced4594bb
|
added optimized zgemv_n kernel
|
2014-09-11 12:34:57 +02:00 |
wernsaar
|
cafba99b6b
|
bufix in cgemv_n_microk_haswell-4.c
|
2014-09-11 11:12:44 +02:00 |
wernsaar
|
ac8f232b2a
|
more optimizations
|
2014-09-11 10:25:48 +02:00 |
wernsaar
|
f98e1244c4
|
optimized cgemv_n_4.c
|
2014-09-10 19:26:14 +02:00 |
wernsaar
|
be95700b30
|
added optimized cgemv_kernel for haswell
|
2014-09-10 14:11:24 +02:00 |
wernsaar
|
4aa534ae93
|
added cgemv_n kernel, optimized for small sizes
|
2014-09-10 13:45:13 +02:00 |
Zhang Xianyi
|
1cba8e7b11
|
Merge pull request #446 from grisuthedragon/cblas_matcopy
Add a CBLAS interface for the BLAS extension s/d/c/z*matcopy routines.
|
2014-09-10 16:31:31 +08:00 |
Zhang Xianyi
|
d13e92f07e
|
Merge pull request #445 from wernsaar/develop
A lot of optimizations for gemv kernels
|
2014-09-10 16:28:14 +08:00 |
wernsaar
|
baa46e4fba
|
added and tested optimized dgemv_n kernel for haswell
|
2014-09-09 16:17:45 +02:00 |
wernsaar
|
faab7a181d
|
added optimized dgemv_n kernel for haswell
|
2014-09-09 15:32:32 +02:00 |
wernsaar
|
8109d8232c
|
optimized dgemv_t kernel for haswell
|
2014-09-09 14:38:08 +02:00 |
wernsaar
|
debc6d1a05
|
bugfix in KERNEL.HASWELL
|
2014-09-09 14:04:44 +02:00 |
wernsaar
|
e73a0113ec
|
added optimized gemv kernels
|
2014-09-09 13:54:55 +02:00 |
wernsaar
|
44f2bf9bae
|
added optimized dgemv_t kernel for haswell
|
2014-09-09 13:34:22 +02:00 |
Martin Koehler
|
a057e5434d
|
add CBLAS interface for s/d/c/zimatcopy
|
2014-09-09 09:52:13 +02:00 |
wernsaar
|
cd34e9701b
|
removed obsolete files
|
2014-09-08 19:15:31 +02:00 |
Martin Köhler
|
7794766d3c
|
Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.
|
2014-09-08 17:57:44 +02:00 |
wernsaar
|
658939faaa
|
optimized dgemv_n kernel for small sizes
|
2014-09-08 15:22:35 +02:00 |
wernsaar
|
f511807fc0
|
modified multithreading threshold
|
2014-09-08 12:27:32 +02:00 |
wernsaar
|
c4d9d4e5f8
|
added haswell optimized kernel
|
2014-09-08 12:25:16 +02:00 |
wernsaar
|
7c0a94ff47
|
bugfix in sgemv_n_microk_haswell-4.c
|
2014-09-08 10:54:33 +02:00 |
wernsaar
|
cbbc80aad3
|
added optimized sgemv_t kernel for haswell
|
2014-09-08 10:13:39 +02:00 |
wernsaar
|
2be5c7a640
|
bugfix for windows
|
2014-09-07 21:48:42 +02:00 |
wernsaar
|
80f7786875
|
enabled optimized sgemv kernels for piledriver
|
2014-09-07 21:13:57 +02:00 |
wernsaar
|
553e275407
|
optimized sgemv_n kernel for sandybridge
|
2014-09-07 20:53:30 +02:00 |
wernsaar
|
7b3932b3f3
|
optimized sgemv_n kernel for nehalem
|
2014-09-07 19:20:08 +02:00 |
wernsaar
|
75207b1148
|
optimized sgemv_n for very small size of m
|
2014-09-07 18:23:48 +02:00 |
wernsaar
|
274828fa50
|
optimizations for very small sizes
|
2014-09-07 13:45:03 +02:00 |
wernsaar
|
5ae1731fe6
|
better optimzations for sgemv_t kernel
|
2014-09-06 21:28:57 +02:00 |
wernsaar
|
c8eaf3ae2d
|
optimized sgemv_t_4 kernel for very small sizes
|
2014-09-06 19:41:57 +02:00 |
wernsaar
|
3a7ab47ee9
|
optimized sgemv_t
|
2014-09-06 18:34:25 +02:00 |
wernsaar
|
cf5544b417
|
optimization for small size
|
2014-09-06 13:17:56 +02:00 |
wernsaar
|
d143f84dd2
|
added optimized sgemv_n kernel for haswell
|
2014-09-06 12:08:48 +02:00 |
wernsaar
|
7794237475
|
undef WHEREAMI
|
2014-09-06 11:01:42 +02:00 |
wernsaar
|
a64fe9bcc9
|
added optimized sgemv_n kernel for sandybridge
|
2014-09-06 08:41:53 +02:00 |
wernsaar
|
2021d0f9d6
|
experimentally removed expensive function calls
|
2014-09-05 15:05:53 +02:00 |
wernsaar
|
6df7a88930
|
optimized sgemv_t for sandybridge
|
2014-09-05 10:22:50 +02:00 |
wernsaar
|
53de943690
|
bugfix for sgemv_n_4.c
|
2014-09-04 18:55:52 +02:00 |