wernsaar
|
c4d9d4e5f8
|
added haswell optimized kernel
|
2014-09-08 12:25:16 +02:00 |
wernsaar
|
7c0a94ff47
|
bugfix in sgemv_n_microk_haswell-4.c
|
2014-09-08 10:54:33 +02:00 |
wernsaar
|
cbbc80aad3
|
added optimized sgemv_t kernel for haswell
|
2014-09-08 10:13:39 +02:00 |
wernsaar
|
2be5c7a640
|
bugfix for windows
|
2014-09-07 21:48:42 +02:00 |
wernsaar
|
80f7786875
|
enabled optimized sgemv kernels for piledriver
|
2014-09-07 21:13:57 +02:00 |
wernsaar
|
553e275407
|
optimized sgemv_n kernel for sandybridge
|
2014-09-07 20:53:30 +02:00 |
wernsaar
|
7b3932b3f3
|
optimized sgemv_n kernel for nehalem
|
2014-09-07 19:20:08 +02:00 |
wernsaar
|
75207b1148
|
optimized sgemv_n for very small size of m
|
2014-09-07 18:23:48 +02:00 |
wernsaar
|
274828fa50
|
optimizations for very small sizes
|
2014-09-07 13:45:03 +02:00 |
wernsaar
|
5ae1731fe6
|
better optimizations for sgemv_t kernel
|
2014-09-06 21:28:57 +02:00 |
wernsaar
|
c8eaf3ae2d
|
optimized sgemv_t_4 kernel for very small sizes
|
2014-09-06 19:41:57 +02:00 |
wernsaar
|
3a7ab47ee9
|
optimized sgemv_t
|
2014-09-06 18:34:25 +02:00 |
wernsaar
|
cf5544b417
|
optimization for small size
|
2014-09-06 13:17:56 +02:00 |
wernsaar
|
d143f84dd2
|
added optimized sgemv_n kernel for haswell
|
2014-09-06 12:08:48 +02:00 |
wernsaar
|
7794237475
|
undef WHEREAMI
|
2014-09-06 11:01:42 +02:00 |
wernsaar
|
a64fe9bcc9
|
added optimized sgemv_n kernel for sandybridge
|
2014-09-06 08:41:53 +02:00 |
wernsaar
|
2021d0f9d6
|
experimentally removed expensive function calls
|
2014-09-05 15:05:53 +02:00 |
wernsaar
|
6df7a88930
|
optimized sgemv_t for sandybridge
|
2014-09-05 10:22:50 +02:00 |
wernsaar
|
53de943690
|
bugfix for sgemv_n_4.c
|
2014-09-04 18:55:52 +02:00 |
wernsaar
|
7f910010a0
|
optimized sgemv_n kernel for small sizes
|
2014-09-04 13:09:27 +02:00 |
wernsaar
|
3a5d8dbff9
|
optimized sgemv_n_4.c
|
2014-09-03 15:34:30 +02:00 |
wernsaar
|
2a60c6d4b0
|
optimized sgemv_n for small sizes
|
2014-09-03 14:48:45 +02:00 |
wernsaar
|
0fc560ba23
|
bugfix for buffer overflow
|
2014-09-03 10:13:47 +02:00 |
wernsaar
|
d1800397f5
|
optimized interface/gemv.c for multithreading
|
2014-09-02 17:36:07 +02:00 |
wernsaar
|
f4ff889491
|
updated interface/gemv.c for multithreading
|
2014-09-02 16:30:04 +02:00 |
wernsaar
|
210bec9111
|
added plot-header to compare multithreading
|
2014-09-02 14:11:42 +02:00 |
wernsaar
|
f3b50dcf5b
|
removed obsolete instructions from sgemv_t_4.c
|
2014-09-02 13:35:41 +02:00 |
wernsaar
|
93eaba959d
|
optimized sgemv_t for bulldozer
|
2014-09-02 12:42:36 +02:00 |
wernsaar
|
9570e56965
|
optimized sgemv_t_4.c for small sizes
|
2014-09-01 15:11:37 +02:00 |
wernsaar
|
d7f91f8b4f
|
extended gemv.c benchmark
|
2014-09-01 15:07:36 +02:00 |
wernsaar
|
53f1277b6b
|
modified benchmark/gemv.c
|
2014-08-31 15:38:18 +02:00 |
wernsaar
|
bc99faef1b
|
optimized sgemv_t_4.c for uneven sizes
|
2014-08-31 14:33:15 +02:00 |
wernsaar
|
848c0f16f7
|
optimized sgemv_t_4.c for small size
|
2014-08-31 13:23:44 +02:00 |
wernsaar
|
e2fc8c8c2c
|
changed 1 test value (bug in lapack-testing?)
|
2014-08-30 13:58:02 +02:00 |
wernsaar
|
53e6dbf6ca
|
optimized sgemv_t kernel for small sizes
|
2014-08-30 13:36:27 +02:00 |
Zhang Xianyi
|
2702323f7d
|
Merge pull request #440 from wernsaar/develop
optimizations for level1 and level2 blas functions
|
2014-08-28 12:43:54 +08:00 |
wernsaar
|
20cd850125
|
modification for clang compiler
|
2014-08-27 09:00:20 +02:00 |
wernsaar
|
5fa6158731
|
removed flag no-integrated-as, because not working on macosx
|
2014-08-26 18:29:40 +02:00 |
wernsaar
|
84badf8086
|
EXPERIMENTAL: added the flag -no-integrated-as for clang compiler in Makefile.system
|
2014-08-26 17:36:32 +02:00 |
Zhang Xianyi
|
c8cc4a0d22
|
Fixed the typo in Changelog.txt
|
2014-08-26 16:14:34 +08:00 |
wernsaar
|
3885eebdb8
|
added optimized zaxpy bulldozer kernel
|
2014-08-25 15:52:35 +02:00 |
wernsaar
|
ee74445155
|
added optimized caxpy kernel for bulldozer
|
2014-08-25 14:53:28 +02:00 |
wernsaar
|
9d2ace8bac
|
added optimized daxpy kernel for bulldozer
|
2014-08-24 10:57:12 +02:00 |
wernsaar
|
b55f997302
|
added optimized daxpy kernel for nehalem
|
2014-08-23 17:53:07 +02:00 |
wernsaar
|
29125864b3
|
updated gemm.c
|
2014-08-23 17:28:01 +02:00 |
wernsaar
|
e45c960c2c
|
added optimized saxpy kernel for nehalem
|
2014-08-23 17:15:21 +02:00 |
wernsaar
|
55e81da379
|
added axpy benchmark-test
|
2014-08-23 13:12:44 +02:00 |
wernsaar
|
ac76b6267f
|
added optimized dgemv_n kernel for nehalem
|
2014-08-23 10:40:57 +02:00 |
wernsaar
|
f1b96c4846
|
added optimized ddot kernel for bulldozer
|
2014-08-22 21:19:29 +02:00 |
wernsaar
|
16d6be852d
|
added optimized ddot kernel for nehalem
|
2014-08-22 20:34:41 +02:00 |