Werner Saar
|
18e90ee2e3
|
bugfix: added static to functions
|
2015-05-13 13:31:26 +02:00 |
Werner Saar
|
e00cccc41e
|
added optimized dscal kernel for piledriver
|
2015-05-13 13:05:35 +02:00 |
Werner Saar
|
73f09bf64f
|
optimized dscal kernel for increment != 1
|
2015-05-13 12:14:39 +02:00 |
Werner Saar
|
02e772c7e4
|
added optimized dscal kernel for haswell
|
2015-05-12 17:19:58 +02:00 |
Werner Saar
|
7aee913991
|
added optimized dscal kernel for sandybridge
|
2015-05-12 16:27:43 +02:00 |
Werner Saar
|
e50a933037
|
added optimized dscal kernel for bulldozer
|
2015-05-12 12:28:44 +02:00 |
Werner Saar
|
133c11a156
|
updated dgemv_n kernel for nehalem
|
2015-04-30 14:38:06 +02:00 |
Werner Saar
|
30f52d53df
|
optimized dgemv_n kernel for haswell
|
2015-04-30 12:11:39 +02:00 |
Werner Saar
|
5e83d80725
|
optimized dger kernel for sandybridge
|
2015-04-28 16:58:11 +02:00 |
Werner Saar
|
b2e1797dc6
|
added optimized sger kernel for sandybridge
|
2015-04-28 15:33:38 +02:00 |
Werner Saar
|
e216f686cb
|
optimized saxpy and daxpy for sandybridge
|
2015-04-28 10:18:32 +02:00 |
Werner Saar
|
fc0e0391f3
|
bugfixes: replaced int with BLASLONG
|
2015-04-24 14:30:44 +02:00 |
Werner Saar
|
c22068c406
|
optimized sdot.c for increments != 1
|
2015-04-24 13:13:20 +02:00 |
Werner Saar
|
dee100d0e4
|
optimized saxpy.c for increments != 1
|
2015-04-24 11:52:59 +02:00 |
Werner Saar
|
0273966abb
|
optimized daxpy kernel for increments != 1
|
2015-04-24 11:39:17 +02:00 |
Werner Saar
|
3a67daa954
|
optimized ddot.c for increments != 1
|
2015-04-24 10:56:55 +02:00 |
Werner Saar
|
b4f2153dcd
|
added optimized ssymv kernels for sandybridge
|
2015-04-23 12:19:24 +02:00 |
Werner Saar
|
1c4b0eeae3
|
added optimized ssymv kernels for haswell
|
2015-04-23 10:23:13 +02:00 |
Werner Saar
|
1bec9abb9a
|
added optimized dsymv kernels for sandybridge
|
2015-04-22 12:09:43 +02:00 |
Werner Saar
|
3814bf60d3
|
added optimized dsymv kernels for haswell
|
2015-04-22 10:42:50 +02:00 |
Werner Saar
|
6d0db0151f
|
added optimized zaxpy-kernels
|
2015-04-16 11:19:37 +02:00 |
Zhang Xianyi
|
37b9033c90
|
Merge pull request #543 from jeromerobert/develop
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
|
2015-04-15 11:18:14 -05:00 |
Werner Saar
|
13889515b3
|
added optimized caxpy-kernel for sandybridge
|
2015-04-15 16:29:25 +02:00 |
Werner Saar
|
248c9340c3
|
added optimized caxpy-kernel for haswell
|
2015-04-15 15:16:31 +02:00 |
Werner Saar
|
e9f33b4ca7
|
added optimized caxpy-kernel for steamroller
|
2015-04-15 13:49:23 +02:00 |
Werner Saar
|
f5d847122a
|
updated caxpy_microk_bulldozer-2.c and caxpy.c
|
2015-04-15 11:59:38 +02:00 |
Jerome Robert
|
a4c96eca67
|
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
Refs #478, #482, 9798481 , fd9fd42
|
2015-04-15 11:46:48 +02:00 |
Werner Saar
|
baa0363ea2
|
add optimized ddot-kernel for piledriver
|
2015-04-14 15:09:13 +02:00 |
Werner Saar
|
34ba66606a
|
add optimized daxpy-kernel for piledriver
|
2015-04-14 14:23:29 +02:00 |
Werner Saar
|
f615dc7603
|
added optimized saxpy kernel for steamroller
|
2015-04-14 09:09:39 +02:00 |
Werner Saar
|
331c417637
|
optimized saxpy for piledriver
|
2015-04-14 08:34:11 +02:00 |
Werner Saar
|
d7a17ad85d
|
optimized sdot-kernel for pilediver
|
2015-04-13 13:19:21 +02:00 |
Werner Saar
|
d35f6c63c2
|
add optimized daxpy-kernel for steamroller
|
2015-04-13 12:22:43 +02:00 |
Werner Saar
|
166d76e864
|
added optimized sdot-kernel for steamroller
|
2015-04-11 08:48:18 +02:00 |
Werner Saar
|
f9f127d838
|
added optimized ddot kernel for steamroller
|
2015-04-10 16:18:03 +02:00 |
wernsaar
|
62231ab337
|
Merge pull request #538 from wernsaar/develop
Added optimized cdot- and zdot-kernels
|
2015-04-10 16:03:37 +02:00 |
Werner Saar
|
3119def9a7
|
updated cdot and zdot
|
2015-04-10 11:10:31 +02:00 |
Werner Saar
|
33b332372a
|
add optimized cdot- and zdot-kernel for sandybridge
|
2015-04-10 09:37:26 +02:00 |
Werner Saar
|
fd838c75bc
|
add optimized cdot- and zdot-kernel for haswell
|
2015-04-09 15:13:52 +02:00 |
Werner Saar
|
b57a60dac8
|
updated cdot and zdot for piledriver
|
2015-04-09 10:33:46 +02:00 |
Werner Saar
|
5c51163972
|
added optimized cdot- and zdot-kernel for steamroller
|
2015-04-09 09:45:23 +02:00 |
Werner Saar
|
9299d8cfd6
|
added optimized cdot- and zdot-kernels for bulldozer
|
2015-04-08 16:29:55 +02:00 |
Zhang Xianyi
|
0a3d3b945d
|
Refs #535. Fix the wrong vector instruction in sgemm sandy bridge kernel.
|
2015-04-08 03:55:49 +08:00 |
Werner Saar
|
60c6dec6e6
|
updated some lines for bulldozer
|
2015-04-06 18:47:16 +02:00 |
Werner Saar
|
47898cca35
|
added optimized saxpy- and daxpy-kernel for sandybridge
|
2015-04-06 16:05:16 +02:00 |
Werner Saar
|
53bb924287
|
added optimized saxpy- and daxpy-kernel for haswell
|
2015-04-06 12:33:16 +02:00 |
Werner Saar
|
a901b065d3
|
added optimized ddot-kernel for sandybridge
|
2015-04-05 20:19:38 +02:00 |
Werner Saar
|
3937e2a0a0
|
add optimized sdot-kernel for sandybridge
|
2015-04-05 19:47:05 +02:00 |
Werner Saar
|
9707d608d5
|
removed double definition line
|
2015-04-05 18:35:34 +02:00 |
Werner Saar
|
701b9d7556
|
added optimized sdot- and ddot-kernel for HASWELL
|
2015-04-05 17:57:53 +02:00 |