Martin Kroeker
|
a55694dd5b
|
Declare dot_compute static to avoid conflicts in multiarch builds
|
2018-03-16 22:23:36 +01:00 |
Martin Kroeker
|
85a41e9cdb
|
Add multithreading support for Haswell DDOT
copied from ashwinyes' implementation in dot_thunderx2t99.c
|
2018-03-16 16:58:47 +01:00 |
Denis Steckelmacher
|
c9ff735da6
|
Add ZEN support (tested for auto-detected static backend)
|
2017-03-19 15:32:50 +01:00 |
Werner Saar
|
298b13bba4
|
updated some kernel files for EXCAVATOR
|
2016-04-25 10:36:23 +02:00 |
Werner Saar
|
fc0e0391f3
|
bugfixes: replaced int with BLASLONG
|
2015-04-24 14:30:44 +02:00 |
Werner Saar
|
3a67daa954
|
optimized ddot.c for increments != 1
|
2015-04-24 10:56:55 +02:00 |
Werner Saar
|
baa0363ea2
|
add optimized ddot-kernel for piledriver
|
2015-04-14 15:09:13 +02:00 |
Werner Saar
|
f9f127d838
|
added optimized ddot kernel for steamroller
|
2015-04-10 16:18:03 +02:00 |
Werner Saar
|
a901b065d3
|
added optimized ddot-kernel for sandybridge
|
2015-04-05 20:19:38 +02:00 |
Werner Saar
|
701b9d7556
|
added optimized sdot- and ddot-kernel for HASWELL
|
2015-04-05 17:57:53 +02:00 |
Werner Saar
|
4319769b79
|
added target processor STEAMROLLER
|
2014-12-28 20:16:46 +08:00 |
wernsaar
|
ee74445155
|
added optimized caxpy kernel for bulldozer
|
2014-08-25 14:53:28 +02:00 |
wernsaar
|
16d6be852d
|
added optimized ddot kernel for nehalem
|
2014-08-22 20:34:41 +02:00 |