Werner Saar
|
c53be46d78
|
added optimized dasum kernel for POWER8
|
2016-03-28 12:17:15 +02:00 |
wernsaar
|
bbb2d73d73
|
Merge pull request #825 from wernsaar/develop
added optimized cswap and zswap kernel for POWER8
|
2016-03-27 19:04:06 +02:00 |
Werner Saar
|
659ed16591
|
added optimized cswap and zswap kernels for POWER8
|
2016-03-27 18:31:37 +02:00 |
Werner Saar
|
35c98a3556
|
added optimized zscal kernel for POWER8
|
2016-03-27 16:31:50 +02:00 |
Werner Saar
|
f1a5dd06c5
|
added optimized sscal kernel for POWER8
|
2016-03-27 11:05:56 +02:00 |
wernsaar
|
e125a3dc33
|
Merge pull request #824 from wernsaar/develop
added optimized drot-kernel and srot-kernel for POWER8
|
2016-03-27 10:43:17 +02:00 |
Werner Saar
|
35f1f21a7f
|
added drot- and srot-kernel optimized for POWER8
|
2016-03-27 08:57:11 +02:00 |
Zhang Xianyi
|
7b4b7179ba
|
Merge pull request #819 from ashwinyes/develop_20160324_fixes_optimizations
Cortex-A57: Fixes and Optimizations
|
2016-03-27 00:04:20 -04:00 |
Werner Saar
|
7a92c1538e
|
added benchmark test for srot and drot
|
2016-03-26 07:14:13 +01:00 |
wernsaar
|
5727268141
|
Merge pull request #823 from wernsaar/develop
added optimized copy and swap kernels for POWER8
|
2016-03-25 18:08:48 +01:00 |
Werner Saar
|
3d9a50e841
|
added optimized sswap kernel for POWER8
|
2016-03-25 17:34:55 +01:00 |
Werner Saar
|
828c849b44
|
added optimized ccopy kernel for POWER8
|
2016-03-25 16:54:25 +01:00 |
Werner Saar
|
ecc0bc9813
|
added optimized scopy kernel for POWER8
|
2016-03-25 16:06:56 +01:00 |
Werner Saar
|
12f209b7b0
|
added optimized zswap kernel for POWER8
|
2016-03-25 15:27:34 +01:00 |
Werner Saar
|
7316a87930
|
added optimized dswap kernel for POWER8
|
2016-03-25 14:35:43 +01:00 |
Werner Saar
|
0bff057a87
|
added optimized dcopy kernel for POWER8
|
2016-03-25 13:03:02 +01:00 |
wernsaar
|
7ee1d29dd4
|
Merge pull request #822 from wernsaar/develop
added optimized dscal kernel for POWER8
|
2016-03-25 10:15:51 +01:00 |
Werner Saar
|
1e6cf9808c
|
added optimized dscal kernel for POWER8
|
2016-03-25 09:42:08 +01:00 |
Ashwin Sekhar T K
|
278511ad2d
|
Cortex-A57: Fix clang compilation errors
|
2016-03-24 10:42:04 +05:30 |
Ashwin Sekhar T K
|
3b5ffb49d3
|
Cortex-A57: Improve DGEMM 8x4 Implementation
|
2016-03-24 10:25:18 +05:30 |
wernsaar
|
8519e4ed9f
|
Merge pull request #817 from wernsaar/develop
added optimized zaxpy kernel for POWER8
|
2016-03-23 13:37:04 +01:00 |
Werner Saar
|
55eda3813b
|
added optimized zaxpy kernel for POWER8
|
2016-03-23 11:20:23 +01:00 |
Zhang Xianyi
|
53bfc83c26
|
Update appveyor version.
|
2016-03-22 11:37:35 -04:00 |
Zhang Xianyi
|
13ca89f6f0
|
Merge pull request #813 from theoractice/develop
Fix access violation on Windows while static linking in MSVC
|
2016-03-22 11:31:37 -04:00 |
wernsaar
|
461cf9ea38
|
Merge pull request #814 from wernsaar/develop
added optimized daxpy kernel for POWER8
|
2016-03-22 15:24:59 +01:00 |
Werner Saar
|
0664ba4c97
|
added optimized daxpy kernel for POWER8
|
2016-03-22 14:50:03 +01:00 |
Theoractice
|
aa744dfa59
|
Update memory.c
|
2016-03-22 20:02:37 +08:00 |
theoractice
|
61cf8f74d9
|
Fix access violation on Windows while static linking
|
2016-03-22 19:14:54 +08:00 |
Theoractice
|
de202fa375
|
Merge pull request #1 from xianyi/develop
upd
|
2016-03-22 05:33:20 -05:00 |
wernsaar
|
6f93b53590
|
Merge pull request #812 from wernsaar/develop
added optimized sdot kernel for POWER8
|
2016-03-21 13:59:44 +01:00 |
Werner Saar
|
11c44dede1
|
added optimized sdot kernel for POWER8
|
2016-03-21 13:18:23 +01:00 |
wernsaar
|
f00d642592
|
Merge pull request #811 from wernsaar/develop
added optimized zdot kernel for POWER8
|
2016-03-21 10:48:41 +01:00 |
Werner Saar
|
9e4584d069
|
added optimized zdot kernel for POWER8
|
2016-03-21 10:12:07 +01:00 |
Zhang Xianyi
|
2a5679da5f
|
Merge branch 'release-0.2.17' into develop
|
2016-03-20 20:52:43 -04:00 |
Zhang Xianyi
|
a71e8c82f6
|
Fix change log typo.
|
2016-03-20 20:52:15 -04:00 |
Zhang Xianyi
|
9b987badb0
|
Merge branch 'master' into develop
Bump to 0.2.18.dev
Conflicts:
CMakeLists.txt
Makefile.rule
|
2016-03-20 20:48:21 -04:00 |
Zhang Xianyi
|
1619b2f3c8
|
Merge branch 'release-0.2.17'
|
2016-03-20 20:44:01 -04:00 |
Zhang Xianyi
|
4f3153395a
|
Update doc for 0.2.17.
|
2016-03-20 20:43:42 -04:00 |
Zhang Xianyi
|
d7a1a7ff2a
|
Merge branch 'release-0.2.17' into develop
|
2016-03-20 09:24:28 -04:00 |
Zhang Xianyi
|
308e6195b7
|
Refs #807. Enable BUILD_LAPACK_DEPRECATED=1 by default.
|
2016-03-20 09:22:56 -04:00 |
Zhang Xianyi
|
7a3d7b1f52
|
Merge pull request #808 from theoractice/develop
Fix a minor compiler error in VisualStudio with CMake
|
2016-03-20 09:07:47 -04:00 |
wernsaar
|
74cc2d6623
|
Merge pull request #809 from wernsaar/develop
Ref #795: added optimized ddot kernel for POWER8
|
2016-03-20 13:16:41 +01:00 |
theoractice
|
fc3a558515
|
Fix a minor compiler error in VisualStudio with CMake
|
2016-03-20 18:58:18 +08:00 |
Werner Saar
|
cd9fafc054
|
ddot for POWER8: updated licence information
|
2016-03-20 11:19:27 +01:00 |
Werner Saar
|
84b92e6373
|
added optimized ddot kernel for POWER8
|
2016-03-20 11:06:06 +01:00 |
wernsaar
|
c279a53ed8
|
Merge pull request #806 from wernsaar/develop
adding optimized single precision blas level3 kernels for POWER8
|
2016-03-18 12:46:16 +01:00 |
Werner Saar
|
e1df5a6e23
|
fixed sgemm- and strmm-kernel
|
2016-03-18 12:12:03 +01:00 |
Werner Saar
|
5c658f8746
|
add optimized cgemm- and ctrmm-kernel for POWER8
|
2016-03-18 08:17:25 +01:00 |
Zhang Xianyi
|
ec4390a967
|
Bump develop version to 0.2.17.dev.
|
2016-03-15 14:52:01 -04:00 |
Zhang Xianyi
|
fced5744fb
|
Merge branch 'release-0.2.16'
|
2016-03-15 14:49:10 -04:00 |