Commit Graph

1952 Commits

Author SHA1 Message Date
Zhang Xianyi
9253dadaa7 Bump to 0.2.19.dev. 2016-04-12 15:32:10 -04:00
Zhang Xianyi
1e03a62b67 Update doc for 0.2.18 version. 2016-04-12 15:28:31 -04:00
Zhang Xianyi
faa73690e4 Delete LOCAL_BUFFER_SIZE for other architectures. 2016-04-12 11:49:28 -04:00
Zhang Xianyi
f24d5307cf Refs #834. Fix zgemv config bug on Steamroller. 2016-04-12 22:26:11 +08:00
wernsaar
0a4276bc2f Merge pull request #837 from wernsaar/develop
updated zgemm- and ztrmm-kernel for POWER8
2016-04-08 11:13:27 +02:00
Werner Saar
08bddde3f3 updated benchmark Makefile for ESSL 2016-04-08 10:37:59 +02:00
Werner Saar
e173c51c04 updated zgemm- and ztrmm-kernel for POWER8 2016-04-08 09:05:37 +02:00
Werner Saar
9c42f0374a Updated cgemm- and sgemm-kernel for POWER8 SMP 2016-04-07 15:08:15 +02:00
Zhang Xianyi
d4380c1fe4 Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver. 2016-04-07 01:44:18 +08:00
Werner Saar
a51102e9b7 bugfixes for sgemm- and cgemm-kernel 2016-04-06 11:15:21 +02:00
wernsaar
7282419525 Merge pull request #833 from wernsaar/develop
updated optimized cgemm- and ctrmm-kernel for POWER8
2016-04-04 12:29:51 +02:00
Werner Saar
c5b1fbcb2e updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
wernsaar
e1cdd15b30 Merge pull request #832 from wernsaar/develop
updated cgemm- and ctrmm-kernel for POWER8
2016-04-03 15:05:25 +02:00
Werner Saar
d4c0330967 updated cgemm- and ctrmm-kernel for POWER8 2016-04-03 14:30:49 +02:00
Werner Saar
12540cedb5 added ESSL to Makefile for benchmarks 2016-04-03 07:21:48 +02:00
wernsaar
99adc8b062 Merge pull request #831 from wernsaar/develop
updated sgemm- and strmm-kernel for POWER8
2016-04-02 18:05:44 +02:00
Werner Saar
6a9bbfc227 updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
Zhang Xianyi
3349e9debd Merge pull request #830 from eschnett/patch-1
Correct small typo in comment
2016-04-01 17:35:22 -04:00
Erik Schnetter
dd7612358d Correct small typo in comment 2016-04-01 13:49:33 -04:00
Zhang Xianyi
e5a6ef3808 Merge pull request #829 from jeromerobert/bug828
Allow to force to do not use -j as make argument
2016-03-31 21:59:40 -04:00
Jerome Robert
7aac0aff8e Allow to force to do not use -j as make argument
Close #828 (hopefully)
2016-03-31 23:03:52 +02:00
wernsaar
26d7f06206 Merge pull request #827 from wernsaar/develop
added optimized dgemv_n kernel for POWER8
2016-03-30 12:04:49 +02:00
Werner Saar
68a69c5b50 added optimized dgemv_n kernel for POWER8 2016-03-30 11:10:53 +02:00
wernsaar
a571359afd Merge pull request #826 from wernsaar/develop
added optimized asum kernels for POWER8
2016-03-28 15:09:52 +02:00
Werner Saar
c2464a7c4a added optimized casum kernel for POWER8 2016-03-28 14:12:08 +02:00
Werner Saar
294f933869 added optimized zasum kernel for POWER8 2016-03-28 13:37:32 +02:00
Werner Saar
f59c9bd6ef added optimized sasum kernel for POWER8 2016-03-28 12:44:25 +02:00
Werner Saar
c53be46d78 added optimized dasum kernel for POWER8 2016-03-28 12:17:15 +02:00
wernsaar
bbb2d73d73 Merge pull request #825 from wernsaar/develop
added optimized cswap and zswap kernel for POWER8
2016-03-27 19:04:06 +02:00
Werner Saar
659ed16591 added otimized cswap and zswap kernels for POWER8 2016-03-27 18:31:37 +02:00
Werner Saar
35c98a3556 added optimized zscal kernel for POWER8 2016-03-27 16:31:50 +02:00
Werner Saar
f1a5dd06c5 added optimized sscal kernel for POWER8 2016-03-27 11:05:56 +02:00
wernsaar
e125a3dc33 Merge pull request #824 from wernsaar/develop
added optimized drot-kernel and srot-kernel for POWER8
2016-03-27 10:43:17 +02:00
Werner Saar
35f1f21a7f added drot- and srot-kernel optimimized for POWER8 2016-03-27 08:57:11 +02:00
Zhang Xianyi
7b4b7179ba Merge pull request #819 from ashwinyes/develop_20160324_fixes_optimizations
Cortex-A57: Fixes and Optimizations
2016-03-27 00:04:20 -04:00
Werner Saar
7a92c1538e added benchmark test for srot and drot 2016-03-26 07:14:13 +01:00
wernsaar
5727268141 Merge pull request #823 from wernsaar/develop
added optimized copy and swap kernels for POWER8
2016-03-25 18:08:48 +01:00
Werner Saar
3d9a50e841 added optimized sswap kernel for POWER8 2016-03-25 17:34:55 +01:00
Werner Saar
828c849b44 added optimized ccopy kernel for POWER8 2016-03-25 16:54:25 +01:00
Werner Saar
ecc0bc9813 added optimized scopy kernel for POWER8 2016-03-25 16:06:56 +01:00
Werner Saar
12f209b7b0 added optimized zswap kernel for POWER8 2016-03-25 15:27:34 +01:00
Werner Saar
7316a87930 added optimized dswap kernel for POWER8 2016-03-25 14:35:43 +01:00
Werner Saar
0bff057a87 added optimized dcopy kernel for POWER8 2016-03-25 13:03:02 +01:00
wernsaar
7ee1d29dd4 Merge pull request #822 from wernsaar/develop
added optimized dscal kernel for POWER8
2016-03-25 10:15:51 +01:00
Werner Saar
1e6cf9808c added optimized dscal kernel for POWER8 2016-03-25 09:42:08 +01:00
Ashwin Sekhar T K
278511ad2d Cortex-A57: Fix clang compilation errors 2016-03-24 10:42:04 +05:30
Ashwin Sekhar T K
3b5ffb49d3 Cortex-A57: Improve DGEMM 8x4 Implementation 2016-03-24 10:25:18 +05:30
wernsaar
8519e4ed9f Merge pull request #817 from wernsaar/develop
added optimized zaxpy kernel for POWER8
2016-03-23 13:37:04 +01:00
Werner Saar
55eda3813b added optimized zaxpy kernel for POWER8 2016-03-23 11:20:23 +01:00
Zhang Xianyi
53bfc83c26 Update appveyor version. 2016-03-22 11:37:35 -04:00