Werner Saar
|
56948dbf0f
|
optimized dgemm for POWER8
|
2016-04-29 12:52:47 +02:00 |
Ashwin Sekhar T K
|
0fb380c966
|
Update NUMA CPU binding
When the number of process can all be
accommodated within the current node,
then use cores from the current node only.
|
2016-04-29 11:58:15 +05:30 |
Zhang Xianyi
|
c95f5008fe
|
Merge pull request #858 from buffer51/develop
Fixed cross-suffix detection for path that contains dashes
|
2016-04-28 11:42:18 -04:00 |
buffer51
|
708dec5bb7
|
Use CROSS_SUFFIX only if CROSS is set
|
2016-04-27 22:23:02 -07:00 |
buffer51
|
20b0ed1da5
|
Fixed cross-suffix detection for path that contains dashes when the compiler itself doesn't
|
2016-04-27 12:09:44 -07:00 |
Werner Saar
|
6f43310de5
|
Merge pull request #856 from wernsaar/develop
optimized dgemm for POWER8
|
2016-04-27 16:34:15 +02:00 |
Werner Saar
|
782f75ba94
|
optimized param.h for POWER8
|
2016-04-27 15:48:09 +02:00 |
Werner Saar
|
0d0c6f7d7d
|
optimized dgemm for POWER8
|
2016-04-27 14:01:08 +02:00 |
Zhang Xianyi
|
0551e571dd
|
Merge pull request #852 from buffer51/develop
Added Android as a community-supported OS
|
2016-04-26 10:24:33 -04:00 |
Zhang Xianyi
|
7e253607c2
|
Merge pull request #851 from rndfax/develop
allow building tests when CROSS compiling but don't run them
|
2016-04-26 10:24:13 -04:00 |
buffer51
|
b5e98e4dda
|
Added Android as a community-supported OS
|
2016-04-26 03:14:03 -07:00 |
Aleksey Kuleshov
|
3d50ccdc0d
|
allow building tests when CROSS compiling but don't run them
|
2016-04-26 12:36:47 +03:00 |
Werner Saar
|
6abec09eb4
|
Merge pull request #850 from wernsaar/develop
Bugfixes and enhancements for EXCAVATOR
|
2016-04-25 12:00:43 +02:00 |
Werner Saar
|
40ac64ae4f
|
updated param.h for EXCAVATOR
|
2016-04-25 10:40:04 +02:00 |
Werner Saar
|
298b13bba4
|
updated some kernel files for EXCAVATOR
|
2016-04-25 10:36:23 +02:00 |
Werner Saar
|
78b05f6476
|
bugfix for EXCAVATOR and DYNAMIC_ARCH
|
2016-04-25 10:13:30 +02:00 |
Werner Saar
|
2b967590a0
|
bugfix in dynamic.c
|
2016-04-25 09:08:38 +02:00 |
Werner Saar
|
91b4233e06
|
Merge pull request #849 from wernsaar/develop
optimized gemm for POWER8
|
2016-04-23 16:25:27 +02:00 |
Werner Saar
|
089aad57f7
|
updated param.h for POWER8
|
2016-04-23 14:26:24 +02:00 |
Werner Saar
|
a3da10662f
|
added sgemm_tcopy_8_power8.S
|
2016-04-23 10:04:41 +02:00 |
Werner Saar
|
d46f07bb4e
|
added cgemm_tcopy_8_power8.S
|
2016-04-23 07:37:18 +02:00 |
Werner Saar
|
a670e8061e
|
Merge pull request #848 from wernsaar/develop
Optimized zgemm for POWER8 and tested zgemm again
|
2016-04-22 13:46:22 +02:00 |
Werner Saar
|
879a51165f
|
Optimized zgemm and tested zgemm again
|
2016-04-22 13:07:12 +02:00 |
Shivraj Patil
|
2c3dfe2bf3
|
MIPS P5600(32 bit) and I6400(64 bit) cores support added.
Seperated mips and mips64 files.
Configurations support for mips 32 bit.
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
|
2016-04-22 14:03:18 +05:30 |
Werner Saar
|
ef30e52c8f
|
Merge pull request #846 from wernsaar/develop
Optimized sgemm and dgemm for POWER8
|
2016-04-21 13:52:24 +02:00 |
Werner Saar
|
dd2b897795
|
added bugfixes for some make files and smallscaling.c
|
2016-04-21 12:54:32 +02:00 |
Werner Saar
|
9276c9012f
|
Optimized sgemm and dgemm and tested again.
|
2016-04-21 11:37:57 +02:00 |
Werner Saar
|
391584af85
|
optimized Makefile.power for POWER8
|
2016-04-20 15:28:28 +02:00 |
wernsaar
|
6fbca2a4a1
|
Merge pull request #845 from wernsaar/develop
optimized sgemm for power8
|
2016-04-20 13:44:22 +02:00 |
Werner Saar
|
0001260f4b
|
optimized sgemm
|
2016-04-20 13:06:38 +02:00 |
Werner Saar
|
3c6294ca3d
|
added optimized sgemm_tcopy for power8
|
2016-04-19 16:08:54 +02:00 |
Zhang Xianyi
|
dd43661cfd
|
Init IBM z system (s390x) porting.
|
2016-04-15 18:02:24 -04:00 |
Zhang Xianyi
|
9253dadaa7
|
Bump to 0.2.19.dev.
|
2016-04-12 15:32:10 -04:00 |
Zhang Xianyi
|
12ab1804b6
|
Merge branch 'develop'
|
2016-04-12 15:29:19 -04:00 |
Zhang Xianyi
|
1e03a62b67
|
Update doc for 0.2.18 version.
|
2016-04-12 15:28:31 -04:00 |
Zhang Xianyi
|
faa73690e4
|
Delete LOCAL_BUFFER_SIZE for other architectures.
|
2016-04-12 11:49:28 -04:00 |
Zhang Xianyi
|
f24d5307cf
|
Refs #834. Fix zgemv config bug on Steamroller.
|
2016-04-12 22:26:11 +08:00 |
Werner Saar
|
8037d78eed
|
bugfix for arm scal.c and zscal.c
|
2016-04-11 11:21:36 +02:00 |
Werner Saar
|
1ca750471a
|
added cholesky benchmarks to Makefile for ESSL
|
2016-04-10 11:28:20 +02:00 |
wernsaar
|
0a4276bc2f
|
Merge pull request #837 from wernsaar/develop
updated zgemm- and ztrmm-kernel for POWER8
|
2016-04-08 11:13:27 +02:00 |
Werner Saar
|
08bddde3f3
|
updated benchmark Makefile for ESSL
|
2016-04-08 10:37:59 +02:00 |
Werner Saar
|
e173c51c04
|
updated zgemm- and ztrmm-kernel for POWER8
|
2016-04-08 09:05:37 +02:00 |
Werner Saar
|
9c42f0374a
|
Updated cgemm- and sgemm-kernel for POWER8 SMP
|
2016-04-07 15:08:15 +02:00 |
Zhang Xianyi
|
d4380c1fe4
|
Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver.
|
2016-04-07 01:44:18 +08:00 |
Werner Saar
|
a51102e9b7
|
bugfixes for sgemm- and cgemm-kernel
|
2016-04-06 11:15:21 +02:00 |
wernsaar
|
7282419525
|
Merge pull request #833 from wernsaar/develop
updated optimized cgemm- and ctrmm-kernel for POWER8
|
2016-04-04 12:29:51 +02:00 |
Werner Saar
|
c5b1fbcb2e
|
updated optimized cgemm- and ctrmm-kernel for POWER8
|
2016-04-04 09:12:08 +02:00 |
wernsaar
|
e1cdd15b30
|
Merge pull request #832 from wernsaar/develop
updated cgemm- and ctrmm-kernel for POWER8
|
2016-04-03 15:05:25 +02:00 |
Werner Saar
|
d4c0330967
|
updated cgemm- and ctrmm-kernel for POWER8
|
2016-04-03 14:30:49 +02:00 |
Werner Saar
|
12540cedb5
|
added ESSL to Makefile for benchmarks
|
2016-04-03 07:21:48 +02:00 |