Commit Graph

7452 Commits

Author SHA1 Message Date
Zhang Xianyi 5faffc123f Merge pull request #869 from ksraste/develop
DTRSM optimization for MIPS P5600 and I6400 using MSA
2016-05-09 10:54:55 -04:00
Zhang Xianyi 81794ccb9a Merge pull request #868 from sva-img/develop
build fix for MIPS 32 bit
2016-05-09 10:54:30 -04:00
Kaustubh Raste edb5980c13 DTRSM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
2016-05-09 15:15:26 +05:30
Shivraj Patil 573d9218f2 build fix for MIPS 32 bit
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-09 14:45:12 +05:30
Zhang Xianyi 7e549d5f37 Merge pull request #866 from sva-img/develop
DGEMM optimization for MIPS P5600 and I6400 using MSA
2016-05-06 10:53:22 -04:00
Shivraj Patil 085cf236c2 conflict resolved by syncing with 'xianyi:develop'
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-04 11:07:14 +05:30
Zhang Xianyi 0d1c695508 Merge pull request #867 from IvanUkhov/space
Wrap CURDIR and DESTDIR in quotes
2016-05-03 17:06:31 -04:00
Ivan Ukhov efaf30d536 Wrap CURDIR and DESTDIR in quotes 2016-05-03 21:31:32 +02:00
Shivraj Patil b7b3d8ec8e DGEMM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-03 14:42:26 +05:30
Zhang Xianyi 2df60f7315 Merge pull request #863 from ashwinyes/develop_20160429_update_numa_binding
Update NUMA CPU binding
2016-04-29 11:46:24 -04:00
Zhang Xianyi cd7af5260a Merge pull request #847 from sva-img/develop
MIPS P5600(32 bit) and I6400(64 bit) cores support added.
2016-04-29 11:44:36 -04:00
Werner Saar 3a2e8c3537 Merge pull request #864 from wernsaar/develop
optimized dgemm for POWER8
2016-04-29 13:33:45 +02:00
Werner Saar 56948dbf0f optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
Ashwin Sekhar T K 0fb380c966 Update NUMA CPU binding
When the number of process can all be
accommodated within the current node,
then use cores from the current node only.
2016-04-29 11:58:15 +05:30
Zhang Xianyi c95f5008fe Merge pull request #858 from buffer51/develop
Fixed cross-suffix detection for path that contains dashes
2016-04-28 11:42:18 -04:00
buffer51 708dec5bb7 Use CROSS_SUFFIX only if CROSS is set 2016-04-27 22:23:02 -07:00
buffer51 20b0ed1da5 Fixed cross-suffix detection for path that contains dashes when the compiler itself doesn't 2016-04-27 12:09:44 -07:00
Werner Saar 6f43310de5 Merge pull request #856 from wernsaar/develop
optimized dgemm for POWER8
2016-04-27 16:34:15 +02:00
Werner Saar 782f75ba94 optimized param.h for POWER8 2016-04-27 15:48:09 +02:00
Werner Saar 0d0c6f7d7d optimized dgemm for POWER8 2016-04-27 14:01:08 +02:00
Zhang Xianyi 0551e571dd Merge pull request #852 from buffer51/develop
Added Android as a community-supported OS
2016-04-26 10:24:33 -04:00
Zhang Xianyi 7e253607c2 Merge pull request #851 from rndfax/develop
allow building tests when CROSS compiling but don't run them
2016-04-26 10:24:13 -04:00
buffer51 b5e98e4dda Added Android as a community-supported OS 2016-04-26 03:14:03 -07:00
Aleksey Kuleshov 3d50ccdc0d allow building tests when CROSS compiling but don't run them 2016-04-26 12:36:47 +03:00
Werner Saar 6abec09eb4 Merge pull request #850 from wernsaar/develop
Bugfixes and enhancements for EXCAVATOR
2016-04-25 12:00:43 +02:00
Werner Saar 40ac64ae4f updated param.h for EXCAVATOR 2016-04-25 10:40:04 +02:00
Werner Saar 298b13bba4 updated some kernel files for EXCAVATOR 2016-04-25 10:36:23 +02:00
Werner Saar 78b05f6476 bugfix for EXCAVATOR and DYNAMIC_ARCH 2016-04-25 10:13:30 +02:00
Werner Saar 2b967590a0 bugfix in dynamic.c 2016-04-25 09:08:38 +02:00
Werner Saar 91b4233e06 Merge pull request #849 from wernsaar/develop
optimized gemm for POWER8
2016-04-23 16:25:27 +02:00
Werner Saar 089aad57f7 updated param.h for POWER8 2016-04-23 14:26:24 +02:00
Werner Saar a3da10662f added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
Werner Saar d46f07bb4e added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
Werner Saar a670e8061e Merge pull request #848 from wernsaar/develop
Optimized zgemm for POWER8 and tested zgemm again
2016-04-22 13:46:22 +02:00
Werner Saar 879a51165f Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
Shivraj Patil 2c3dfe2bf3 MIPS P5600(32 bit) and I6400(64 bit) cores support added.
Seperated mips and mips64 files.
Configurations support for mips 32 bit.

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-04-22 14:03:18 +05:30
Werner Saar ef30e52c8f Merge pull request #846 from wernsaar/develop
Optimized sgemm and dgemm for POWER8
2016-04-21 13:52:24 +02:00
Werner Saar dd2b897795 added bugfixes for some make files and smallscaling.c 2016-04-21 12:54:32 +02:00
Werner Saar 9276c9012f Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
Werner Saar 391584af85 optimized Makefile.power for POWER8 2016-04-20 15:28:28 +02:00
wernsaar 6fbca2a4a1 Merge pull request #845 from wernsaar/develop
optimized sgemm for power8
2016-04-20 13:44:22 +02:00
Werner Saar 0001260f4b optimized sgemm 2016-04-20 13:06:38 +02:00
Werner Saar 3c6294ca3d added optimized sgemm_tcopy for power8 2016-04-19 16:08:54 +02:00
Zhang Xianyi dd43661cfd Init IBM z system (s390x) porting. 2016-04-15 18:02:24 -04:00
Zhang Xianyi 9253dadaa7 Bump to 0.2.19.dev. 2016-04-12 15:32:10 -04:00
Zhang Xianyi 12ab1804b6 Merge branch 'develop' 2016-04-12 15:29:19 -04:00
Zhang Xianyi 1e03a62b67 Update doc for 0.2.18 version. 2016-04-12 15:28:31 -04:00
Zhang Xianyi faa73690e4 Delete LOCAL_BUFFER_SIZE for other architectures. 2016-04-12 11:49:28 -04:00
Zhang Xianyi f24d5307cf Refs #834. Fix zgemv config bug on Steamroller. 2016-04-12 22:26:11 +08:00
Werner Saar 8037d78eed bugfix for arm scal.c and zscal.c 2016-04-11 11:21:36 +02:00