Commit Graph

3078 Commits

Author SHA1 Message Date
Werner Saar 6abec09eb4 Merge pull request #850 from wernsaar/develop
Bugfixes and enhancements for EXCAVATOR
2016-04-25 12:00:43 +02:00
Werner Saar 40ac64ae4f updated param.h for EXCAVATOR 2016-04-25 10:40:04 +02:00
Werner Saar 298b13bba4 updated some kernel files for EXCAVATOR 2016-04-25 10:36:23 +02:00
Werner Saar 78b05f6476 bugfix for EXCAVATOR and DYNAMIC_ARCH 2016-04-25 10:13:30 +02:00
Werner Saar 2b967590a0 bugfix in dynamic.c 2016-04-25 09:08:38 +02:00
Werner Saar 91b4233e06 Merge pull request #849 from wernsaar/develop
optimized gemm for POWER8
2016-04-23 16:25:27 +02:00
Werner Saar 089aad57f7 updated param.h for POWER8 2016-04-23 14:26:24 +02:00
Werner Saar a3da10662f added sgemm_tcopy_8_power8.S 2016-04-23 10:04:41 +02:00
Werner Saar d46f07bb4e added cgemm_tcopy_8_power8.S 2016-04-23 07:37:18 +02:00
Werner Saar a670e8061e Merge pull request #848 from wernsaar/develop
Optimized zgemm for POWER8 and tested zgemm again
2016-04-22 13:46:22 +02:00
Werner Saar 879a51165f Optimized zgemm and tested zgemm again 2016-04-22 13:07:12 +02:00
Shivraj Patil 2c3dfe2bf3 MIPS P5600(32 bit) and I6400(64 bit) cores support added.
Seperated mips and mips64 files.
Configurations support for mips 32 bit.

Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-04-22 14:03:18 +05:30
Werner Saar ef30e52c8f Merge pull request #846 from wernsaar/develop
Optimized sgemm and dgemm for POWER8
2016-04-21 13:52:24 +02:00
Werner Saar dd2b897795 added bugfixes for some make files and smallscaling.c 2016-04-21 12:54:32 +02:00
Werner Saar 9276c9012f Optimized sgemm and dgemm and tested again. 2016-04-21 11:37:57 +02:00
Werner Saar 391584af85 optimized Makefile.power for POWER8 2016-04-20 15:28:28 +02:00
wernsaar 6fbca2a4a1 Merge pull request #845 from wernsaar/develop
optimized sgemm for power8
2016-04-20 13:44:22 +02:00
Werner Saar 0001260f4b optimized sgemm 2016-04-20 13:06:38 +02:00
Werner Saar 3c6294ca3d added optimized sgemm_tcopy for power8 2016-04-19 16:08:54 +02:00
Zhang Xianyi dd43661cfd Init IBM z system (s390x) porting. 2016-04-15 18:02:24 -04:00
Zhang Xianyi 9253dadaa7 Bump to 0.2.19.dev. 2016-04-12 15:32:10 -04:00
Zhang Xianyi 12ab1804b6 Merge branch 'develop' 2016-04-12 15:29:19 -04:00
Zhang Xianyi 1e03a62b67 Update doc for 0.2.18 version. 2016-04-12 15:28:31 -04:00
Zhang Xianyi faa73690e4 Delete LOCAL_BUFFER_SIZE for other architectures. 2016-04-12 11:49:28 -04:00
Zhang Xianyi f24d5307cf Refs #834. Fix zgemv config bug on Steamroller. 2016-04-12 22:26:11 +08:00
Werner Saar 8037d78eed bugfix for arm scal.c and zscal.c 2016-04-11 11:21:36 +02:00
Werner Saar 1ca750471a added cholesky benchmarks to Makefile for ESSL 2016-04-10 11:28:20 +02:00
wernsaar 0a4276bc2f Merge pull request #837 from wernsaar/develop
updated zgemm- and ztrmm-kernel for POWER8
2016-04-08 11:13:27 +02:00
Werner Saar 08bddde3f3 updated benchmark Makefile for ESSL 2016-04-08 10:37:59 +02:00
Werner Saar e173c51c04 updated zgemm- and ztrmm-kernel for POWER8 2016-04-08 09:05:37 +02:00
Werner Saar 9c42f0374a Updated cgemm- and sgemm-kernel for POWER8 SMP 2016-04-07 15:08:15 +02:00
Zhang Xianyi d4380c1fe4 Refs xianyi/OpenBLAS-CI#10 , Fix sdot for scipy test_iterative.test_convergence test failure on AMD bulldozer and piledriver. 2016-04-07 01:44:18 +08:00
Werner Saar a51102e9b7 bugfixes for sgemm- and cgemm-kernel 2016-04-06 11:15:21 +02:00
wernsaar 7282419525 Merge pull request #833 from wernsaar/develop
updated optimized cgemm- and ctrmm-kernel for POWER8
2016-04-04 12:29:51 +02:00
Werner Saar c5b1fbcb2e updated optimized cgemm- and ctrmm-kernel for POWER8 2016-04-04 09:12:08 +02:00
wernsaar e1cdd15b30 Merge pull request #832 from wernsaar/develop
updated cgemm- and ctrmm-kernel for POWER8
2016-04-03 15:05:25 +02:00
Werner Saar d4c0330967 updated cgemm- and ctrmm-kernel for POWER8 2016-04-03 14:30:49 +02:00
Werner Saar 12540cedb5 added ESSL to Makefile for benchmarks 2016-04-03 07:21:48 +02:00
wernsaar 99adc8b062 Merge pull request #831 from wernsaar/develop
updated sgemm- and strmm-kernel for POWER8
2016-04-02 18:05:44 +02:00
Werner Saar 6a9bbfc227 updated sgemm- and strmm-kernel for POWER8 2016-04-02 17:16:36 +02:00
Zhang Xianyi 3349e9debd Merge pull request #830 from eschnett/patch-1
Correct small typo in comment
2016-04-01 17:35:22 -04:00
Erik Schnetter dd7612358d Correct small typo in comment 2016-04-01 13:49:33 -04:00
Zhang Xianyi e5a6ef3808 Merge pull request #829 from jeromerobert/bug828
Allow to force to do not use -j as make argument
2016-03-31 21:59:40 -04:00
Jerome Robert 7aac0aff8e Allow to force to do not use -j as make argument
Close #828 (hopefully)
2016-03-31 23:03:52 +02:00
wernsaar 26d7f06206 Merge pull request #827 from wernsaar/develop
added optimized dgemv_n kernel for POWER8
2016-03-30 12:04:49 +02:00
Werner Saar 68a69c5b50 added optimized dgemv_n kernel for POWER8 2016-03-30 11:10:53 +02:00
wernsaar a571359afd Merge pull request #826 from wernsaar/develop
added optimized asum kernels for POWER8
2016-03-28 15:09:52 +02:00
Werner Saar c2464a7c4a added optimized casum kernel for POWER8 2016-03-28 14:12:08 +02:00
Werner Saar 294f933869 added optimized zasum kernel for POWER8 2016-03-28 13:37:32 +02:00
Werner Saar f59c9bd6ef added optimized sasum kernel for POWER8 2016-03-28 12:44:25 +02:00