Commit Graph

3078 Commits

Author SHA1 Message Date
Kaustubh Raste c8a7860eb3 STRSM optimized
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
2016-05-30 21:17:00 +05:30
Zhang Xianyi 2daad2bcb5 Merge pull request #893 from biddisco/develop
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PRO…
2016-05-30 14:52:58 +08:00
Zhang Xianyi bac478d17e Merge pull request #891 from rndfax/develop
mips64/axpy: fix error when INCY == 0
2016-05-30 14:52:40 +08:00
John Biddiscombe 053044ae4d Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
2016-05-25 09:13:28 +02:00
Aleksey Kuleshov fca66262c4 mips64/axpy: fix error when INCY == 0 2016-05-23 13:30:27 +03:00
Werner Saar 412bcd187a optimized dtrsm_logic_LT_16x4_power8.S and dtrsm_macros_LT_16x4_power8.S 2016-05-23 11:20:41 +02:00
Werner Saar bd06b246cc Merge pull request #890 from wernsaar/develop
optimized dtrsm_kernel_LT for POWER8
2016-05-22 16:01:35 +02:00
Werner Saar 8b140220c8 optimized dtrsm_kernel_LT for POWER8 2016-05-22 15:20:04 +02:00
Werner Saar 318cad9c37 added trsm bencharks for POWER8 to benchmark/Makefile 2016-05-22 13:51:47 +02:00
Werner Saar 8fb5a1aaff added optimized dtrsm_LT kernel for POWER8 2016-05-22 13:09:05 +02:00
Zhang Xianyi 7d0358475d Merge the patch for musl libc. 2016-05-22 01:08:44 +08:00
Zhang Xianyi b46f680f01 Merge pull request #887 from ksraste/develop
STRSM optimization for MIPS P5600 and I6400 using MSA
2016-05-21 07:17:21 +08:00
Kaustubh Raste ad9f317870 STRSM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
2016-05-20 10:59:03 +05:30
Zhang Xianyi a8fcd89d6d Merge pull request #886 from vriera/develop
Makefile.system: P5600 and I6400 cores need -mmsa
2016-05-19 19:59:09 +08:00
Zhang Xianyi 232335fd49 Merge pull request #885 from sva-img/develop
SGEMM optimization for MIPS P5600 and I6400 using MSA.
2016-05-19 19:58:32 +08:00
Vicente Olivert Riera e12cff87b8 Makefile.system: P5600 and I6400 cores need -mmsa
Signed-off-by: Vicente Olivert Riera <Vincent.Riera@imgtec.com>
2016-05-19 10:56:53 +01:00
Shivraj Patil c4ba40e308 SGEMM optimization for MIPS P5600 and I6400 using MSA. Unrolled k loop in DGEMM kernel function
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-19 11:04:42 +05:30
Zhang Xianyi 7a19065369 Merge pull request #878 from ksraste/develop
DTRSM bug fix for MIPS P5600 and I6400
2016-05-19 11:16:43 +08:00
Werner Saar 8a149e6294 Merge pull request #879 from wernsaar/develop
optimized dgemm and dgetrf for POWER8
2016-05-17 17:10:36 +02:00
Werner Saar 956be69e1d optimized getrf_single.c for POWER8 2016-05-17 16:19:53 +02:00
Werner Saar 6a2bde7a2d optimized dgemm and dgetrf for POWER8 2016-05-17 14:45:27 +02:00
Kaustubh Raste d7cbc7ac13 DTRSM bug fix for MIPS P5600 and I6400
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
2016-05-17 15:48:02 +05:30
Zhang Xianyi 8bf71e9e06 Merge pull request #877 from jeromerobert/bug873
Disable multi-threading in swap
2016-05-16 23:21:56 +08:00
Jerome Robert 40af513669 Disable multi-threading in swap
* Close #873
2016-05-16 13:07:55 +00:00
Werner Saar 88011f625d Merge pull request #876 from wernsaar/develop
optimized dgemm on power8 for 20 threads
2016-05-16 14:52:40 +02:00
Werner Saar 8310d4d3f7 optimized dgemm for 20 threads 2016-05-16 14:14:25 +02:00
Zhang Xianyi 5faffc123f Merge pull request #869 from ksraste/develop
DTRSM optimization for MIPS P5600 and I6400 using MSA
2016-05-09 10:54:55 -04:00
Zhang Xianyi 81794ccb9a Merge pull request #868 from sva-img/develop
build fix for MIPS 32 bit
2016-05-09 10:54:30 -04:00
Kaustubh Raste edb5980c13 DTRSM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Kaustubh Raste <kaustubh.raste@imgtec.com>
2016-05-09 15:15:26 +05:30
Shivraj Patil 573d9218f2 build fix for MIPS 32 bit
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-09 14:45:12 +05:30
Zhang Xianyi 7e549d5f37 Merge pull request #866 from sva-img/develop
DGEMM optimization for MIPS P5600 and I6400 using MSA
2016-05-06 10:53:22 -04:00
Shivraj Patil 085cf236c2 conflict resolved by syncing with 'xianyi:develop'
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-04 11:07:14 +05:30
Zhang Xianyi 0d1c695508 Merge pull request #867 from IvanUkhov/space
Wrap CURDIR and DESTDIR in quotes
2016-05-03 17:06:31 -04:00
Ivan Ukhov efaf30d536 Wrap CURDIR and DESTDIR in quotes 2016-05-03 21:31:32 +02:00
Shivraj Patil b7b3d8ec8e DGEMM optimization for MIPS P5600 and I6400 using MSA
Signed-off-by: Shivraj Patil <shivraj.patil@imgtec.com>
2016-05-03 14:42:26 +05:30
Zhang Xianyi 2df60f7315 Merge pull request #863 from ashwinyes/develop_20160429_update_numa_binding
Update NUMA CPU binding
2016-04-29 11:46:24 -04:00
Zhang Xianyi cd7af5260a Merge pull request #847 from sva-img/develop
MIPS P5600(32 bit) and I6400(64 bit) cores support added.
2016-04-29 11:44:36 -04:00
Werner Saar 3a2e8c3537 Merge pull request #864 from wernsaar/develop
optimized dgemm for POWER8
2016-04-29 13:33:45 +02:00
Werner Saar 56948dbf0f optimized dgemm for POWER8 2016-04-29 12:52:47 +02:00
Ashwin Sekhar T K 0fb380c966 Update NUMA CPU binding
When the number of process can all be
accommodated within the current node,
then use cores from the current node only.
2016-04-29 11:58:15 +05:30
Zhang Xianyi c95f5008fe Merge pull request #858 from buffer51/develop
Fixed cross-suffix detection for path that contains dashes
2016-04-28 11:42:18 -04:00
buffer51 708dec5bb7 Use CROSS_SUFFIX only if CROSS is set 2016-04-27 22:23:02 -07:00
buffer51 20b0ed1da5 Fixed cross-suffix detection for path that contains dashes when the compiler itself doesn't 2016-04-27 12:09:44 -07:00
Werner Saar 6f43310de5 Merge pull request #856 from wernsaar/develop
optimized dgemm for POWER8
2016-04-27 16:34:15 +02:00
Werner Saar 782f75ba94 optimized param.h for POWER8 2016-04-27 15:48:09 +02:00
Werner Saar 0d0c6f7d7d optimized dgemm for POWER8 2016-04-27 14:01:08 +02:00
Zhang Xianyi 0551e571dd Merge pull request #852 from buffer51/develop
Added Android as a community-supported OS
2016-04-26 10:24:33 -04:00
Zhang Xianyi 7e253607c2 Merge pull request #851 from rndfax/develop
allow building tests when CROSS compiling but don't run them
2016-04-26 10:24:13 -04:00
buffer51 b5e98e4dda Added Android as a community-supported OS 2016-04-26 03:14:03 -07:00
Aleksey Kuleshov 3d50ccdc0d allow building tests when CROSS compiling but don't run them 2016-04-26 12:36:47 +03:00