Ashwin Sekhar T K
|
318f0949c3
|
lapack-test fixes in nrm2 kernels for Cortex A57
|
2015-11-23 13:43:36 +05:30 |
Werner Saar
|
299cdcdc29
|
lapack fixes for Windos
|
2015-11-21 14:33:27 +01:00 |
Werner Saar
|
a8516c5b47
|
fixes for cross compile
|
2015-11-21 10:48:37 +01:00 |
Werner Saar
|
c40538eaeb
|
bugfix for cross compiling
|
2015-11-20 13:47:22 +01:00 |
Werner Saar
|
33e37d01b3
|
added lapack-3.6.0
|
2015-11-20 09:45:46 +01:00 |
Werner Saar
|
64db4576e6
|
removed lapack-3.5.0
|
2015-11-20 09:41:59 +01:00 |
Werner Saar
|
0d22551a6b
|
increase the stack size limit in the constructor
|
2015-11-20 09:23:01 +01:00 |
Ashwin Sekhar T K
|
1d121852c1
|
Fix blas_lock for arm64
|
2015-11-20 01:45:35 +05:30 |
Ashwin Sekhar T K
|
98965da2e8
|
lapack-test fixes for Cortex A57
|
2015-11-20 01:15:04 +05:30 |
Ashwin Sekhar T K
|
39937d15cd
|
Change BUFFER_SIZE for Cortex A57 to 20 MB
Change the GEMM_P, GEMM_Q, GEMM_R values for Cortex A57
|
2015-11-20 01:12:04 +05:30 |
Ray Glover
|
a9d7eee0dc
|
(Visual Studio) Don't use C99 complex numbers when building C++ code.
|
2015-11-17 17:29:30 +00:00 |
Zhang Xianyi
|
1367a64d09
|
Merge branch 'develop' of github.com:xianyi/OpenBLAS into arm_soft_fp_abi
|
2015-11-11 19:25:07 +00:00 |
Zhang Xianyi
|
e31948ceb0
|
Fix #686. Merge branch 'ashwinyes-develop' into develop
|
2015-11-11 04:30:26 +08:00 |
Zhang Xianyi
|
233ec2a1cc
|
Use 40 MB buffer for ARM Cortex A57.
|
2015-11-11 04:22:34 +08:00 |
Zhang Xianyi
|
a4c6a88a65
|
Delete vi swap file.
|
2015-11-11 04:19:43 +08:00 |
Zhang Xianyi
|
faf0811483
|
Merge branch 'develop' of https://github.com/ashwinyes/OpenBLAS into ashwinyes-develop
|
2015-11-11 04:16:22 +08:00 |
Zhang Xianyi
|
4e4a3e783f
|
Update develop version.
|
2015-11-11 04:14:58 +08:00 |
Zhang Xianyi
|
d00ada378f
|
Merge pull request #684 from sebastien-villemot/develop
Fix detection of POWER architecture in c_check.
|
2015-11-09 11:39:21 -06:00 |
Sébastien Villemot
|
41407acc19
|
Fix detection of POWER architecture in c_check.
This is necessary to avoid the false detection of a cross-compiling
environment.
|
2015-11-09 18:36:04 +01:00 |
Ashwin Sekhar T K
|
67874468a6
|
Fix bug in benchmark/gemm.c
|
2015-11-09 14:15:54 +05:30 |
Ashwin Sekhar T K
|
c99c43d51e
|
Optimized trmm kernels for CORTEXA57
|
2015-11-09 14:15:54 +05:30 |
Ashwin Sekhar T K
|
1397b47197
|
Optimized zgemm kernel for CORTEXA57
|
2015-11-09 14:15:53 +05:30 |
Ashwin Sekhar T K
|
45f78963ac
|
Optimized cgemm kernel for CORTEXA57
Also, add a generic ztrmm 4x4 kernel
|
2015-11-09 14:15:53 +05:30 |
Ashwin Sekhar T K
|
402443bf9c
|
Optimized dgemm kernel for CORTEXA57
|
2015-11-09 14:15:53 +05:30 |
Ashwin Sekhar T K
|
19fdbee291
|
Improve the sgemm kernel for CORTEXA57
|
2015-11-09 14:15:53 +05:30 |
Ashwin Sekhar T K
|
3b0cdfab1e
|
Optimized gemv kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:52 +05:30 |
Ashwin Sekhar T K
|
46efa6a1da
|
Optimized swap kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:52 +05:30 |
Ashwin Sekhar T K
|
ea1465cdf8
|
Optimized scal kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:52 +05:30 |
Ashwin Sekhar T K
|
fb4be3b3eb
|
Optimized rot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:52 +05:30 |
Ashwin Sekhar T K
|
6c2f4ddbcd
|
Optimized nrm2 kernels for CORTEXA57
|
2015-11-09 14:15:51 +05:30 |
Ashwin Sekhar T K
|
870c4d49c0
|
Optimized dot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:51 +05:30 |
Ashwin Sekhar T K
|
cd7684097c
|
Optimized copy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:51 +05:30 |
Ashwin Sekhar T K
|
2690b71b1f
|
Optimized axpy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:51 +05:30 |
Ashwin Sekhar T K
|
3e4acedf0e
|
Optimized asum kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:51 +05:30 |
Ashwin Sekhar T K
|
2610752dbb
|
Optimized iamax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:50 +05:30 |
Ashwin Sekhar T K
|
dbb213655e
|
Optimized amax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:50 +05:30 |
Ashwin Sekhar T K
|
9742dba595
|
Fix compiler errors in common.h
|
2015-11-09 14:15:50 +05:30 |
Ashwin Sekhar T K
|
f2f8a0fe8b
|
Adding arm64 target CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
|
2015-11-09 14:15:50 +05:30 |
Ralph Campbell
|
55a0b27c01
|
Minor C code fixes in interface/
|
2015-11-09 14:15:49 +05:30 |
Ralph Campbell
|
fbc21266e6
|
Minor C code fixes in driver/
|
2015-11-09 14:15:49 +05:30 |
Ralph Campbell
|
c053559ed9
|
Minor C code fixes in kernel/arm
|
2015-11-09 14:15:49 +05:30 |
Ralph Campbell
|
55e4332f00
|
Remove duplicate -D args in kernel/Makefile.L1
|
2015-11-09 14:15:48 +05:30 |
Zhang Xianyi
|
a550431ee6
|
Refs #682. Enable LAPACK_COMPLEX_STRUCTURE when __ANDROID_API_ < 21.
|
2015-11-06 23:46:20 -06:00 |
Zhang Xianyi
|
3e8d6ea74f
|
Init POWER8 kernels by POWER6.
|
2015-11-03 12:34:23 +08:00 |
Zhang Xianyi
|
839395fc25
|
Detect AMD Trinity and Richland.
|
2015-10-29 02:53:29 +08:00 |
Zhang Xianyi
|
1331642f24
|
Merge pull request #677 from j-bo/develop
Refs #676. Fixed ONLY_CBLAS=1 compiling bug on windows.
|
2015-10-28 09:44:25 -05:00 |
j-bo
|
1e0bbea868
|
Refs #676. Fixed ONLY_CBLAS=1 compiling bug on windows.
|
2015-10-28 15:10:42 +01:00 |
Zhang Xianyi
|
ccf41ebf78
|
Merge branch 'develop' into arm_soft_fp_abi
|
2015-10-28 12:12:31 +00:00 |
Zhang Xianyi
|
53e849f4fc
|
Merge branch 'develop'
|
2015-10-27 15:44:50 -05:00 |
Zhang Xianyi
|
8447498b50
|
Update doc for OpenBLAS 0.2.15 version. [CI skipped]
|
2015-10-27 15:44:35 -05:00 |