Zhang Xianyi
cc7cab8a45
Detect other Intel Skylake cores.
...
http://users.atw.hu/instlatx64/
2015-09-09 10:47:17 -05:00
Yichao Yu
61ae47eb99
Ref #632. Support Intel Skylake with Haswell kernels.
2015-09-09 11:07:33 -04:00
Zhang Xianyi
22353b1727
Merge pull request #634 from kortschak/lantr-trans-prep
...
Fix lantr preparation for row major matrices
2015-09-09 09:56:07 -05:00
kortschak
efffd28739
Fix lantr preparation for row major matrices
2015-09-09 09:25:48 +09:30
Zhang Xianyi
62cabef857
Merge pull request #633 from grisuthedragon/tune_imatcopy
...
Improved Ximatcopy when lda==ldb.
2015-09-08 13:59:08 -05:00
Martin Koehler
711ca33bc6
Improved Ximatcopy when lda==ldb.
...
The Ximatcopy functions create a copy of the input matrix
although they appear to work in place. The new routines
XIMATCOPY_K_YY perform the operations in place if the leading
dimension does not change.
2015-09-07 14:36:16 +02:00
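The in-place idea in the commit above can be illustrated with a minimal sketch. It assumes column-major storage and double precision; the helper names `scale_inplace` and `transpose_scale_inplace` are hypothetical and are not the XIMATCOPY_K_YY kernels themselves. When the leading dimension stays the same, each element can be overwritten (or swapped with its mirror across the diagonal) without a temporary copy of the matrix.

```c
#include <stddef.h>

/* Hypothetical helper, not an OpenBLAS kernel: scale a column-major
 * rows x cols matrix by alpha in place. The leading dimension lda is
 * the same before and after, so no temporary buffer is needed. */
static void scale_inplace(size_t rows, size_t cols, double alpha,
                          double *a, size_t lda)
{
    for (size_t j = 0; j < cols; j++)
        for (size_t i = 0; i < rows; i++)
            a[i + j * lda] *= alpha;
}

/* Hypothetical helper: transpose and scale a square n x n column-major
 * matrix in place by swapping each element with its mirror across the
 * diagonal. Padding rows beyond n (when lda > n) are left untouched. */
static void transpose_scale_inplace(size_t n, double alpha,
                                    double *a, size_t lda)
{
    for (size_t j = 0; j < n; j++) {
        a[j + j * lda] *= alpha;               /* diagonal element */
        for (size_t i = j + 1; i < n; i++) {
            double tmp = a[i + j * lda];       /* lower-triangle element */
            a[i + j * lda] = alpha * a[j + i * lda];
            a[j + i * lda] = alpha * tmp;
        }
    }
}
```

The square case is shown because a non-square in-place transpose changes the natural leading dimension, which is outside the condition the commit message describes.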
Zhang Xianyi
40a3fed6b8
Merge pull request #630 from buffer51/develop
...
Fixed error in common.h for Android compilation introduced by e12cf11
2015-09-04 13:01:01 -05:00
buffer51
2297a2d989
Fixed error in common.h for Android compilation introduced by e12cf1123e
2015-09-03 20:54:21 -04:00
Zhang Xianyi
5408074941
Add notification.
2015-08-19 22:50:25 -05:00
Zhang Xianyi
bbcdf63bb4
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-08-19 22:48:55 -05:00
Zhang Xianyi
43eabab62f
Merge pull request #619 from gitter-badger/gitter-badge
...
Add a Gitter chat badge to README.md
2015-08-19 22:26:20 -05:00
The Gitter Badger
50901943fd
Added Gitter badge
2015-08-20 03:21:09 +00:00
Zhang Xianyi
7df0820160
Use C kernels for s/dgemv on x86.
2015-08-19 08:07:47 -05:00
Zhang Xianyi
17ee2237c3
Fixed cmake bug with NO_LAPACK=1
2015-08-18 22:43:42 -05:00
Zhang Xianyi
4b7381b7a4
Merge pull request #617 from notaz/arm_fixes
...
really fix ARM64 locking
2015-08-17 15:22:37 -05:00
Grazvydas Ignotas
abade3f896
really fix ARM64 locking
2015-08-17 01:27:45 +02:00
Zhang Xianyi
d1349e7a11
Merge pull request #616 from notaz/arm_fixes
...
ARM fixes
2015-08-16 17:16:18 -05:00
Grazvydas Ignotas
3efeaed0d8
correct a minor mistake
2015-08-16 20:12:04 +02:00
Grazvydas Ignotas
d38a1ddc7a
use real armv5 support
...
there is no more requirement for ARMv6 instructions,
and VFP on ARMv5 is uncommon
2015-08-16 18:59:18 +02:00
Grazvydas Ignotas
6b92204a7c
add fallback blas_lock implementation
...
to be used on armv5 and new platforms
2015-08-16 18:59:17 +02:00
Grazvydas Ignotas
f2ac1a5cee
set ARMV7 for Cortex-A9 and Cortex-A15
...
otherwise some macros like YIELDING are not defined correctly
2015-08-16 18:59:17 +02:00
Grazvydas Ignotas
e12cf1123e
add fallback rpcc implementation
...
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
2015-08-16 18:59:16 +02:00
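A fallback cycle counter along the lines described above might look like the following sketch. It is an assumption, not the code from this commit: it uses POSIX `clock_gettime` with `CLOCK_MONOTONIC`, and the name `sketch_rpcc` is hypothetical. The point it illustrates is that pure 64-bit integer arithmetic on a nanosecond clock yields ticks on roughly the same order of magnitude as a GHz-range `rdtsc` counter, so timeouts expressed in ticks stay meaningful.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L   /* for clock_gettime under strict -std modes */
#endif
#include <stdint.h>
#include <time.h>

/* Hypothetical fallback tick counter (assumption, not the OpenBLAS code):
 * returns monotonic time in nanoseconds using only integer math. */
static uint64_t sketch_rpcc(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (uint64_t)ts.tv_sec * 1000000000ULL + (uint64_t)ts.tv_nsec;
}
```

Two successive calls can be subtracted to obtain an elapsed tick count, in the same way a pair of `rdtsc` readings would be used.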
Grazvydas Ignotas
d3e2f0a1af
add missing barriers
...
should fix issue #597
2015-08-16 15:37:02 +02:00
Grazvydas Ignotas
c2323dd4d2
really fix ARM locking
...
- the code was writing 0 to the lock variable, so the lock was ineffective
- only exit the loop if the lock was 0 and strex was successful
2015-08-16 15:18:42 +02:00
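The two conditions in the commit above (the lock must have been 0, and the store that claims it must have succeeded) can be sketched portably with C11 atomics. This is an analogue under stated assumptions, not the ARM `ldrex`/`strex` assembly from the commit; the names `sketch_lock_t`, `sketch_lock` and `sketch_unlock` are hypothetical.

```c
#include <stdatomic.h>

/* Hypothetical lock type for illustration: 0 means free, 1 means held. */
typedef atomic_ulong sketch_lock_t;

static void sketch_lock(sketch_lock_t *l)
{
    for (;;) {
        unsigned long expected = 0UL;
        /* The exchange succeeds only if the lock held 0 AND the store of
         * the nonzero value 1 took effect. The weak form may fail
         * spuriously, much like a failed strex, so the loop retries. */
        if (atomic_compare_exchange_weak_explicit(
                l, &expected, 1UL,
                memory_order_acquire, memory_order_relaxed))
            return;
    }
}

static void sketch_unlock(sketch_lock_t *l)
{
    /* Release ordering makes writes from the critical section visible
     * before the lock is observed as free. */
    atomic_store_explicit(l, 0UL, memory_order_release);
}
```

Storing 1 rather than 0 on acquisition is what makes the lock observable to other threads; storing 0 would leave it looking free, which is the first bug the commit message lists.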
Zhang Xianyi
f8eba3d548
Fixed cmake build bugs on Linux.
2015-08-11 16:25:16 -05:00
Zhang Xianyi
40ab5cfc50
Merge branch 'hpanderson_cmake' into cmake
2015-08-11 03:31:55 +08:00
Zhang Xianyi
b7a8f9ad47
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
2015-08-11 03:31:07 +08:00
Zhang Xianyi
f874465bb8
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
...
Disable CBLAS and LAPACK.
2015-08-10 14:10:44 -05:00
Zhang Xianyi
bb6e050509
Merge pull request #614 from xantares/cmake_version
...
install OpenBLASConfigVersion.cmake
2015-08-06 13:15:51 -05:00
xantares
87336b9acf
install OpenBLASConfigVersion.cmake
2015-08-06 20:03:50 +02:00
Hank Anderson
19664f3ef4
Added missing lapacke.cmake file.
2015-08-06 07:40:06 -05:00
Zhang Xianyi
c50661e5b7
Merge pull request #613 from fabioperez/develop
...
Add POWER7/POWER8 as targets
2015-08-05 09:19:17 -05:00
Fábio Perez
b8d64a856a
Add POWER7/POWER8 as targets
2015-08-05 11:02:39 -03:00
Zhang Xianyi
898fc7552a
Merge pull request #612 from ibmsoe/ppc64le
...
ppc64le platform support (ELF ABI v2)
2015-08-04 16:58:24 -05:00
Zhang Xianyi
ab0a0a75fc
Merge branch 'develop' into cmake
2015-08-03 23:59:01 -05:00
Zhang Xianyi
1cf2b10224
Use pure C generic target on x86 and x86_64.
...
make TARGET=GENERIC
?gemm3m is unimplemented on generic target.
2015-08-03 23:55:56 -05:00
Zhang Xianyi
7ac7e147d4
Fixed cmake building bugs on Linux. Disable LAPACK by default.
2015-08-04 04:37:05 +08:00
Matthew Brandyberry
7ba4fe5afb
ppc64le platform support (ELF ABI v2)
2015-07-21 22:20:19 -05:00
Zhang Xianyi
a55377e9a4
Merge branch 'hpanderson_cmake' into cmake
2015-07-22 04:07:27 +08:00
Zhang Xianyi
dcd5ba4443
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
2015-07-22 04:06:39 +08:00
Zhang Xianyi
3f1b57668e
Fix blas lock bug on AArch64.
2015-06-26 11:54:41 +08:00
Zhang Xianyi
d8f18d32c3
Merge pull request #595 from tanderson92/fixTests
...
Fix test execution when USE_OPENMP=0
2015-06-22 21:54:51 -05:00
wernsaar
bdb5c842fc
Merge pull request #596 from wernsaar/develop
...
optimizations for haswell
2015-06-13 16:44:48 +02:00
Werner Saar
e7c969e164
added optimized dtrmm_kernel for haswell
2015-06-13 16:16:29 +02:00
Werner Saar
9bd962f655
modified haswell parameter dgemm_unroll_n
2015-06-13 10:28:27 +02:00
Thomas Anderson
4f5691e5c0
Fix test execution when USE_OPENMP=0
...
The standard way to disable OpenMP support is to set USE_OPENMP=0,
as indicated by other checks to see if USE_OPENMP equals 1. The
problem is obviously then that `ifdef USE_OPENMP` is very much not
what we want to test for. This causes tests to fail when no OpenMP
library is installed.
2015-06-12 23:52:07 -07:00
Zhang Xianyi
29293160a4
Fix #593. Change MACOSX_DEPLOYMENT_TARGET to 10.6.
2015-06-08 10:53:50 -05:00
wernsaar
3e33afef2e
Merge pull request #592 from wernsaar/develop
...
added benchmark scripts
2015-06-08 14:22:02 +02:00
Werner Saar
8614057ea9
added benchmark scripts for numpy, octave and R
2015-06-08 14:06:38 +02:00
Werner Saar
7f375f9e8f
updated geev benchmark
2015-06-08 12:58:38 +02:00