Zhang Xianyi
a55377e9a4
Merge branch 'hpanderson_cmake' into cmake
2015-07-22 04:07:27 +08:00
Zhang Xianyi
dcd5ba4443
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
2015-07-22 04:06:39 +08:00
Zhang Xianyi
3f1b57668e
Fix blas lock bug on AArch64.
2015-06-26 11:54:41 +08:00
Zhang Xianyi
d8f18d32c3
Merge pull request #595 from tanderson92/fixTests
...
Fix test execution when USE_OPENMP=0
2015-06-22 21:54:51 -05:00
wernsaar
bdb5c842fc
Merge pull request #596 from wernsaar/develop
...
optimizations for haswell
2015-06-13 16:44:48 +02:00
Werner Saar
e7c969e164
added optimized dtrmm_kernel for haswell
2015-06-13 16:16:29 +02:00
Werner Saar
9bd962f655
modified haswell parameter dgemm_unroll_n
2015-06-13 10:28:27 +02:00
Thomas Anderson
4f5691e5c0
Fix test execution when USE_OPENMP=0
...
The standard way to disable OpenMP support is to set USE_OPENMP=0,
as indicated by other checks to see if USE_OPENMP equals 1. The
problem is obviously then that `ifdef USE_OPENMP` is very much not
what we want to test for. This causes tests to fail when no OpenMP
library is installed.
2015-06-12 23:52:07 -07:00
Zhang Xianyi
29293160a4
Fix #593 . Change MACOSX_DEPLOYMENT_TARGET to 10.6.
2015-06-08 10:53:50 -05:00
wernsaar
3e33afef2e
Merge pull request #592 from wernsaar/develop
...
added benchmark scripts
2015-06-08 14:22:02 +02:00
Werner Saar
8614057ea9
added benchmark scripts for numpy, octave and R
2015-06-08 14:06:38 +02:00
Werner Saar
7f375f9e8f
updated geev benchmark
2015-06-08 12:58:38 +02:00
wernsaar
69c5169e7d
Merge pull request #589 from wernsaar/develop
...
small modification of gemm.c
2015-06-03 12:14:09 +02:00
Werner Saar
e19948baa1
small modification of gemm.c
2015-06-03 09:11:51 +02:00
wernsaar
a2eaf234fc
Merge pull request #587 from wernsaar/develop
...
added gesv benchmark
2015-06-02 15:29:49 +02:00
Werner Saar
6a13a94e71
added gesv benchmark
2015-06-02 13:35:49 +02:00
wernsaar
eff43d3289
Merge pull request #585 from wernsaar/develop
...
bugfix for benchmark Makefile on MAC
2015-05-31 15:01:54 +02:00
Werner Saar
9c4817d07b
bugfix for Makefile on mac
2015-05-31 14:16:51 +02:00
wernsaar
319f3a0451
Merge pull request #584 from wernsaar/develop
...
bugfixes, to build benchmarks with mingw on Windows OS
2015-05-29 13:27:20 +02:00
Werner Saar
02c7766f68
bugfixes, to build benchmarks with mingw on Windows OS
2015-05-29 12:56:22 +02:00
wernsaar
f38cb67ca8
Merge pull request #581 from wernsaar/develop
...
bugfix for arm locking
2015-05-23 12:58:15 +02:00
Werner Saar
eea2e30b74
bugfix for arm locking
2015-05-23 11:40:40 +02:00
Werner Saar
19b8fd2aed
smp lock bugfix
2015-05-23 10:58:38 +02:00
wernsaar
0cc5212741
Merge pull request #580 from wernsaar/develop
...
added blas level1 swap benchmark
2015-05-23 09:46:39 +02:00
Werner Saar
c47c8e8cf5
added blas level1 swap benchmark
2015-05-21 08:51:42 +02:00
Zhang Xianyi
a11555c715
Support Android NDK armeabi-v7a-hard ABI. (-mfloat-abi=hard)
...
e.g.
make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7
In Android NDK, it uses armeabi-v7a-hard ABI.
TARGET_CFLAGS += -mhard-float -D_NDK_MATH_NO_SOFTFP=1
TARGET_LDFLAGS += -Wl,--no-warn-mismatch -lm_hard
For more information, please check hard-float example at
android_ndk/tests/device/hard-float/jni/.
2015-05-20 21:57:27 -05:00
wernsaar
897d03518e
Merge pull request #578 from wernsaar/develop
...
added blas level1 copy benchmark
2015-05-20 11:56:02 +02:00
Werner Saar
23fbc5728e
added blas level1 copy benchmark
2015-05-20 11:05:00 +02:00
Zhang Xianyi
6d40fa587f
Fix f_check bug.
2015-05-19 12:04:45 -05:00
wernsaar
22dcd79959
Merge pull request #577 from wernsaar/develop
...
Bugfix for armv6 memory barrier
2015-05-19 10:59:24 +02:00
Werner Saar
ea4df0aad3
Ref #574 : Bugfix for armv6 memory barrier
2015-05-19 10:43:12 +02:00
Zhang Xianyi
e127fb8fd8
1) Refs #575 . Remove g77 from compiler list.
...
2) If OpenBLAS cannot find Fortran compiler, it will only build BLAS
(without LAPACK).
2015-05-19 00:01:04 -05:00
wernsaar
7fb718a7d8
Merge pull request #572 from wernsaar/develop
...
added optimized cscal and zscal functions for steamroller
2015-05-18 13:47:38 +02:00
Werner Saar
24f58c8bb1
added optimized cscal and zscal kernels for steamroller
2015-05-18 12:40:07 +02:00
Werner Saar
95b1faf667
added optimized cscal and zscal kernels for steamroller and piledriver
2015-05-18 10:50:57 +02:00
Werner Saar
2d9e406050
added optimized cscal kernel for sandybridge
2015-05-18 08:46:06 +02:00
Werner Saar
59083e3ce1
added optimized cscal kernel for bulldozer
2015-05-18 07:33:52 +02:00
wernsaar
685be40339
Merge pull request #571 from wernsaar/develop
...
added optimized cscal and zscal functions
2015-05-17 14:09:14 +02:00
Werner Saar
31c9e399e9
added optimized cscal kernel for haswell
2015-05-17 13:44:09 +02:00
Werner Saar
7de6bb9889
added optimized zscal kernel for bulldozer
2015-05-17 11:45:19 +02:00
Werner Saar
d63034303b
added optimized zscal kernel for haswell
2015-05-16 16:41:45 +02:00
Zhang Xianyi
51ff17d46e
Add AMD Excavator target.
2015-05-13 16:16:30 -05:00
wernsaar
905534942a
Merge pull request #568 from wernsaar/develop
...
added optimized dscal kernel
2015-05-13 13:48:08 +02:00
Werner Saar
18e90ee2e3
bugfix: added static to functions
2015-05-13 13:31:26 +02:00
Werner Saar
e00cccc41e
added optimized dscal kernel for piledriver
2015-05-13 13:05:35 +02:00
Werner Saar
73f09bf64f
optimized dscal kernel for increment != 1
2015-05-13 12:14:39 +02:00
Werner Saar
02e772c7e4
added optimized dscal kernel for haswell
2015-05-12 17:19:58 +02:00
Werner Saar
7aee913991
added optimized dscal kernel for sandybridge
2015-05-12 16:27:43 +02:00
Werner Saar
e50a933037
added optimized dscal kernel for bulldozer
2015-05-12 12:28:44 +02:00
Zhang Xianyi
5f9011d6ef
Merge pull request #566 from powderluv/develop
...
Fix build with ALLOC_SHM=0 (Android NDK)
2015-05-11 20:59:12 -05:00