Werner Saar
b57a60dac8
updated cdot and zdot for piledriver
2015-04-09 10:33:46 +02:00
Werner Saar
5c51163972
added optimized cdot- and zdot-kernel for steamroller
2015-04-09 09:45:23 +02:00
Werner Saar
9299d8cfd6
added optimized cdot- and zdot-kernels for bulldozer
2015-04-08 16:29:55 +02:00
Werner Saar
ba926e807c
added cdot- and zdot benchmark
2015-04-07 11:56:06 +02:00
Werner Saar
60c6dec6e6
updated some lines for bulldozer
2015-04-06 18:47:16 +02:00
Werner Saar
47898cca35
added optimized saxpy- and daxpy-kernel for sandybridge
2015-04-06 16:05:16 +02:00
Werner Saar
53bb924287
added optimized saxpy- and daxpy-kernel for haswell
2015-04-06 12:33:16 +02:00
Zhang Xianyi
1e80b8b0d3
Merge pull request #531 from wernsaar/develop
...
added optimized sdot- and ddot-kernels for Haswell and Sandybridge
2015-04-05 16:42:39 -05:00
Werner Saar
a901b065d3
added optimized ddot-kernel for sandybridge
2015-04-05 20:19:38 +02:00
Werner Saar
3937e2a0a0
add optimized sdot-kernel for sandybridge
2015-04-05 19:47:05 +02:00
Werner Saar
9707d608d5
removed double definition line
2015-04-05 18:35:34 +02:00
Werner Saar
701b9d7556
added optimized sdot- and ddot-kernel for HASWELL
2015-04-05 17:57:53 +02:00
Zhang Xianyi
8977b3f235
Refs #529 . Support Intel Broadwell by Haswell kernels.
2015-04-02 11:08:03 -05:00
Zhang Xianyi
f6426395ea
Merge pull request #527 from xantares/patch-1
...
fix mingw install
2015-03-30 10:16:11 -05:00
xantares
0ac787eefe
fix mingw install
2015-03-30 09:30:55 +02:00
Zhang Xianyi
e5b96e55a7
Fix build bug for ARM64.
2015-03-24 15:27:17 -05:00
Zhang Xianyi
a3491e1e88
Update the doc for 0.2.14.
2015-03-24 15:05:59 -05:00
Zhang Xianyi
e81a5d61e4
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-03-24 12:17:12 -05:00
Zhang Xianyi
c674fa32be
Add ARM targets.
2015-03-24 12:17:04 -05:00
Zhang Xianyi
e34911a73d
Fix compiling bug for ARM with setting BINARY.
2015-03-24 17:15:33 +00:00
Zhang Xianyi
76dcaf2281
Merge pull request #521 from maxlevesque/patch-1
...
Correct typo /proc/ instead of /pros/
2015-03-21 12:26:35 -05:00
Maximilien Levesque
770fac92eb
Correct typo /proc/ instead of /pros/
2015-03-20 23:25:11 +01:00
Zhang Xianyi
e95d64333a
Refs #519 . Avoid calling strncpy.
2015-03-19 15:57:22 -05:00
Zhang Xianyi
75c40bcc48
Refs #520 . Fixed ONLY_CBLAS=1 compiling bug on OSX.
2015-03-19 11:52:09 -05:00
Zhang Xianyi
b62f9f4120
Merge pull request #518 from ton/issue-508
...
Fix issue #508
2015-03-18 13:00:07 -05:00
Ton van den Heuvel
b6438dedea
Fix issue #508
...
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
2015-03-18 13:22:43 +01:00
Zhang Xianyi
cdefdb21cd
Refs #492 . Fixed c/zsyr bug with negative incx.
2015-02-26 06:37:03 +08:00
Zhang Xianyi
ea7f9dacf4
Refs #509 . Fixed geadd building bug with DYNAMIC_ARCH=1.
2015-02-26 01:47:11 +08:00
Zhang Xianyi
bf5dbb7e2a
Refs#509. Merge branch 'grisuthedragon-develop' into develop
2015-02-26 01:44:19 +08:00
Martin Koehler
39cc6b21d3
Add ATLAS-style ?geadd function
2015-02-16 13:46:20 +01:00
Zhang Xianyi
771b18ae9c
Detect the wrong combined flags of USE_OPENMP=1 and USE_THREAD=0.
2015-02-08 01:42:48 -06:00
Zhang Xianyi
cfa9392ffa
Fix openblas_get_num_threads and openblas_get_num_procs bug with single thread.
2015-02-08 01:30:23 -06:00
Zhang Xianyi
1ccd57ce80
Merge pull request #497 from eschnett/develop
...
Introduce openblas_get_num_threads and openblas_get_num_procs
2015-02-03 23:09:38 -06:00
Erik Schnetter
65a847cd36
Introduce openblas_get_num_threads and openblas_get_num_procs
2015-02-03 12:23:41 -05:00
Zhang Xianyi
07ff001981
Merge pull request #495 from jeromerobert/develop
...
Fix a segfault in gemv when MAX_STACK_ALLOC is set
2015-01-29 18:23:50 +08:00
Jerome Robert
b17ccb4c5c
Fix a segfault in gemv when MAX_STACK_ALLOC is set
...
* stack_alloc_size is needed after the implementation call
but it may be overwritten if it's optimized to a register,
because some gemv implementation (ex: dgemv_n.S) do not
restore all register (ex: r10).
* do the same in ger.c for the same reasons even if the bug
has not been observed.
2015-01-29 09:55:57 +01:00
Zhang Xianyi
63c6fcfa0a
Merge pull request #490 from eschnett/develop
...
Move #include statements outside extern "C" blocks
2015-01-13 15:43:56 +08:00
Erik Schnetter
29cb47fc06
Move #include statements outside extern "C" blocks
2015-01-12 21:27:52 -05:00
Zhang Xianyi
4e6c4046f7
Fix cortex-a15 detecting bug.
2015-01-12 09:35:16 +00:00
Zhang Xianyi
229ce2ccd1
Add cortex-a9 and cortex-a15 targets.
2015-01-12 08:55:29 +00:00
Zhang Xianyi
ef75be0e51
Merge pull request #487 from kortschak/dromtg-test
...
Add test for drotmg bug fixed by 692b14c
2015-01-07 14:13:11 +08:00
kortschak
5344f335a8
Add test for drotmg bug fixed by 692b14c
...
Test requested in issue xianyi/OpenBLAS#484 .
Run tests by applying the following change and then make:
diff --git a/Makefile.rule b/Makefile.rule
index bea1fe1..9852ff3 100644
--- a/Makefile.rule
+++ b/Makefile.rule
@@ -140,7 +140,7 @@ NO_AFFINITY = 1
-# UTEST_CHECK = 1
+UTEST_CHECK = 1
2015-01-07 10:06:55 +10:30
Zhang Xianyi
5cb5af9333
Add configuration options.
2015-01-02 02:42:32 +08:00
Zhang Xianyi
41aad0407f
Merge pull request #482 from jeromerobert/develop
...
Allow to do gemv and ger buffer allocation on the stack
2015-01-02 02:26:17 +08:00
Zhang Xianyi
f8f2e84659
Merge pull request #486 from wernsaar/develop
...
Optimizations for steamroller
2014-12-31 02:36:23 +08:00
Werner Saar
34633fef01
Merge branch 'develop' of github.com:wernsaar/OpenBLAS into develop
2014-12-30 20:16:53 +08:00
Werner Saar
ddf983d643
added optimizations for steamroller
2014-12-30 20:14:45 +08:00
Zhang Xianyi
17b9db20f1
Merge pull request #483 from wernsaar/develop
...
added Steamroller as a cpu target
2014-12-29 12:00:16 +08:00
Werner Saar
0dc559ed30
bugfix in dynamic.c
2014-12-28 17:15:42 +01:00
Werner Saar
9566f5fdb0
added Steamroller as a target processor
2014-12-28 13:45:19 +01:00