Zhang Xianyi
3e81c99b6b
Fixed installation bug on Mac OSX.
2014-12-13 13:05:06 +08:00
Werner Saar
ec85c4a51d
Increased the Threshold value in sep.in
2014-12-11 14:57:41 +01:00
Werner Saar
97de657d38
added tests to sep.as as workaround for gfortran-4.8.x
2014-12-11 13:53:59 +01:00
Zhang Xianyi
71966eba6c
Merge pull request #475 from xantares/patch-2
...
add OpenBLAS_VERSION to cmake config file
2014-12-09 17:57:43 +08:00
Zhang Xianyi
a359979e17
Merge pull request #474 from xantares/patch-1
...
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
2014-12-09 17:57:16 +08:00
xantares
7a6a141bc4
add OpenBLAS_VERSION to cmake config file
2014-12-09 10:34:41 +01:00
xantares
b8ff6892f6
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
...
usually these files are more often located in this subdir
2014-12-09 10:18:18 +01:00
Zhang Xianyi
8fe7a9ce6f
Merge pull request #473 from wernsaar/develop
...
changed inline assembler labels to short form
2014-12-08 13:22:18 +08:00
Werner Saar
bc5fff7085
changed inline assembler labels to short form
2014-12-07 12:38:54 +01:00
Zhang Xianyi
1943ea91a8
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2014-12-03 23:03:48 +08:00
Zhang Xianyi
f5424fc9de
Update the doc for 0.2.13 version.
2014-12-03 23:00:29 +08:00
Zhang Xianyi
0cf29ba6d2
Fixed a bug of sgemm sandy bridge kernel.
...
Reported by Julia project. JuliaLang/julia#9084
2014-12-03 17:38:41 +08:00
Zhang Xianyi
50e18033e6
Merge pull request #471 from nolta/patch-4
...
c_check: set $hostarch to x86_64 instead of amd64
2014-12-03 12:53:20 +08:00
Zhang Xianyi
551b55d1c7
Merge pull request #470 from nolta/patch-3
...
fix fortran compiler detection on FreeBSD
2014-12-03 12:50:46 +08:00
Mike Nolta
271ceb8bae
c_check: set $hostarch to x86_64 instead of amd64
...
`uname -m` returns "amd64" on some systems.
2014-12-02 21:23:23 -05:00
Mike Nolta
5f846be2e4
fix fortran compiler detection on FreeBSD
...
On FreeBSD, passing extra options to `which` causes it to report a non-zero status:
```
$ which gfortran48 -m64
/usr/local/bin/gfortran48
$ echo $?
1
```
```
$ which gfortran48
/usr/local/bin/gfortran48
$ echo $?
0
```
2014-12-02 20:47:40 -05:00
Zhang Xianyi
fe7dcf98f3
Refs #461 . Provide OpenBLASConfig.cmake to support CMake.
...
If you "make PREFIX=/path/to/OpenBLAS install" ,
The config file will be located in /path/to/OpenBLAS/cmake
Then, you can use "find_package(OpenBLAS)" at CMake.
cmake -DOpenBLAS_DIR=/path/to/OpenBLAS/cmake ..
2014-11-29 02:16:40 +08:00
Zhang Xianyi
2fb02626da
Update organization info.
2014-11-25 15:28:58 +08:00
Zhang Xianyi
a85c2785ae
Refs #467 . Added generic kernel file for x86_64.
2014-11-24 15:34:48 +08:00
Zhang Xianyi
4806715c97
Fixed #456 . Merged the optimizations for APM's
...
xgene-1 (aarch64).
Merge branch 'benedikt-huber-dave-patch' into develop
2014-11-11 22:21:04 +08:00
Benedikt Huber
58c90d5937
# The first commit's message is:
...
Optimizations for APM's xgene-1 (aarch64).
1) general system updates to support armv8 better. Make all did not work, one needed to supply TARGET=ARMV8.
2) sgem 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgem kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
2014-11-11 22:19:23 +08:00
Zhang Xianyi
2987bc7b40
refs #464 . Fixed the bug of detecting L2 associative on x86.
2014-11-10 17:15:34 +08:00
Zhang Xianyi
695e0fa649
#463 fixed a compiling bug on AIX.
2014-11-10 14:39:56 +08:00
Zhang Xianyi
cbb23c46c2
Merge pull request #459 from tkelman/symbol-rename
...
add SYMBOLPREFIX and SYMBOLSUFFIX makefile options
2014-10-25 19:49:03 +08:00
Tony Kelman
0b4602b753
add SYMBOLPREFIX and SYMBOLSUFFIX makefile options
...
for adding a prefix or suffix to all exported symbol names in the shared library
Useful to avoid conflicts with other BLAS libraries, especially when using
64 bit integer interfaces in OpenBLAS
Note that since OSX does not have the objcopy utility, setting these options
to non-empty values on Mac requires the objconv tool, available (GPL license)
from http://www.agner.org/optimize/#objconv
2014-10-24 22:27:09 -07:00
Zhang Xianyi
ac5a7e1c1b
Update dot to 0.2.12 version.
2014-10-13 17:10:12 +08:00
wernsaar
f1b9a4a1ca
Ref #454 : fixed bug in common_param.h
2014-09-23 11:34:29 +02:00
Zhang Xianyi
ae6b7caf32
Merge pull request #453 from wernsaar/develop
...
Enabled GEMM3M functions
2014-09-22 16:47:54 +08:00
wernsaar
f446d2368a
updated cblas.h and cblas_noconst.h
2014-09-21 13:39:15 +02:00
wernsaar
dab4edd069
added benchmark for gemm3m functions
2014-09-21 12:00:41 +02:00
wernsaar
9d7057366d
bugfix for GEMM3M functions
2014-09-21 11:41:43 +02:00
wernsaar
7f234f8ed1
added GEMM3M tests
2014-09-21 10:55:08 +02:00
wernsaar
9e829ce98f
enabled cblas gemm3m functions
2014-09-20 17:20:02 +02:00
wernsaar
d49fd33885
disabled SYMM3M and HEMM3M functions because segment violations
2014-09-20 15:27:40 +02:00
wernsaar
f0f9b25bb6
added test for CGEMM3M function
2014-09-20 14:53:30 +02:00
wernsaar
7aae4a62e7
enabled use of GEMM3M functions
2014-09-20 14:27:10 +02:00
wernsaar
7a911569b8
added test for GEMM3M functions
2014-09-20 14:21:42 +02:00
wernsaar
466bfb8b86
updated README.md
2014-09-17 16:01:07 +02:00
Zhang Xianyi
70d1ba09b2
Update the doc for target list.
2014-09-17 14:29:21 +08:00
Zhang Xianyi
d293b78b64
Merge pull request #451 from eshelman/patch-1
...
Add HASWELL to TargetList.txt
2014-09-17 14:20:06 +08:00
Eliot Eshelman
9912dbbcf9
Add HASWELL to TargetList.txt
...
The Intel "Haswell" architecture is missing from the list of build targets.
2014-09-16 18:26:45 -04:00
Zhang Xianyi
01bc462e8e
Merge pull request #449 from wernsaar/develop
...
optimized multithreading lower limits
2014-09-16 14:33:48 +08:00
wernsaar
3300f5ebff
optimized multithreading lower limits
2014-09-15 11:38:25 +02:00
Zhang Xianyi
59e2c20557
Merge pull request #448 from wernsaar/develop
...
Optimized cgemv and zgemv kernels
2014-09-15 13:12:14 +08:00
wernsaar
b7c9566eea
removed obsolete gemv kernel files
2014-09-14 11:00:53 +02:00
wernsaar
6df1b0be81
optimized zgemv_n_microk_sandy-4.c
2014-09-14 10:21:22 +02:00
wernsaar
2ac1e076c1
added optimized zgemv_n kernel for sandybridge
2014-09-14 09:02:05 +02:00
wernsaar
9908b6031c
bugfix in KERNEL.PILEDRIVER
2014-09-13 16:26:53 +02:00
wernsaar
8f100a14f2
optimized cgemv_t kernel for haswell
2014-09-13 16:13:27 +02:00
wernsaar
53b5726b04
added optimized cgemv_t kernel for haswell
2014-09-13 15:14:12 +02:00