Zhang Xianyi
1e80b8b0d3
Merge pull request #531 from wernsaar/develop
...
added optimized sdot- and ddot-kernels for Haswell and Sandybridge
2015-04-05 16:42:39 -05:00
Werner Saar
a901b065d3
added optimized ddot-kernel for sandybridge
2015-04-05 20:19:38 +02:00
Werner Saar
3937e2a0a0
add optimized sdot-kernel for sandybridge
2015-04-05 19:47:05 +02:00
Werner Saar
9707d608d5
removed double definition line
2015-04-05 18:35:34 +02:00
Werner Saar
701b9d7556
added optimized sdot- and ddot-kernel for HASWELL
2015-04-05 17:57:53 +02:00
Zhang Xianyi
8977b3f235
Refs #529 . Support Intel Broadwell by Haswell kernels.
2015-04-02 11:08:03 -05:00
Zhang Xianyi
f6426395ea
Merge pull request #527 from xantares/patch-1
...
fix mingw install
2015-03-30 10:16:11 -05:00
xantares
0ac787eefe
fix mingw install
2015-03-30 09:30:55 +02:00
Zhang Xianyi
e5b96e55a7
Fix build bug for ARM64.
2015-03-24 15:27:17 -05:00
Zhang Xianyi
d0c51c4de9
Merge branch 'develop'
2015-03-24 15:07:07 -05:00
Zhang Xianyi
a3491e1e88
Update the doc for 0.2.14.
2015-03-24 15:05:59 -05:00
Zhang Xianyi
e81a5d61e4
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
2015-03-24 12:17:12 -05:00
Zhang Xianyi
c674fa32be
Add ARM targets.
2015-03-24 12:17:04 -05:00
Zhang Xianyi
e34911a73d
Fix compiling bug for ARM with setting BINARY.
2015-03-24 17:15:33 +00:00
Zhang Xianyi
76dcaf2281
Merge pull request #521 from maxlevesque/patch-1
...
Correct typo /proc/ instead of /pros/
2015-03-21 12:26:35 -05:00
Maximilien Levesque
770fac92eb
Correct typo /proc/ instead of /pros/
2015-03-20 23:25:11 +01:00
Zhang Xianyi
e95d64333a
Refs #519 . Avoid calling strncpy.
2015-03-19 15:57:22 -05:00
Zhang Xianyi
75c40bcc48
Refs #520 . Fixed ONLY_CBLAS=1 compiling bug on OSX.
2015-03-19 11:52:09 -05:00
Zhang Xianyi
b62f9f4120
Merge pull request #518 from ton/issue-508
...
Fix issue #508
2015-03-18 13:00:07 -05:00
Ton van den Heuvel
b6438dedea
Fix issue #508
...
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
2015-03-18 13:22:43 +01:00
Hank Anderson
1d183dcda8
Added lapacke sources.
2015-02-25 16:51:08 -06:00
Zhang Xianyi
cdefdb21cd
Refs #492 . Fixed c/zsyr bug with negative incx.
2015-02-26 06:37:03 +08:00
Hank Anderson
e19bf3a28b
Removed MSVC cpuid func when using clang.
2015-02-25 14:44:49 -06:00
Hank Anderson
3649cfbd7b
Fixed EPILOGUE for clang.
2015-02-25 12:23:26 -06:00
Hank Anderson
5ae8993752
Added intrinsics for MSVC.
2015-02-25 11:52:51 -06:00
Hank Anderson
84d90d6ed8
Fixed some compiler errors/warnings for clang.
2015-02-25 11:52:25 -06:00
Hank Anderson
518e2424a8
Fixed bad filename for cpuid.S compile.
2015-02-25 11:51:29 -06:00
Zhang Xianyi
ea7f9dacf4
Refs #509 . Fixed geadd building bug with DYNAMIC_ARCH=1.
2015-02-26 01:47:11 +08:00
Zhang Xianyi
bf5dbb7e2a
Refs#509. Merge branch 'grisuthedragon-develop' into develop
2015-02-26 01:44:19 +08:00
Hank Anderson
00e373aea6
Added LAPACK sources directly to add_library call instead of OBJECT.
2015-02-25 10:18:18 -06:00
Hank Anderson
9eaea02f33
Added additional gemm defines for complex types.
2015-02-25 09:39:11 -06:00
Hank Anderson
ab7043373f
Fixed bug generating trmv complex source names.
2015-02-24 15:18:41 -06:00
Hank Anderson
504cdb10ed
Added check for MSVC before enabling fortran.
...
Currently forcing gfortran, instead of assuming ifort.
2015-02-24 14:31:45 -06:00
Hank Anderson
a8002b0c5f
Separated getarch ASM file when using MSVC.
2015-02-24 14:31:18 -06:00
Hank Anderson
0553476fba
Added TRANS defines for complex sources in lapack.
2015-02-24 14:30:35 -06:00
Hank Anderson
2416d9dbac
Fixed TRANSA defines for complex sources in driver/level2.
2015-02-24 13:18:07 -06:00
Hank Anderson
0d8e227ea7
Changed strategy for setting preprocessor definitions.
...
Instead of generating separate object files for each permutation of
defines for a source file, GenerateNamedObjects now writes an entirely
new source file and inserts the defines as #define c statements.
This solves a problem I ran into with ar.exe where it was refusing to
link objects that had the same filename despite having different paths.
2015-02-24 12:26:33 -06:00
Hank Anderson
12d1fb2e40
Fixed incorrect object name in kernel CMakeLists.txt
2015-02-24 10:30:16 -06:00
Hank Anderson
1b7f427401
Added conj gemv objects for complex build.
2015-02-23 10:24:31 -06:00
Hank Anderson
b2284647a3
More complex objects.
2015-02-23 07:51:05 -06:00
Hank Anderson
a6116e5859
Added some more complex-only objects.
2015-02-22 17:49:28 -06:00
Hank Anderson
fb5d5bb971
Added defines for complex trmv.
2015-02-21 12:39:03 -06:00
Hank Anderson
371071d461
Added CONJ defines for trmm/trsm.
2015-02-21 10:59:02 -06:00
Hank Anderson
8a143516e3
Added alternate_name to a couple of the name mangling schemes.
...
Added zherk_k sources to driver/level3.
2015-02-20 17:03:33 -06:00
Hank Anderson
e5897ecb9b
Added zherk_kernel.c objects to driver/level3.
2015-02-19 16:19:56 -06:00
Hank Anderson
714638c187
Added some TRMM objects for complex types.
2015-02-19 16:11:51 -06:00
Hank Anderson
e27c372e53
Fixed reuse of float_char from parent loop.
...
Fixed in/it/on/otcopy names.
2015-02-19 13:53:29 -06:00
Hank Anderson
f3f2b3d768
Added complex and single netlib-lapack fortran sources to lapack.cmake.
2015-02-19 12:26:11 -06:00
Hank Anderson
9492298048
Added other float types to Makefile.L3.
2015-02-18 13:01:05 -06:00
Hank Anderson
43725b82c5
ParseMakefileVars now replaces Makefile vars with CMake vars.
2015-02-18 12:23:17 -06:00