wernsaar
|
3dabd7e6e6
|
added swap-kernel for all precisions
|
2013-11-14 19:06:19 +01:00 |
wernsaar
|
6f4a0ebe38
|
added max- and min-kernels for all precisions
|
2013-11-14 13:52:47 +01:00 |
Zhang Xianyi
|
5048a80032
|
Refs #283. Fixed the incorrect usage of long data type for Windows 64.
|
2013-11-14 13:46:42 +08:00 |
wernsaar
|
6e679266f8
|
changes for compatibility with Pathscale compiler
|
2013-11-13 18:32:42 +01:00 |
wernsaar
|
f1db386211
|
changes for compatibility with Pathscale compiler
|
2013-11-13 17:59:11 +01:00 |
wernsaar
|
6da558d2ab
|
changes for compatibility with Pathscale compiler
|
2013-11-13 17:39:13 +01:00 |
Zhang Xianyi
|
a2942456ef
|
Refs #307. Fixed the hang when freeing the OpenBLAS DLL on Windows.
|
2013-11-13 10:00:18 +08:00 |
wernsaar
|
f750103336
|
small optimizations on dot-kernels
|
2013-11-11 15:47:56 +01:00 |
wernsaar
|
00f33c0134
|
added asum_kernel for all precisions and complex
|
2013-11-11 14:20:59 +01:00 |
wernsaar
|
5b36cc0f47
|
added blas level1 dot kernels for complex and double complex
|
2013-11-08 09:08:11 +01:00 |
wernsaar
|
c8f1aeb154
|
added optimized blas level1 dot kernels for single and double precision
|
2013-11-07 17:22:03 +01:00 |
wernsaar
|
8fa93be06e
|
added optimized blas level1 copy kernels
|
2013-11-07 17:18:56 +01:00 |
wernsaar
|
1e8128f41c
|
added cgemm_tcopy_2_vfpv3.S and zgemm_tcopy_2_vfpv3.S
|
2013-11-07 17:15:50 +01:00 |
Zhang Xianyi
|
6d8095bcb9
|
Avoid the "argument list too long" issue in make clean.
|
2013-11-07 13:06:42 +08:00 |
Zhang Xianyi
|
2f5fdd2000
|
Refs #314. Fixed clang compiling bug on OSX.
|
2013-11-07 08:12:03 +08:00 |
wernsaar
|
80a2e901b1
|
added dgemm_tcopy_4_vfpv3.S and sgemm_tcopy_4_vfpv3.S
|
2013-11-06 20:01:18 +01:00 |
Zhang Xianyi
|
73770e60b8
|
Refs #309. Fixed trtri_U single thread computational bug.
|
2013-11-07 01:08:39 +08:00 |
wernsaar
|
ac50bccbd2
|
added cgemm_ncopy_2_vfpv3.S and made assembler labels unique
|
2013-11-05 20:21:35 +01:00 |
wernsaar
|
82015beaef
|
added zgemm_ncopy_2_vfpv3.S and made assembler labels unique
|
2013-11-05 19:31:22 +01:00 |
wernsaar
|
6216ab8a7e
|
removed obsolete gemm_kernels from haswell branch
|
2013-11-04 08:33:04 +01:00 |
wernsaar
|
370e3834a9
|
added missing file kernel/arm/Makefile
|
2013-11-03 11:54:39 +01:00 |
wernsaar
|
95aedfa0ff
|
added missing file arm/Makefile in lapack/laswp
|
2013-11-03 11:19:32 +01:00 |
wernsaar
|
cba97daf3c
|
added missing file cblas_noconst.h to the armv7 branch
|
2013-11-03 11:04:16 +01:00 |
wernsaar
|
5400a9f4e4
|
redefined functions for TIMING and YIELDING for ARMV7 processor
|
2013-11-03 10:34:04 +01:00 |
wernsaar
|
e31186efd4
|
deleted obsolete dgemm_kernel and dtrmm_kernel
|
2013-11-02 13:12:21 +01:00 |
wernsaar
|
2b801a00a5
|
small optimizations on sgemm_kernel for ARMV7
|
2013-11-02 13:06:11 +01:00 |
wernsaar
|
b3eab8fcb7
|
minor optimizations on zgemm_kernel for ARMV7
|
2013-11-02 09:43:53 +01:00 |
Zhang Xianyi
|
6d9d70c55c
|
Fixed #315. Added OPENBLAS_ prefix to openblas_config.h.
|
2013-11-02 15:59:00 +08:00 |
Zhang Xianyi
|
dfd1064d7b
|
Refs #287. Don't enable OpenMP for netlib LAPACK sequential Fortran codes.
|
2013-11-02 15:09:33 +08:00 |
wernsaar
|
02bc36ac79
|
added sgemm_ncopy routine and made some improvements on cgemm_kernel for ARMV7
|
2013-11-01 18:22:27 +01:00 |
wernsaar
|
5118a7f4d1
|
small optimizations on dgemm_kernel for Piledriver
|
2013-10-31 11:53:26 +01:00 |
wernsaar
|
e172b70ea2
|
added cgemm_kernel for Piledriver
|
2013-10-31 08:38:17 +01:00 |
wernsaar
|
1cf4b974b2
|
added zgemm_kernel for Piledriver
|
2013-10-30 09:12:17 +01:00 |
wernsaar
|
7bccff1512
|
added sgemm_kernel for PILEDRIVER
|
2013-10-29 22:53:04 +01:00 |
wernsaar
|
afe44b0241
|
tests and code cleanup of gemm_kernels for HASWELL
|
2013-10-28 14:23:48 +01:00 |
wernsaar
|
a77c71eaf5
|
added highly optimized dgemm_kernel for HASWELL
|
2013-10-28 10:23:47 +01:00 |
Zhang Xianyi
|
b2219b3478
|
Merge pull request #311 from loladiro/patch-1
Use FC instead of CC to link the dynamic library on OS X
|
2013-10-24 23:41:22 -07:00 |
Keno Fischer
|
f5a0038bad
|
Use FC instead of CC to link the dynamic library on OS X
Avoids problems of libgfortran not being found.
|
2013-10-23 18:43:00 -04:00 |
Zhang Xianyi
|
c937090121
|
Added gfortran dependency for LSB/lsbcc.
|
2013-10-22 13:24:47 +08:00 |
wernsaar
|
fe8c5666f9
|
optimized dgemm_kernel for HASWELL
|
2013-10-20 16:52:26 +02:00 |
wernsaar
|
f6b50057e2
|
corrected and tested FMA3 code
|
2013-10-19 10:52:20 +02:00 |
wernsaar
|
2840d56aeb
|
added dgemm_kernel for Piledriver
|
2013-10-19 09:47:15 +02:00 |
wernsaar
|
2d49db2f5b
|
moved compiler flags from Makefile.rule to Makefile.arm
|
2013-10-16 19:04:42 +02:00 |
wernsaar
|
04391e6d9c
|
optimized param.h
|
2013-10-16 18:04:34 +02:00 |
wernsaar
|
85484a42df
|
added kernels for cgemm, ctrmm, zgemm and ztrmm
|
2013-10-16 18:00:41 +02:00 |
wernsaar
|
3983011f0b
|
added sgemm- and strmm_kernel
|
2013-10-14 08:22:27 +02:00 |
wernsaar
|
2a1515c9dd
|
added dgemm_ncopy_4_vfpv3.S
|
2013-10-12 16:48:29 +02:00 |
wernsaar
|
31f51e78bc
|
minor optimizations on dgemm_kernel
|
2013-10-12 09:42:18 +02:00 |
wangqian
|
beffee7d91
|
Fixed buffer overflow bug in kernel/x86_64/dgemv_t.S file.
|
2013-10-11 03:20:20 +08:00 |
Zhang Xianyi
|
a35f4343fa
|
Merge pull request #301 from yieldthought/develop
Remove -Wl,--retain-symbols-file from dynamic link line to fix tool support
|
2013-10-09 00:46:49 -07:00 |