Zhang Xianyi
|
a07cc39571
|
Refs #266. Fixed the compiling bug with Open64 5.0.
|
2013-07-31 14:41:39 +08:00 |
Zhang Xianyi
|
5b504d6c23
|
Refs #263. Rollback bulldozer and piledriver kernels to barcelona kernels.
|
2013-07-28 17:39:24 +08:00 |
Zhang Xianyi
|
77b572fa0b
|
Merge branch 'loongson3a' into develop
Conflicts:
Makefile.system
|
2013-07-20 22:33:17 +08:00 |
Zhang Xianyi
|
b67252c2e4
|
Ensure the correct stack alignment on Win32.
|
2013-07-17 15:19:07 +08:00 |
Zhang Xianyi
|
e80e285928
|
Update build matrix for Travis CI.
|
2013-07-11 23:49:29 +08:00 |
Zhang Xianyi
|
6df39ad9e7
|
Refs #248. Support LAPACK and LAPACKE with lsbcc.
For LAPACKE, use LAPACK_COMPLEX_STRUCTURE.
The reason is that lsbcc didn't define complex I in complex.h.
|
2013-07-10 16:02:27 +08:00 |
Zhang Xianyi
|
3eb5af1955
|
Refs #247. Included the LAPACK source code to avoid downloading the tar.gz from netlib.org.
Based on version 3.4.2, with patch.for_lapack-3.4.2 applied.
|
2013-07-09 18:13:48 +08:00 |
Zhang Xianyi
|
f54f5bac9e
|
Refs #248. Fixed the LSB compatibility issue for BLAS only.
For example, make CC=lsbcc NO_LAPACK=1.
|
2013-07-09 15:38:03 +08:00 |
Zhang Xianyi
|
886cbaf4e4
|
Support AMD Piledriver by bulldozer kernels.
|
2013-07-06 12:06:43 -03:00 |
Zhang Xianyi
|
cc522aa21d
|
Use quiet make for Travis CI.
|
2013-07-05 14:52:57 +08:00 |
Zhang Xianyi
|
cd1d473ba0
|
Merge pull request #230 from wernsaar/develop
Refs #230. New dgemm and sgemm Kernel for BULLDOZER
|
2013-06-13 07:29:27 -07:00 |
Zhang Xianyi
|
56f160134d
|
Refs #231. Change the default C compiler to clang on Mac OSX.
|
2013-06-13 22:15:19 +08:00 |
wernsaar
|
d854b30ae6
|
Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3
|
2013-06-09 17:26:42 +02:00 |
Zhang Xianyi
|
960b0c88a7
|
Refs #227. Detected LLVM/Clang compiler.
|
2013-06-06 23:43:40 +08:00 |
Zhang Xianyi
|
f2fb8c7035
|
Change LIBSUFFIX from .lib to .a on windows.
|
2013-06-04 16:05:28 +08:00 |
Zhang Xianyi
|
357078b93e
|
Refs #216. Revert the default value of GEMM_MULTITHREAD_THRESHOLD to 4.
|
2013-05-03 09:08:54 +08:00 |
Zhang Xianyi
|
48bdc1ad3b
|
Added NO_PARALLEL_MAKE flag to disable parallel make.
|
2013-04-15 21:37:30 +08:00 |
Zhang Xianyi
|
990efcab6e
|
Merge branch 'loongson3b' into loongson3a
|
2013-04-11 16:11:03 +00:00 |
Zhang Xianyi
|
75a5dc3975
|
Added the configuration for compiling with the host loongcc on Loongson3.
|
2013-04-11 16:10:47 +00:00 |
Xianyi Zhang
|
6958c1a1aa
|
Fixed the SEGFAULT bug with Loongcc and Loongson3.
|
2013-04-11 15:33:43 +08:00 |
Xianyi Zhang
|
1a57717b1a
|
Added the configuration of Loongcc compiler for Loongson 3 CPU.
|
2013-04-07 15:42:07 +08:00 |
Zhang Xianyi
|
5c8bf6ae0e
|
Merge branch 'bulldozer' into develop
|
2013-02-10 01:19:42 +08:00 |
Zaheer Chothia
|
4db6660de4
|
Refs #185. Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey!
The 'const' modifications were done automatically using these scripts:
https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas
|
2013-01-20 22:52:51 +01:00 |
Zhang Xianyi
|
b7c0fa6bd2
|
Init AMD Bulldozer codebase.
|
2012-12-06 07:29:54 -05:00 |
Alexander Nasonov
|
e85549ee11
|
Fix NetBSD build.
|
2012-11-10 23:20:44 +00:00 |
Zhang Xianyi
|
08c177ca36
|
Refs #145. Update LAPACK to 3.4.2 version.
|
2012-09-29 23:14:39 +08:00 |
Zhang Xianyi
|
2573311308
|
Refs #140. Fixed the zdot ABI incompatibility issue with GCC 4.7 on Win32.
GCC 4.7 uses the MSVC ABI on Win32. This means the caller pops the hidden pointer for returning
aggregate structures larger than 8 bytes.
|
2012-09-24 20:34:33 +08:00 |
Zhang Xianyi
|
758e34efbb
|
Fixed the detection bug on Loongson 3A server.
|
2012-09-21 10:14:07 +00:00 |
Zhang Xianyi
|
f76a384841
|
Refs #139. Added NO_AVX flag to use old Nehalem kernels on Sandy Bridge.
For example, make NO_AVX=1 or make DYNAMIC_ARCH=1 NO_AVX=1
|
2012-09-17 23:25:46 +08:00 |
Jameson Nash
|
d0e731e8b8
|
Provide support for passing CFLAGS, FFLAGS, PFLAGS, FPFLAGS to make on the command line.
|
2012-08-21 00:31:12 -04:00 |
Zhang Xianyi
|
068861a927
|
Refs #133. Users can set COMMON_OPT flag to control CFLAGS and FFLAGS.
|
2012-08-10 14:36:26 +08:00 |
Zaheer Chothia
|
e8306f623a
|
Refs #127. Generate DLL without a version suffix on Windows.
|
2012-07-30 19:46:30 +02:00 |
Xianyi Zhang
|
25f1a573fd
|
Fixed the build bug when DYNAMIC_ARCH=0.
|
2012-07-07 12:12:24 +08:00 |
Xianyi Zhang
|
34fd3b85a8
|
Refs #113. Fixed BOBCATE typo in dynamic arch building.
|
2012-06-19 17:05:16 +08:00 |
Zhang Xianyi
|
d6cab3f37e
|
Refs #113. Support AMD Bobcat using Barcelona kernel code. Replace 3DNow! with MMX.
|
2012-05-31 18:17:45 +08:00 |
Xianyi Zhang
|
a53c6e2440
|
Merge branch 'develop' into sandybridge
|
2012-05-25 23:16:44 +08:00 |
Zaheer Chothia
|
14c3511e92
|
Respect C compiler set on the command line or inherited from the environment
|
2012-05-20 18:09:35 +02:00 |
Zhang Xianyi
|
06e208c5c3
|
Refs #106. Fixed wget and md5 bug on FreeBSD and NetBSD.
|
2012-05-13 11:43:29 +08:00 |
Xianyi Zhang
|
7f89edee3e
|
Refs #103. Increase GEMM_MULTITHREAD_THRESHOLD to 50.
|
2012-05-03 20:05:34 +08:00 |
Zaheer Chothia
|
f4eee224d8
|
Refs #93. Upgraded LAPACK to 3.4.1 version.
|
2012-04-23 17:15:22 +08:00 |
Xianyi Zhang
|
fd2ee0c9e2
|
Refs #88. Fixed the build bug in LAPACKE, the C interface to LAPACK.
|
2012-04-13 23:12:06 +08:00 |
Zaheer Chothia
|
78914475ae
|
Fixed #88. Build LAPACKE: C Interface to LAPACK.
|
2012-04-10 16:19:55 +08:00 |
Xianyi Zhang
|
910338f071
|
Merge branch 'develop' into sandybridge
|
2012-04-05 16:25:57 +08:00 |
Xianyi Zhang
|
fad089ffff
|
Fixed #84, the MD5 command-line bug on Mac OSX.
|
2012-04-05 16:21:40 +08:00 |
Xianyi Zhang
|
19a48b82cf
|
Init Sandybridge codes based on Nehalem.
|
2012-03-30 20:01:03 +08:00 |
Xianyi Zhang
|
3871b6a86d
|
Merge branch 'loongson3b' into release-0.1.0
|
2012-03-23 01:26:44 +08:00 |
Xianyi Zhang
|
31c836ac25
|
Refs #79. Added the GEMM_MULTITHREAD_THRESHOLD flag to use a single thread in the gemm function for small matrices.
|
2012-03-23 01:17:41 +08:00 |
Xianyi Zhang
|
1f15bee02a
|
Refs #81. Added LIBNAMESUFFIX flag in Makefile.rule. The user can use this flag to control the library name, e.g. libopenblas.a, libopenblas_ifort.a or libopenblas_omp.a.
|
2012-03-21 23:57:09 +08:00 |
Xianyi Zhang
|
d047afe615
|
Set shared library soname in Linux.
|
2012-03-15 01:07:34 +08:00 |
Xianyi Zhang
|
0bbf955d4c
|
Refs #74. Added -lgfortran when generating the shared library.
|
2012-03-12 18:20:37 +08:00 |