Commit Graph

7452 Commits

Author SHA1 Message Date
Xianyi Zhang 03b0eb19f7 Refs #86. Test alpha=Nan in x86/x86_64 dscale. 2012-04-05 18:16:18 +08:00
Xianyi Zhang 910338f071 Merge branch 'develop' into sandybridge 2012-04-05 16:25:57 +08:00
Xianyi Zhang fad089ffff Fixed #84 the MD5 command line bug on Mac OSX. 2012-04-05 16:21:40 +08:00
Xianyi Zhang 19a48b82cf Init Sandybridge codes based on Nehalem. 2012-03-30 20:01:03 +08:00
Xianyi Zhang 09b2417848 Fixed a typo in license file. 2012-03-27 14:17:13 +08:00
Xianyi Zhang 5cbbc496b0 Merge branch 'release-0.1.0' into develop 2012-03-23 18:53:51 +08:00
Xianyi Zhang 09f74f6d23 Merge branch 'release-0.1.0' 2012-03-23 18:52:40 +08:00
Xianyi Zhang 2b3eae6cc7 Ref #70 Updated Changelog.txt. 2012-03-23 18:45:54 +08:00
Xianyi Zhang 0b89a7a92d Ref #82. Disable outputing debug information in alloc_mmap. 2012-03-23 18:17:12 +08:00
Xianyi Zhang 91ce66a0a8 Ref #82 fixed the bug in my_mbind function. 2012-03-23 15:15:05 +08:00
Xianyi Zhang ccdba3c771 Updated the version to 0.1.0. 2012-03-23 01:29:05 +08:00
Xianyi Zhang 3871b6a86d Merge branch 'loongson3b' into release-0.1.0 2012-03-23 01:26:44 +08:00
Xianyi Zhang 83ecfbb9b3 Merge branch 'loongson3a' into release-0.1.0 2012-03-23 01:26:27 +08:00
Xianyi Zhang 31c836ac25 Ref #79 Added GEMM_MULTITHREAD_THRESHOLD flag to use single thread in gemm function with small matrices. 2012-03-23 01:17:41 +08:00
Xianyi Zhang 3afedbf6f0 Merge branch 'fix-crash_on_P4' into develop 2012-03-23 00:06:29 +08:00
Xianyi Zhang b4ec36debc Merge branch 'master' into develop 2012-03-23 00:00:13 +08:00
Xianyi Zhang 1f15bee02a Refs #81. Added LIBNAMESUFFIX flag in Makefile.rule. The user can use this flag to control the library name, e.g. libopenblas.a, libopenblas_ifort.a or libopenblas_omp.a. 2012-03-21 23:57:09 +08:00
unknown dff146e306 refs #80. Used GEMV SSE2 kernels on x86. 2012-03-19 17:56:22 +08:00
Xianyi Zhang 722dd08703 ref #80. On P4 CPU with 32-bit Windows XP, Octave crashed with OpenBLAS. Walkaroud: Use netlib reference gemv instead of own funtions.
For example, make USE_NETLIB_GEMV=1
2012-03-16 20:29:39 +08:00
Xianyi Zhang d047afe615 Set shared library soname in Linux. 2012-03-15 01:07:34 +08:00
Xianyi Zhang a7a7751be7 Export CBLAS funtions on Windows DLL. 2012-03-14 17:08:21 +08:00
Xianyi Zhang 0bbf955d4c Refs #74. Added -lgfortran into generating shared library. 2012-03-12 18:20:37 +08:00
Xianyi Zhang 70abe10fc0 Check new LAPACK version in generating shared library. 2012-03-07 23:14:25 +08:00
Xianyi Zhang 0a696bd4ce Improved the makefile for Intel compiler. 2012-02-20 23:36:58 +08:00
Xianyi Zhang fda39c6cb0 Updated the Changelog. 2012-02-20 09:06:43 +08:00
Xianyi Zhang 875da22a43 Merge pull request #77 from nolta/master
fix #49 the sched_yield warnings bug on Mac OS X.
2012-02-19 16:44:35 -08:00
Mike Nolta 363a563ec2 fix #49 2012-02-19 14:07:34 -05:00
Xianyi Zhang 8da6fdc2ce Merge branch 'hotfix-0.1alpha2.5' into develop 2012-02-19 23:11:06 +08:00
Xianyi Zhang 0caa5616f2 Merge branch 'hotfix-0.1alpha2.5' 2012-02-19 22:56:06 +08:00
Xianyi Zhang 727e6d83c0 Released 0.1 alpha 2.5. Updated the documents. 2012-02-19 22:55:31 +08:00
Xianyi Zhang da3f101a77 Merge branch 'develop' into hotfix-0.1alpha2.5 2012-02-19 22:31:09 +08:00
Xianyi Zhang fe613de8e1 refs #69. Auto-detect Intel Core i6/i7 (Sandy Bridge) CPU with Nehalem assembly kernels. 2012-02-13 19:20:35 +08:00
Xianyi Zhang 142e99d4e9 Merge branch 'master' into develop 2012-01-20 21:32:13 +08:00
traz 7af0139a09 Modify P Q R size of Loongson3b. 2012-01-11 16:05:39 +00:00
Wang Qian 8e53b57bb2 Appending gemmkernel and trmmkernel C code in kernel/generic, this code can be used to execute on a new platform which dose not have optimized assemble kernel. 2012-01-10 17:16:13 +00:00
Xianyi Zhang 0d3647c395 Merge pull request #76 from StefanKarpinski/patch-1
Fix #68: don't require SystemStubs on OS X. SystemStubs does not exist on Lion.
2012-01-01 05:57:25 -08:00
Stefan Karpinski 0d76196a09 Fix #68: don't require SystemStubs on OS X. 2011-12-28 23:53:20 -05:00
traz b281f3dee4 Merge remote branch 'origin/loongson3a' into loongson3b 2011-12-06 13:49:39 +00:00
traz a4292976e9 Adding detection of complex situations in symm.c, otherwise the buffer address of sb will overlap the end of sa. 2011-12-05 14:54:25 +00:00
Wang Qian c2dad58ad1 Adding n32 multiple threads condition. 2011-12-01 16:33:11 +00:00
Xianyi Zhang d5a6d789e6 Fixed a typo in Makefile. 2011-11-28 15:31:46 +08:00
Xianyi Zhang 875dde437d Merge branch 'lapack_3.4.0' into develop 2011-11-28 15:28:54 +08:00
Xianyi Zhang 5be22ca80d Refs #72. Upgraded LAPACK to 3.4.0 version. 2011-11-28 15:28:22 +08:00
Wang Qian 66904fc4e8 BLAS3 used standard MIPS instructions without extensions on Loongson 3B. 2011-11-25 11:20:25 +00:00
Wang Qian 8163ab7e55 Change the block size on Loongson 3B. 2011-11-23 18:41:49 +00:00
Xianyi Zhang ef6f7f32ae Fixed mbind bug on Loongson 3B. Check the return value of my_mbind function. 2011-11-23 17:17:41 +00:00
Xianyi Zhang 285e69e2d1 Disable using simple thread level3 to fix a bug on Loongson 3B. 2011-11-17 16:46:26 +00:00
Xianyi Zhang d1baf14a64 Enable thread affinity on Loongson 3B. Fixed the bug of reading cycle counter.
In Loongson 3A and 3B, the CPU core increases the counter in every 2 cycles by default.
2011-11-11 17:49:41 +00:00
Xianyi Zhang 0884f6b78d Merge branch 'loongson3a' of github.com:xianyi/OpenBLAS into loongson3b 2011-11-11 14:26:49 +00:00
traz 2d78fb05c8 Add conjugate condition to gemv. 2011-11-10 15:38:48 +00:00