ccdba3c771 
								
							 
						 
						
							
							
								
								Updated the version to 0.1.0.  
							
							
							
						 
						
							2012-03-23 01:29:05 +08:00  
				
					
						
							
							
								 
						
							
								3871b6a86d 
								
							 
						 
						
							
							
								
								Merge branch 'loongson3b' into release-0.1.0  
							
							
							
						 
						
							2012-03-23 01:26:44 +08:00  
				
					
						
							
							
								 
						
							
								83ecfbb9b3 
								
							 
						 
						
							
							
								
								Merge branch 'loongson3a' into release-0.1.0  
							
							
							
						 
						
							2012-03-23 01:26:27 +08:00  
				
					
						
							
							
								 
						
							
								31c836ac25 
								
							 
						 
						
							
							
								
								Ref #79. Added the GEMM_MULTITHREAD_THRESHOLD flag so that the gemm function uses a single thread for small matrices.  
							
							
							
						 
						
							2012-03-23 01:17:41 +08:00  
				
					
						
							
							
								 
						
							
								3afedbf6f0 
								
							 
						 
						
							
							
								
								Merge branch 'fix-crash_on_P4' into develop  
							
							
							
						 
						
							2012-03-23 00:06:29 +08:00  
				
					
						
							
							
								 
						
							
								b4ec36debc 
								
							 
						 
						
							
							
								
								Merge branch 'master' into develop  
							
							
							
						 
						
							2012-03-23 00:00:13 +08:00  
				
					
						
							
							
								 
						
							
								1f15bee02a 
								
							 
						 
						
							
							
								
								Refs #81. Added the LIBNAMESUFFIX flag to Makefile.rule. Users can set this flag to control the library name, e.g. libopenblas.a, libopenblas_ifort.a or libopenblas_omp.a.  
							
							
							
						 
						
							2012-03-21 23:57:09 +08:00  
				
					
						
							
							
								 
						
							
								dff146e306 
								
							 
						 
						
							
							
								
								refs  #80 . Used GEMV SSE2 kernels on x86.  
							
							
							
						 
						
							2012-03-19 17:56:22 +08:00  
				
					
						
							
							
								 
						
							
								722dd08703 
								
							 
						 
						
							
							
								
								Ref #80. On a P4 CPU with 32-bit Windows XP, Octave crashed with OpenBLAS. Workaround: use the netlib reference gemv instead of OpenBLAS's own functions.  
							
							
							
							
							For example, make USE_NETLIB_GEMV=1 
							
						 
						
							2012-03-16 20:29:39 +08:00  
				
					
						
							
							
								 
						
							
								d047afe615 
								
							 
						 
						
							
							
								
								Set shared library soname in Linux.  
							
							
							
						 
						
							2012-03-15 01:07:34 +08:00  
				
					
						
							
							
								 
						
							
								a7a7751be7 
								
							 
						 
						
							
							
								
								Export CBLAS functions in the Windows DLL.  
							
							
							
						 
						
							2012-03-14 17:08:21 +08:00  
				
					
						
							
							
								 
						
							
								0bbf955d4c 
								
							 
						 
						
							
							
								
								Refs #74. Added -lgfortran when generating the shared library.  
							
							
							
						 
						
							2012-03-12 18:20:37 +08:00  
				
					
						
							
							
								 
						
							
								70abe10fc0 
								
							 
						 
						
							
							
								
								Check for the new LAPACK version when generating the shared library.  
							
							
							
						 
						
							2012-03-07 23:14:25 +08:00  
				
					
						
							
							
								 
						
							
								0a696bd4ce 
								
							 
						 
						
							
							
								
								Improved the makefile for Intel compiler.  
							
							
							
						 
						
							2012-02-20 23:36:58 +08:00  
				
					
						
							
							
								 
						
							
								fda39c6cb0 
								
							 
						 
						
							
							
								
								Updated the Changelog.  
							
							
							
						 
						
							2012-02-20 09:06:43 +08:00  
				
					
						
							
							
								 
						
							
								875da22a43 
								
							 
						 
						
							
							
								
								Merge pull request  #77  from nolta/master  
							
							
							
							
							fix  #49  the sched_yield warnings bug on Mac OS X. 
						
							2012-02-19 16:44:35 -08:00  
				
					
						
							
							
								 
						
							
								363a563ec2 
								
							 
						 
						
							
							
								
								fix   #49  
							
							
							
						 
						
							2012-02-19 14:07:34 -05:00  
				
					
						
							
							
								 
						
							
								8da6fdc2ce 
								
							 
						 
						
							
							
								
								Merge branch 'hotfix-0.1alpha2.5' into develop  
							
							
							
						 
						
							2012-02-19 23:11:06 +08:00  
				
					
						
							
							
								 
						
							
								0caa5616f2 
								
							 
						 
						
							
							
								
								Merge branch 'hotfix-0.1alpha2.5'  
							
							
							
						 
						
							2012-02-19 22:56:06 +08:00  
				
					
						
							
							
								 
						
							
								727e6d83c0 
								
							 
						 
						
							
							
								
								Released 0.1 alpha 2.5. Updated the documents.  
							
							
							
						 
						
							2012-02-19 22:55:31 +08:00  
				
					
						
							
							
								 
						
							
								da3f101a77 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into hotfix-0.1alpha2.5  
							
							
							
						 
						
							2012-02-19 22:31:09 +08:00  
				
					
						
							
							
								 
						
							
								fe613de8e1 
								
							 
						 
						
							
							
								
								refs  #69 . Auto-detect Intel Core i6/i7 (Sandy Bridge) CPU with Nehalem assembly kernels.  
							
							
							
						 
						
							2012-02-13 19:20:35 +08:00  
				
					
						
							
							
								 
						
							
								142e99d4e9 
								
							 
						 
						
							
							
								
								Merge branch 'master' into develop  
							
							
							
						 
						
							2012-01-20 21:32:13 +08:00  
				
					
						
							
							
								 
						
							
								7af0139a09 
								
							 
						 
						
							
							
								
								Modified the P, Q, and R sizes for Loongson 3B.  
							
							
							
						 
						
							2012-01-11 16:05:39 +00:00  
				
					
						
							
							
								 
						
							
								8e53b57bb2 
								
							 
						 
						
							
							
								
								Added gemmkernel and trmmkernel C code in kernel/generic. This code can be used on a new platform that does not have an optimized assembly kernel.  
							
							
							
						 
						
							2012-01-10 17:16:13 +00:00  
				
					
						
							
							
								 
						
							
								0d3647c395 
								
							 
						 
						
							
							
								
								Merge pull request  #76  from StefanKarpinski/patch-1  
							
							
							
							
							Fix  #68 : don't require SystemStubs on OS X. SystemStubs does not exist on Lion. 
						
							2012-01-01 05:57:25 -08:00  
				
					
						
							
							
								 
						
							
								0d76196a09 
								
							 
						 
						
							
							
								
								Fix   #68 : don't require SystemStubs on OS X.  
							
							
							
						 
						
							2011-12-28 23:53:20 -05:00  
				
					
						
							
							
								 
						
							
								b281f3dee4 
								
							 
						 
						
							
							
								
								Merge remote branch 'origin/loongson3a' into loongson3b  
							
							
							
						 
						
							2011-12-06 13:49:39 +00:00  
				
					
						
							
							
								 
						
							
								a4292976e9 
								
							 
						 
						
							
							
								
								Added detection of the complex case in symm.c; otherwise the buffer address of sb overlaps the end of sa.  
							
							
							
						 
						
							2011-12-05 14:54:25 +00:00  
				
					
						
							
							
								 
						
							
								c2dad58ad1 
								
							 
						 
						
							
							
								
								Added a multithreading condition for n32.  
							
							
							
						 
						
							2011-12-01 16:33:11 +00:00  
				
					
						
							
							
								 
						
							
								d5a6d789e6 
								
							 
						 
						
							
							
								
								Fixed a typo in Makefile.  
							
							
							
						 
						
							2011-11-28 15:31:46 +08:00  
				
					
						
							
							
								 
						
							
								875dde437d 
								
							 
						 
						
							
							
								
								Merge branch 'lapack_3.4.0' into develop  
							
							
							
						 
						
							2011-11-28 15:28:54 +08:00  
				
					
						
							
							
								 
						
							
								5be22ca80d 
								
							 
						 
						
							
							
								
								Refs  #72 . Upgraded LAPACK to 3.4.0 version.  
							
							
							
						 
						
							2011-11-28 15:28:22 +08:00  
				
					
						
							
							
								 
						
							
								66904fc4e8 
								
							 
						 
						
							
							
								
								BLAS3 used standard MIPS instructions without extensions on Loongson 3B.  
							
							
							
						 
						
							2011-11-25 11:20:25 +00:00  
				
					
						
							
							
								 
						
							
								8163ab7e55 
								
							 
						 
						
							
							
								
								Change the block size on Loongson 3B.  
							
							
							
						 
						
							2011-11-23 18:41:49 +00:00  
				
					
						
							
							
								 
						
							
								ef6f7f32ae 
								
							 
						 
						
							
							
								
								Fixed mbind bug on Loongson 3B. Check the return value of my_mbind function.  
							
							
							
						 
						
							2011-11-23 17:17:41 +00:00  
				
					
						
							
							
								 
						
							
								285e69e2d1 
								
							 
						 
						
							
							
								
								Disable using simple thread level3 to fix a bug on Loongson 3B.  
							
							
							
						 
						
							2011-11-17 16:46:26 +00:00  
				
					
						
							
							
								 
						
							
								d1baf14a64 
								
							 
						 
						
							
							
								
								Enable thread affinity on Loongson 3B. Fixed the bug of reading cycle counter.  
							
							
							
							
							On Loongson 3A and 3B, the CPU core increments the counter once every 2 cycles by default. 
							
						 
						
							2011-11-11 17:49:41 +00:00  
				
					
						
							
							
								 
						
							
								0884f6b78d 
								
							 
						 
						
							
							
								
								Merge branch 'loongson3a' of github.com:xianyi/OpenBLAS into loongson3b  
							
							
							
						 
						
							2011-11-11 14:26:49 +00:00  
				
					
						
							
							
								 
						
							
								2d78fb05c8 
								
							 
						 
						
							
							
								
								Add conjugate condition to gemv.  
							
							
							
						 
						
							2011-11-10 15:38:48 +00:00  
				
					
						
							
							
								 
						
							
								b95ad4cfaf 
								
							 
						 
						
							
							
								
								Support detecting ICT Loongson-3B CPU.  
							
							
							
						 
						
							2011-11-09 19:29:50 +00:00  
				
					
						
							
							
								 
						
							
								3bbe3ddb31 
								
							 
						 
						
							
							
								
								Merge branch 'develop' of github.com:xianyi/OpenBLAS into loongson3b  
							
							
							
						 
						
							2011-11-09 19:08:29 +00:00  
				
					
						
							
							
								 
						
							
								a32e56500a 
								
							 
						 
						
							
							
								
								Fixed the computation error in gemv when incx and incy are negative.  
							
							
							
						 
						
							2011-11-04 19:32:21 +00:00  
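
The commit above does not say what the gemv error was. As background only, here is a minimal sketch of how the reference BLAS treats negative increments: a vector with a negative increment is traversed backwards, starting from offset (n - 1) * |inc|. The names `start_index` and `gemv_notrans` are hypothetical and are not OpenBLAS functions; the matrix is a plain row-major list of lists for illustration.

```python
def start_index(n, inc):
    """Offset of the first logical element of an n-element strided vector.

    Follows the reference-BLAS convention (0-based here): a negative
    increment means the vector is stored back to front.
    """
    return 0 if inc > 0 else (n - 1) * (-inc)


def gemv_notrans(m, n, alpha, a, x, incx, beta, y, incy):
    """Compute y := alpha * A * x + beta * y for an m-by-n matrix A.

    `a` is a row-major list of lists; `x` and `y` are flat lists that are
    read and written with strides incx and incy (either sign, never zero).
    """
    if incx == 0 or incy == 0:
        raise ValueError("increments must be nonzero")
    kx = start_index(n, incx)
    ky = start_index(m, incy)
    for i in range(m):
        acc = 0.0
        for j in range(n):
            # Logical element j of x lives at kx + j * incx.
            acc += a[i][j] * x[kx + j * incx]
        iy = ky + i * incy
        y[iy] = alpha * acc + beta * y[iy]
    return y
```

With incx = -1, a vector stored as [3.0, 2.0, 1.0] is read as the logical vector (1, 2, 3), and with incy = -1 the first result element is written to the last storage slot.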
				
					
						
							
							
								 
						
							
								c1e618ea2d 
								
							 
						 
						
							
							
								
								Add complete gemv function on Loongson3a platform.  
							
							
							
						 
						
							2011-11-03 13:53:48 +00:00  
				
					
						
							
							
								 
						
							
								19f5b5c132 
								
							 
						 
						
							
							
								
								Fixed #66: the bug in the zgemv kernel with a transposed matrix on 64-bit MinGW (Windows).  
							
							
							
						 
						
							2011-10-18 18:44:23 +08:00  
				
					
						
							
							
								 
						
							
								c852ce3981 
								
							 
						 
						
							
							
								
								Ref  #65 . Fixed 64-bit Windows calling convention bug in cdot and zdot.  
							
							
							
							
							According to the 64-bit Windows calling convention, the return value of the cdot kernel is in %rax instead of %xmm0.
In zdot, the caller allocates memory for the return value and passes its address as the hidden first parameter. Thus, the callee (zdot) should store the result in this memory and return its address in %rax. 
							
						 
						
							2011-10-18 10:23:17 +08:00  
				
					
						
							
							
								 
						
							
								ba31b19c00 
								
							 
						 
						
							
							
								
								Ref  #62 . In OpenMP implementation, check the return value of omp_get_max_threads().  
							
							
							
							
							This ensures that the return value is the same as blas_cpu_numbers, an internal global variable that stores the number of threads in OpenBLAS. 
							
						 
						
							2011-10-16 23:00:43 +08:00  
				
					
						
							
							
								 
						
							
								66a3c6df4e 
								
							 
						 
						
							
							
								
								Ref #63. Fixed the DLL generation bug on mingw-w64.  
							
							
							
						 
						
							2011-10-09 17:25:44 +08:00  
				
					
						
							
							
								 
						
							
								57658a8c14 
								
							 
						 
						
							
							
								
								Ref #62. Added a user-friendly message for USE_OPENMP=1. Users should set OMP_NUM_THREADS.  
							
							
							
							
							When OpenBLAS is compiled with USE_OPENMP=1, it ignores the OPENBLAS_NUM_THREADS and GOTO_NUM_THREADS flags. Therefore, you should use OMP_NUM_THREADS instead.
If OMP_NUM_THREADS is not set, each process uses the maximum number of threads available on a computing node. Thus, if 2 processes run on the same node, their threads contend with each other for the CPU cores. As a result, the application will hang. 
							
						 
						
							2011-10-09 15:14:48 +08:00  
				
					
						
							
							
								 
						
							
								9fe3049de6 
								
							 
						 
						
							
							
								
								Added conditional compilation (#if defined(LOONGSON3A)) to avoid affecting the performance of other platforms.  
							
							
							
						 
						
							2011-09-26 15:21:45 +00:00