d346c533b1 
								
							 
						 
						
							
							
								
								Fix z/ctrmv stack allocation on AMD bulldozer and barcelona target  
							
							... 
							
							
							
							* Hopefully, because this was found by error and trial (dark magic)
* Ref #786  
							
						 
						
							2016-06-07 16:11:09 +02:00  
				
					
						
							
							
								 
						
							
								f04af36ad0 
								
							 
						 
						
							
							
								
								Merge pull request  #898  from wernsaar/develop  
							
							... 
							
							
							
							added experimental support for optimized lapack fortran functions 
							
						 
						
							2016-05-31 14:13:52 +02:00  
				
					
						
							
							
								 
						
							
								41000c8443 
								
							 
						 
						
							
							
								
								added directory for optimized lapack fortan codes and added dlaqr5.f  
							
							
							
						 
						
							2016-05-31 12:53:07 +02:00  
				
					
						
							
							
								 
						
							
								053044ae4d 
								
							 
						 
						
							
							
								
								Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR  
							
							... 
							
							
							
							If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project. 
							
						 
						
							2016-05-25 09:13:28 +02:00  
				
					
						
							
							
								 
						
							
								40af513669 
								
							 
						 
						
							
							
								
								Disable multi-threading in swap  
							
							... 
							
							
							
							* Close  #873  
							
						 
						
							2016-05-16 13:07:55 +00:00  
				
					
						
							
							
								 
						
							
								16ec5323c9 
								
							 
						 
						
							
							
								
								Fix zgemv.c compilation when stack allocation is disabled  
							
							
							
						 
						
							2016-02-08 12:05:02 +01:00  
				
					
						
							
							
								 
						
							
								5fc2203d8a 
								
							 
						 
						
							
							
								
								zgemv: Add a workaround for  #746  
							
							
							
						 
						
							2016-02-08 11:25:15 +01:00  
				
					
						
							
							
								 
						
							
								78dcf5c3d5 
								
							 
						 
						
							
							
								
								Improve performances of ztrmv on small matrices  
							
							... 
							
							
							
							* Use stack allocation
* Disable multi-threading
* Ref #727  
							
						 
						
							2016-02-08 11:25:02 +01:00  
				
					
						
							
							
								 
						
							
								32f793195f 
								
							 
						 
						
							
							
								
								Use stack allocation in zgemv and zger  
							
							... 
							
							
							
							For better performance with small matrices
Ref #727  
							
						 
						
							2016-02-08 11:24:21 +01:00  
				
					
						
							
							
								 
						
							
								1fe3aab047 
								
							 
						 
						
							
							
								
								Use GEMM_MULTITHREAD_THRESHOLD as a number of ops  
							
							... 
							
							
							
							...not a matrix size. For GEMM_MULTITHREAD_THRESHOLD=4
(the default value) this does not change anything but
for other values it make the GEMM and GEMV thresholds
changing in the same way.
Close  #742  
							
						 
						
							2016-01-24 11:31:40 +01:00  
				
					
						
							
							
								 
						
							
								1a1935507b 
								
							 
						 
						
							
							
								
								[z]ger: increase multithread threshold  
							
							... 
							
							
							
							The ones given in 3ae30cd 
							
						 
						
							2016-01-24 10:46:35 +01:00  
				
					
						
							
							
								 
						
							
								66eafb16cf 
								
							 
						 
						
							
							
								
								swap: disable multi-threading for small matrices  
							
							... 
							
							
							
							Close  #731  
						
							2016-01-19 17:14:46 +01:00  
				
					
						
							
							
								 
						
							
								3ae30cd6b9 
								
							 
						 
						
							
							
								
								Disable multi-threading for small matrices in [z]ger  
							
							... 
							
							
							
							Ref #731  
							
						 
						
							2016-01-19 17:14:31 +01:00  
				
					
						
							
							
								 
						
							
								87a2ccc37c 
								
							 
						 
						
							
							
								
								Factorize MAX_STACK_ALLOC code to common_stackalloc.h  
							
							... 
							
							
							
							Ref #727  
							
						 
						
							2016-01-08 16:03:52 +01:00  
				
					
						
							
							
								 
						
							
								f9890a6452 
								
							 
						 
						
							
							
								
								Fix compilation when MAX_STACK_ALLOC is not set  
							
							... 
							
							
							
							Close  #722  
						
							2015-12-31 14:43:09 +01:00  
				
					
						
							
							
								 
						
							
								285d042b10 
								
							 
						 
						
							
							
								
								Fixed rotg bug on ARM.  
							
							
							
						 
						
							2015-12-14 10:07:01 -06:00  
				
					
						
							
							
								 
						
							
								640cccc2b1 
								
							 
						 
						
							
							
								
								Refs  #697 . Fixed gemv bug for Windows.  
							
							... 
							
							
							
							Thank matzeri's patch. 
							
						 
						
							2015-11-30 15:19:45 -06:00  
				
					
						
							
							
								 
						
							
								55a0b27c01 
								
							 
						 
						
							
							
								
								Minor C code fixes in interface/  
							
							
							
						 
						
							2015-11-09 14:15:49 +05:30  
				
					
						
							
							
								 
						
							
								2feef49fa8 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into cmake  
							
							... 
							
							
							
							Conflicts:
	driver/others/memory.c 
							
						 
						
							2015-10-26 14:54:34 -05:00  
				
					
						
							
							
								 
						
							
								5a291606ad 
								
							 
						 
						
							
							
								
								Refs  #671 . the return of i?max cannot larger than N.  
							
							
							
						 
						
							2015-10-24 01:16:34 +08:00  
				
					
						
							
							
								 
						
							
								8fade093aa 
								
							 
						 
						
							
							
								
								Fixed cmake bug on Visual Studio.  
							
							
							
						 
						
							2015-10-20 14:37:22 -05:00  
				
					
						
							
							
								 
						
							
								94b125255f 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into cmake  
							
							... 
							
							
							
							Conflicts:
	driver/others/memory.c 
							
						 
						
							2015-10-13 04:46:08 +08:00  
				
					
						
							
							
								 
						
							
								baec8f5cac 
								
							 
						 
						
							
							
								
								Refs  #638 . Fixed compiling bug with clang on Mac OS X.  
							
							
							
						 
						
							2015-09-10 10:32:07 -05:00  
				
					
						
							
							
								 
						
							
								711ca33bc6 
								
							 
						 
						
							
							
								
								Improved Ximatcopy when lda==ldb.  
							
							... 
							
							
							
							The Ximatcopy functions create a copy of the input matrix
although they seem to work inplace. The new routines
XIMATCOPY_K_YY perform the operations inplace if the leading
dimension does not change. 
							
						 
						
							2015-09-07 14:36:16 +02:00  
				
					
						
							
							
								 
						
							
								f874465bb8 
								
							 
						 
						
							
							
								
								Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.  
							
							... 
							
							
							
							Disable CBLAS and LAPACK. 
							
						 
						
							2015-08-10 14:10:44 -05:00  
				
					
						
							
							
								 
						
							
								dcd5ba4443 
								
							 
						 
						
							
							
								
								Merge branch 'cmake' of  https://github.com/hpanderson/OpenBLAS  into hpanderson_cmake  
							
							
							
						 
						
							2015-07-22 04:06:39 +08:00  
				
					
						
							
							
								 
						
							
								f8f2e261fe 
								
							 
						 
						
							
							
								
								use only 1 thread if m or n < 2*GEMM_MULTITHREAD_THRESHOLD  
							
							
							
						 
						
							2015-05-06 10:41:53 +02:00  
				
					
						
							
							
								 
						
							
								ab567d8443 
								
							 
						 
						
							
							
								
								gemv: Ensure stack buffer is large enough to handle memory alignment  
							
							... 
							
							
							
							Ref #478  
							
						 
						
							2015-04-24 10:12:49 +02:00  
				
					
						
							
							
								 
						
							
								847e19c04e 
								
							 
						 
						
							
							
								
								Refs #478,#482, Enable stack alloc for s/dgemv_t.(revert 9798491)  
							
							
							
						 
						
							2015-04-20 23:22:40 -05:00  
				
					
						
							
							
								 
						
							
								fd9fd42936 
								
							 
						 
						
							
							
								
								Refs  #478 ,  #482 . Fixed bug on previous commit.  
							
							
							
						 
						
							2015-04-13 23:22:27 -05:00  
				
					
						
							
							
								 
						
							
								9798481979 
								
							 
						 
						
							
							
								
								Refs  #478 ,  #482 . Fix segfault bug for gemv_t with MAX_ALLOC_STACK flag.  
							
							... 
							
							
							
							For gemv_t, directly use malloc to create the buffer. 
							
						 
						
							2015-04-13 19:45:27 -05:00  
				
					
						
							
							
								 
						
							
								cdefdb21cd 
								
							 
						 
						
							
							
								
								Refs  #492 . Fixed c/zsyr bug with negative incx.  
							
							
							
						 
						
							2015-02-26 06:37:03 +08:00  
				
					
						
							
							
								 
						
							
								0d8e227ea7 
								
							 
						 
						
							
							
								
								Changed strategy for setting preprocessor definitions.  
							
							... 
							
							
							
							Instead of generating separate object files for each permutation of
defines for a source file, GenerateNamedObjects now writes an entirely
new source file and inserts the defines as #define c statements.
This solves a problem I ran into with ar.exe where it was refusing to
link objects that had the same filename despite having different paths. 
							
						 
						
							2015-02-24 12:26:33 -06:00  
				
					
						
							
							
								 
						
							
								b2284647a3 
								
							 
						 
						
							
							
								
								More complex objects.  
							
							
							
						 
						
							2015-02-23 07:51:05 -06:00  
				
					
						
							
							
								 
						
							
								a6116e5859 
								
							 
						 
						
							
							
								
								Added some more complex-only objects.  
							
							
							
						 
						
							2015-02-22 17:49:28 -06:00  
				
					
						
							
							
								 
						
							
								67e39bd8fb 
								
							 
						 
						
							
							
								
								Added mangled complex filenames to interface and lapack CMakeLists.txt.  
							
							
							
						 
						
							2015-02-17 13:12:30 -06:00  
				
					
						
							
							
								 
						
							
								9eb1499095 
								
							 
						 
						
							
							
								
								Added another param to GenerateNamedObjects to mangle complex source names.  
							
							... 
							
							
							
							There are a lot of sources for complex float types that are the same
names as the real sources, except with z prepended. 
							
						 
						
							2015-02-17 10:30:28 -06:00  
				
					
						
							
							
								 
						
							
								39cc6b21d3 
								
							 
						 
						
							
							
								
								Add ATLAS-style ?geadd function  
							
							
							
						 
						
							2015-02-16 13:46:20 +01:00  
				
					
						
							
							
								 
						
							
								4662a0b13a 
								
							 
						 
						
							
							
								
								Changed generate functions to iterate through a list of float types.  
							
							... 
							
							
							
							This will generate obj files for SINGLE/DOUBLE/COMPLEX/DOUBLE COMPLEX. 
							
						 
						
							2015-02-15 17:44:37 -06:00  
				
					
						
							
							
								 
						
							
								e74462a3f5 
								
							 
						 
						
							
							
								
								Moved declarations to start of functions to satisfy MSVC C89 implementation.  
							
							
							
						 
						
							2015-02-11 11:16:57 -06:00  
				
					
						
							
							
								 
						
							
								e8c39138c6 
								
							 
						 
						
							
							
								
								Removed return value from GenerateNamedObjects.  
							
							... 
							
							
							
							It sets DBLAS_OBJS directly to save a bunch of list appending in the
CMakeLists.txt files. 
							
						 
						
							2015-02-09 12:28:09 -06:00  
				
					
						
							
							
								 
						
							
								58cff2fed8 
								
							 
						 
						
							
							
								
								Added CBLAS define/naming convention to GenerateNamedObjects.  
							
							
							
						 
						
							2015-02-04 11:30:15 -06:00  
				
					
						
							
							
								 
						
							
								5690cf3f0e 
								
							 
						 
						
							
							
								
								Added override for function names in GenerateNamedObjects.  
							
							... 
							
							
							
							The BLAS interface folder should now be generated the correct objects
for the DOUBLE case. 
							
						 
						
							2015-02-04 10:52:19 -06:00  
				
					
						
							
							
								 
						
							
								a0aeda6187 
								
							 
						 
						
							
							
								
								Added function to set defines for the object names (e.g. -DNAME=dgemm).  
							
							
							
						 
						
							2015-02-04 10:37:34 -06:00  
				
					
						
							
							
								 
						
							
								20e593a44a 
								
							 
						 
						
							
							
								
								Added cblas_ objects to interface CMakeLists.  
							
							... 
							
							
							
							Naming isn't right, though, not seeing cblas_xxxx exports in the
resulting library. 
							
						 
						
							2015-02-02 16:25:30 -06:00  
				
					
						
							
							
								 
						
							
								9e154aba58 
								
							 
						 
						
							
							
								
								Added LAPACK object files to interface CMakeLists.  
							
							
							
						 
						
							2015-02-02 12:31:15 -06:00  
				
					
						
							
							
								 
						
							
								5057a4b4df 
								
							 
						 
						
							
							
								
								Added openblas add_library call that uses DBLAS_OBJS ojbects.  
							
							
							
						 
						
							2015-01-30 15:21:21 -06:00  
				
					
						
							
							
								 
						
							
								a6cf8aafc0 
								
							 
						 
						
							
							
								
								Updated level3/CMakeLists with correct defines using all combos.  
							
							
							
						 
						
							2015-01-30 11:21:50 -06:00  
				
					
						
							
							
								 
						
							
								b17ccb4c5c 
								
							 
						 
						
							
							
								
								Fix a segfault in gemv when MAX_STACK_ALLOC is set  
							
							... 
							
							
							
							* stack_alloc_size is needed after the implementation call
but it may be overwritten if it's optimized to a register,
because some gemv implementation (ex: dgemv_n.S) do not
restore all register (ex: r10).
* do the same in ger.c for the same reasons even if the bug
has not been observed. 
							
						 
						
							2015-01-29 09:55:57 +01:00  
				
					
						
							
							
								 
						
							
								5eefe18ae4 
								
							 
						 
						
							
							
								
								Added CMakeLists.txt for the first of the BLAS folders.  
							
							... 
							
							
							
							It only does the double precision compile currently.
I realized I didn't finish converting Makefile.system yet, so I made
a note of that. 
							
						 
						
							2015-01-27 16:17:17 -06:00