99c7bba8e4 
								
							 
						 
						
							
							
								
								Initial support for SkylakeX / AVX512  
							
							... 
							
							
							
							This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change. 
							
						 
						
							2018-06-03 07:58:52 +00:00  
				
					
						
							
							
								 
						
							
								1b83341d19 
								
							 
						 
						
							
							
								
								Fix race condition in blas_server_omp.c  
							
							... 
							
							
							
							Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d 
							
						 
						
							2018-04-27 17:00:42 +08:00  
				
					
						
							
							
								 
						
							
								f81815e48a 
								
							 
						 
						
							
							
								
								Fix CMake cross-compiling  
							
							... 
							
							
							
							Without specifying thread count, NUM_THREADS would not be defined and CMake would fail.
This is because core count cannot be determined when cross-compiling. 
							
						 
						
							2018-02-28 10:25:25 +10:00  
				
					
						
							
							
								 
						
							
								e0ddd7d124 
								
							 
						 
						
							
							
								
								Allow overriding NUM_THREADS  
							
							
							
						 
						
							2017-12-01 01:42:45 -06:00  
				
					
						
							
							
								 
						
							
								962b20a9bb 
								
							 
						 
						
							
							
								
								Optionally add ReLAPACK to LIB_COMPONENTS  
							
							
							
						 
						
							2017-10-12 17:02:01 +02:00  
				
					
						
							
							
								 
						
							
								47ebce4d1a 
								
							 
						 
						
							
							
								
								Clean up, fix old typos. Simplify arch usages. Move system arch check to earlier position.  
							
							
							
						 
						
							2017-08-21 00:37:29 +10:00  
				
					
						
							
							
								 
						
							
								69b560751c 
								
							 
						 
						
							
							
								
								Improvements to previous commit (cross-compile).  
							
							... 
							
							
							
							Fix typos and bad if statements discovered in 0.2.20. 
							
						 
						
							2017-08-20 22:50:31 +10:00  
				
					
						
							
							
								 
						
							
								0a7a527a92 
								
							 
						 
						
							
							
								
								Add support for cross compiling.  
							
							... 
							
							
							
							Add support for not having host compiler as CMake cannot detect such a compiler.
Add support for not using getarch.
Successfully builds Android ARMV8. Any target can be added by supplying the TARGET_CORE config in prebuild.cmake. 
							
						 
						
							2017-08-20 20:08:53 +10:00  
				
					
						
							
							
								 
						
							
								6aac06587d 
								
							 
						 
						
							
							
								
								Fix typos and use CMake OpenMP support.  
							
							
							
						 
						
							2017-08-17 17:27:01 +10:00  
				
					
						
							
							
								 
						
							
								7c1acc07f0 
								
							 
						 
						
							
							
								
								Fix bug that required fortran. Fix bug that needed CXX var. Remove redundant set vars. Fix threading detection. Do not attempt to run code if cross compiling.  
							
							
							
						 
						
							2017-08-17 03:32:04 +10:00  
				
					
						
							
							
								 
						
							
								38d273ea03 
								
							 
						 
						
							
							
								
								Drop some redundant vars and improve arch detection in CMake.  
							
							
							
						 
						
							2017-08-17 02:04:36 +10:00  
				
					
						
							
							
								 
						
							
								90a4dab501 
								
							 
						 
						
							
							
								
								Let CMake deal with build type.  
							
							
							
						 
						
							2017-08-17 00:35:54 +10:00  
				
					
						
							
							
								 
						
							
								4260215adf 
								
							 
						 
						
							
							
								
								Support DYNAMIC_ARCH with cmake  
							
							
							
						 
						
							2017-08-01 22:25:52 +05:30  
				
					
						
							
							
								 
						
							
								65e56cb29d 
								
							 
						 
						
							
							
								
								Add 64bit support for Microsoft Visual Studio  
							
							
							
						 
						
							2017-06-21 13:38:22 -07:00  
				
					
						
							
							
								 
						
							
								c9ff735da6 
								
							 
						 
						
							
							
								
								Add ZEN support (tested for auto-detected static backend)  
							
							
							
						 
						
							2017-03-19 15:32:50 +01:00  
				
					
						
							
							
								 
						
							
								053044ae4d 
								
							 
						 
						
							
							
								
								Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR  
							
							... 
							
							
							
							If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project. 
							
						 
						
							2016-05-25 09:13:28 +02:00  
				
					
						
							
							
								 
						
							
								53b6023a6c 
								
							 
						 
						
							
							
								
								Fix cmake bug on MSVC 32-bit.  
							
							
							
						 
						
							2015-10-26 14:52:13 -05:00  
				
					
						
							
							
								 
						
							
								309875de3c 
								
							 
						 
						
							
							
								
								Fix cmake bug on x86 32-bit.  
							
							... 
							
							
							
							e.g. Build 32-bit on 64-bit Linux.
cmake -DBINARY=32 
							
						 
						
							2015-10-27 02:54:53 +08:00  
				
					
						
							
							
								 
						
							
								f874465bb8 
								
							 
						 
						
							
							
								
								Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.  
							
							... 
							
							
							
							Disable CBLAS and LAPACK. 
							
						 
						
							2015-08-10 14:10:44 -05:00  
				
					
						
							
							
								 
						
							
								1d183dcda8 
								
							 
						 
						
							
							
								
								Added lapacke sources.  
							
							
							
						 
						
							2015-02-25 16:51:08 -06:00  
				
					
						
							
							
								 
						
							
								4662a0b13a 
								
							 
						 
						
							
							
								
								Changed generate functions to iterate through a list of float types.  
							
							... 
							
							
							
							This will generate obj files for SINGLE/DOUBLE/COMPLEX/DOUBLE COMPLEX. 
							
						 
						
							2015-02-15 17:44:37 -06:00  
				
					
						
							
							
								 
						
							
								d60b49e5c5 
								
							 
						 
						
							
							
								
								Turned off uninizialized variable warning when compiling lapack-netlib.  
							
							
							
						 
						
							2015-02-10 14:36:43 -06:00  
				
					
						
							
							
								 
						
							
								2828f6630c 
								
							 
						 
						
							
							
								
								Added SMP sources to COMMONOBJS.  
							
							
							
						 
						
							2015-02-04 14:01:36 -06:00  
				
					
						
							
							
								 
						
							
								a0aeda6187 
								
							 
						 
						
							
							
								
								Added function to set defines for the object names (e.g. -DNAME=dgemm).  
							
							
							
						 
						
							2015-02-04 10:37:34 -06:00  
				
					
						
							
							
								 
						
							
								84b3d760c4 
								
							 
						 
						
							
							
								
								Converted rest of Makefile.system to system.cmake.  
							
							
							
						 
						
							2015-02-03 16:05:01 -06:00  
				
					
						
							
							
								 
						
							
								0beea3a5a5 
								
							 
						 
						
							
							
								
								Converted LAPACK flags from Makefile.system.  
							
							
							
						 
						
							2015-02-03 15:33:56 -06:00  
				
					
						
							
							
								 
						
							
								0ccfa60a53 
								
							 
						 
						
							
							
								
								Changed fortran compiler name to be uppercase and stripped of path/ext.  
							
							
							
						 
						
							2015-02-03 15:09:37 -06:00  
				
					
						
							
							
								 
						
							
								30be551502 
								
							 
						 
						
							
							
								
								Corrected fortran compiler name variables.  
							
							... 
							
							
							
							Fixed some typos.
Updated c_check to set ARCH and BINARY64/32.
Added version variables. 
							
						 
						
							2015-02-03 14:21:22 -06:00  
				
					
						
							
							
								 
						
							
								e818ace11a 
								
							 
						 
						
							
							
								
								Ported more of Makefile.system to CMake.  
							
							
							
						 
						
							2015-02-03 13:34:41 -06:00  
				
					
						
							
							
								 
						
							
								2d5b442f5b 
								
							 
						 
						
							
							
								
								Ported Fortran configuration code from Makefile.system to fc.cmake.  
							
							
							
						 
						
							2015-02-03 12:32:23 -06:00  
				
					
						
							
							
								 
						
							
								af11aff309 
								
							 
						 
						
							
							
								
								Ported C compiler settings from Makefile.system into new cmake file.  
							
							
							
						 
						
							2015-02-03 12:00:49 -06:00  
				
					
						
							
							
								 
						
							
								e66aa5f3b7 
								
							 
						 
						
							
							
								
								Ported arch dependent settings from Makefile.system to new cmake file.  
							
							
							
						 
						
							2015-02-03 11:32:20 -06:00  
				
					
						
							
							
								 
						
							
								31cf22cb4b 
								
							 
						 
						
							
							
								
								Ported OS settings from Makefile.system into new cmake file.  
							
							
							
						 
						
							2015-02-03 11:07:58 -06:00  
				
					
						
							
							
								 
						
							
								8ede4a8da4 
								
							 
						 
						
							
							
								
								getarch now compiles and sets config.h defines properly.  
							
							... 
							
							
							
							Still isn't parsed into CMake variables, and getarch_2 needs to
get the same treatment. 
							
						 
						
							2015-01-28 17:18:26 -06:00  
				
					
						
							
							
								 
						
							
								5eefe18ae4 
								
							 
						 
						
							
							
								
								Added CMakeLists.txt for the first of the BLAS folders.  
							
							... 
							
							
							
							It only does the double precision compile currently.
I realized I didn't finish converting Makefile.system yet, so I made
a note of that. 
							
						 
						
							2015-01-27 16:17:17 -06:00  
				
					
						
							
							
								 
						
							
								f4d1e7a265 
								
							 
						 
						
							
							
								
								Hardcoded NUM_CORES to get system.cmake working.  
							
							
							
						 
						
							2015-01-27 11:37:39 -06:00  
				
					
						
							
							
								 
						
							
								e5c47e44f6 
								
							 
						 
						
							
							
								
								First pass at converting a few makefiles to CMake.  
							
							
							
						 
						
							2014-12-30 21:53:00 -06:00