2a9981a244 
								
							 
						 
						
							
							
								
								Add -lgomp when IBM xlf is combined with gcc in OPENMP builds  
							
							
							
						 
						
							2023-09-24 10:19:11 +02:00  
				
					
						
							
							
								 
						
							
								44e6e5479b 
								
							 
						 
						
							
							
								
								Use the C compiler for the C SBGEMM test source  
							
							
							
						 
						
							2023-09-21 23:01:21 +02:00  
				
					
						
							
							
								 
						
							
								b209915121 
								
							 
						 
						
							
							
								
								Fix build with clang  
							
							
							
							
							There are two instances when building the tests where OpenBLAS fails to
build with OpenMP and clang due to library paths getting reset as flags
are set rather than appended. This seems to only affect certain
clang/libomp installations, but if it's already grabbing the correct
library paths we might as well use them. 
							
						 
						
							2023-07-28 12:59:44 -07:00  
				
					
						
							
							
								 
						
							
								3d338b57de 
								
							 
						 
						
							
							
								
								remove spurious loops  
							
							
							
						 
						
							2022-08-08 17:09:45 +02:00  
				
					
						
							
							
								 
						
							
								d9dc015cfc 
								
							 
						 
						
							
							
								
								Use blasint for INTERFACE64 compatibility  
							
							
							
						 
						
							2022-08-08 14:52:10 +02:00  
				
					
						
							
							
								 
						
							
								1d97405c02 
								
							 
						 
						
							
							
								
								POWER: Enable bfloat16 kernels by default  
							
							
							
							
							This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems. 
							
						 
						
							2022-07-28 07:43:53 -05:00  
				
					
						
							
							
								 
						
							
								74dba0c90a 
								
							 
						 
						
							
							
								
								Fix missing braces from previous commit (PR3600)  
							
							
							
						 
						
							2022-04-09 20:03:36 +02:00  
				
					
						
							
							
								 
						
							
								bef3871833 
								
							 
						 
						
							
							
								
								Disable flang (over-)optimizations in BLAS tests (#3600)
							
							
							
							
							* limit flang optimizations to -O2 
							
						 
						
							2022-04-09 17:14:24 +02:00  
				
					
						
							
							
								 
						
							
								145ccf8943 
								
							 
						 
						
							
							
								
								Prevent powershell from adding a BOM to test input (#3595)
							
							
							
							
							* Prevent addition of a BOM to test input (which would distort the names of output files) 
							
						 
						
							2022-04-07 14:24:19 +02:00  
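The commit above concerns a byte order mark (BOM) ending up at the start of a test input file. As a rough illustration of what such a check could look like, the sketch below detects a leading UTF-8 or UTF-16 BOM in a buffer. The function name `bom_length` is hypothetical and is not taken from the repository, and UTF-32 marks are deliberately not handled.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not from the repository): return the length in
   bytes of a BOM at the start of buf, or 0 if none is recognised.
   Only UTF-8 and UTF-16 marks are checked; UTF-32 is out of scope. */
static size_t bom_length(const unsigned char *buf, size_t len)
{
    assert(buf != NULL || len == 0);

    /* UTF-8 BOM: EF BB BF */
    if (len >= 3 && buf[0] == 0xEF && buf[1] == 0xBB && buf[2] == 0xBF)
        return 3;
    /* UTF-16 little-endian BOM: FF FE */
    if (len >= 2 && buf[0] == 0xFF && buf[1] == 0xFE)
        return 2;
    /* UTF-16 big-endian BOM: FE FF */
    if (len >= 2 && buf[0] == 0xFE && buf[1] == 0xFF)
        return 2;
    return 0;
}
```

A caller could skip `bom_length(buf, len)` bytes before reading the first line, so that a file name on that line is not prefixed with stray bytes.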
				
					
						
							
							
								 
						
							
								2408315d10 
								
							 
						 
						
							
							
								
								Skip tests if Windows powershell added a BOM  
							
							
							
						 
						
							2022-03-22 21:37:55 +01:00  
				
					
						
							
							
								 
						
							
								5dc6aa74f0 
								
							 
						 
						
							
							
								
								Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3  
							
							
							
						 
						
							2021-07-30 14:46:19 +02:00  
				
					
						
							
							
								 
						
							
								b4f4ed378b 
								
							 
						 
						
							
							
								
								Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3  
							
							
							
						 
						
							2021-07-30 14:21:08 +02:00  
				
					
						
							
							
								 
						
							
								3cfdb1770c 
								
							 
						 
						
							
							
								
								Remove code that disabled EXTRALIB on RISCV C910V  
							
							
							
						 
						
							2021-07-06 20:21:07 +02:00  
				
					
						
							
							
								 
						
							
								30d835168a 
								
							 
						 
						
							
							
								
								Merge pull request #3088 from xoviat/msvc
							
							
							
							
							add misc fixes. 
							
						 
						
							2021-03-14 17:14:28 +01:00  
				
					
						
							
							
								 
						
							
								6ae7af78a3 
								
							 
						 
						
							
							
								
								Support compilation with nagfor  
							
							
							
						 
						
							2021-03-11 11:53:51 +01:00  
				
					
						
							
							
								 
						
							
								3165c915b6 
								
							 
						 
						
							
							
								
								fix test helpers  
							
							
							
						 
						
							2021-01-27 15:24:49 -06:00  
				
					
						
							
							
								 
						
							
								457ccc42c9 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into msvc  
							
							
							
						 
						
							2021-01-27 14:15:59 -06:00  
				
					
						
							
							
								 
						
							
								2b114c3f30 
								
							 
						 
						
							
							
								
								Restore proper Makefile  
							
							
							
						 
						
							2020-11-22 17:16:22 +01:00  
				
					
						
							
							
								 
						
							
								65eb7afaf4 
								
							 
						 
						
							
							
								
								Change ifndef CROSS to ifneq  
							
							
							
						 
						
							2020-11-22 16:25:36 +01:00  
				
					
						
							
							
								 
						
							
								ef8e7d0279 
								
							 
						 
						
							
							
								
								Add the support for RISC-V Vector.  
							
							
							
							
							Change-Id: Iae7800a32f5af3903c330882cdf6f292d885f266 
							
						 
						
							2020-10-15 16:09:02 +08:00  
				
					
						
							
							
								 
						
							
								c1f4f5d4e7 
								
							 
						 
						
							
							
								
								Replace Makefile with simplified version again  
							
							
							
						 
						
							2020-10-14 01:08:50 +02:00  
				
					
						
							
							
								 
						
							
								bc5c7f9578 
								
							 
						 
						
							
							
								
								Cleanup  
							
							
							
						 
						
							2020-10-13 19:56:09 +02:00  
				
					
						
							
							
								 
						
							
								b5d30b390d 
								
							 
						 
						
							
							
								
								Fix build issues with bfloat16  
							
							
							
							
							This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16. 
							
						 
						
							2020-10-13 11:00:22 -05:00  
				
					
						
							
							
								 
						
							
								5800758b43 
								
							 
						 
						
							
							
								
								Rename "HALF" and "sh" to "BFLOAT16" and "sb"  
							
							
							
						 
						
							2020-10-11 23:44:38 +02:00  
				
					
						
							
							
								 
						
							
								924fd806d0 
								
							 
						 
						
							
							
								
								Rename "HALF" and "sh" to "BFLOAT16" and "sb"  
							
							
							
						 
						
							2020-10-11 23:43:36 +02:00  
				
					
						
							
							
								 
						
							
								4db09c6cec 
								
							 
						 
						
							
							
								
								Rename compare_sgemm_shgemm.c to compare_sgemm_sbgemm.c  
							
							
							
						 
						
							2020-10-11 23:42:45 +02:00  
				
					
						
							
							
								 
						
							
								5f23bdf437 
								
							 
						 
						
							
							
								
								remove debug output  
							
							
							
						 
						
							2020-10-11 17:23:08 +02:00  
				
					
						
							
							
								 
						
							
								08f4749eb4 
								
							 
						 
						
							
							
								
								Adapt tests to having only a subset of types in the build  
							
							
							
						 
						
							2020-10-11 14:25:24 +02:00  
				
					
						
							
							
								 
						
							
								0eaae30e8c 
								
							 
						 
						
							
							
								
								Adapt tests to having only a subset of types in the build  
							
							
							
						 
						
							2020-09-22 23:28:03 +02:00  
				
					
						
							
							
								 
						
							
								593ce9e237 
								
							 
						 
						
							
							
								
								Make building individual tests depend on BUILD_SINGLE etc defines  
							
							
							
						 
						
							2020-09-13 21:50:12 +02:00  
				
					
						
							
							
								 
						
							
								18a11137f1 
								
							 
						 
						
							
							
								
								Update BLAS tests to correspond to Reference-LAPACK 3.9.0  
							
							
							
							
							Replaces the calculation of machine precision with a call to the epsilon intrinsic, and removes the requirement to delete previous output files before rerunning tests.
							
						 
						
							2020-06-14 10:26:25 +02:00  
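The commit above swaps a computed machine precision for the Fortran `EPSILON` intrinsic. The BLAS tests themselves are Fortran; the sketch below is only a C analogue, assuming `FLT_EPSILON` from `<float.h>` plays the role of the intrinsic. The function name `computed_epsilon` is hypothetical. It shows the classic halving loop that such an intrinsic makes unnecessary.

```c
#include <assert.h>
#include <float.h>

/* Hypothetical illustration: find machine epsilon for float by halving
   until adding half of eps to 1 no longer changes the result. */
static float computed_epsilon(void)
{
    /* volatile forces each sum to be rounded to float precision, even on
       targets that evaluate expressions in wider registers. */
    volatile float sum;
    float eps = 1.0f;

    for (;;) {
        sum = 1.0f + eps * 0.5f;
        if (sum <= 1.0f)
            break;
        eps *= 0.5f;
    }
    return eps;
}
```

On an IEEE-754 system with round-to-nearest, the loop should agree with the constant, so a check such as `assert(computed_epsilon() == FLT_EPSILON);` is expected to pass. Using the constant avoids the loop and its sensitivity to compiler optimizations.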
				
					
						
							
							
								 
						
							
								13c28889a2 
								
							 
						 
						
							
							
								
								Update "cosmetic fixes for non-C99 compilers"  
							
							
							
						 
						
							2020-06-06 15:22:27 +02:00  
				
					
						
							
							
								 
						
							
								28915eed72 
								
							 
						 
						
							
							
								
								Cosmetic fixes for non-C99 compilers  
							
							
							
						 
						
							2020-06-05 10:05:34 +02:00  
				
					
						
							
							
								 
						
							
								8efba9b7c0 
								
							 
						 
						
							
							
								
								Improve shgemm test  
							
							
							
							
							This patch adds another check to test shgemm results. 
							
						 
						
							2020-05-11 17:15:10 -05:00  
				
					
						
							
							
								 
						
							
								564b0d39ef 
								
							 
						 
						
							
							
								
								Add test for shgemm  
							
							
							
							
							This patch changes the Makefile to add a test for shgemm that
compares sgemm and shgemm results.
							
						 
						
							2020-04-29 13:40:34 -05:00  
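The idea of comparing a single-precision product with a bfloat16 product can be sketched without linking against the library at all. The code below is not the repository's test: every name in it (`bf16_t`, `to_bf16`, `from_bf16`, `ref_sgemm`, `ref_sbgemm`, `count_mismatches`) is hypothetical, the matrices are column-major, and the bfloat16 variant simply widens its inputs and accumulates in float.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

typedef uint16_t bf16_t;   /* hypothetical 2-byte storage type */

/* Truncate a float to its upper 16 bits. */
static bf16_t to_bf16(float f)
{
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);
    return (bf16_t)(bits >> 16);
}

/* Widen back to float by zero-filling the lower 16 bits. */
static float from_bf16(bf16_t h)
{
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* Naive C = A * B with A (m x k), B (k x n), C (m x n), column-major. */
static void ref_sgemm(int m, int n, int k,
                      const float *a, const float *b, float *c)
{
    assert(m > 0 && n > 0 && k > 0);
    for (int j = 0; j < n; j++)
        for (int i = 0; i < m; i++) {
            float acc = 0.0f;
            for (int l = 0; l < k; l++)
                acc += a[i + l * m] * b[l + j * k];
            c[i + j * m] = acc;
        }
}

/* Same product with bfloat16 inputs and float accumulation. */
static void ref_sbgemm(int m, int n, int k,
                       const bf16_t *a, const bf16_t *b, float *c)
{
    assert(m > 0 && n > 0 && k > 0);
    for (int j = 0; j < n; j++)
        for (int i = 0; i < m; i++) {
            float acc = 0.0f;
            for (int l = 0; l < k; l++)
                acc += from_bf16(a[i + l * m]) * from_bf16(b[l + j * k]);
            c[i + j * m] = acc;
        }
}

/* Count entries whose absolute difference exceeds tol. */
static int count_mismatches(const float *x, const float *y, int len, float tol)
{
    int bad = 0;
    for (int i = 0; i < len; i++) {
        float d = x[i] - y[i];
        if (d < 0.0f)
            d = -d;
        if (d > tol)
            bad++;
    }
    return bad;
}
```

With inputs that are exactly representable in bfloat16 the two results should match exactly; for general inputs a nonzero tolerance is needed because the narrower type keeps only 8 significant bits.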
				
					
						
							
							
								 
						
							
								7eb55504b1 
								
							 
						 
						
							
							
								
								RFC: Add half precision gemm for bfloat16 in OpenBLAS
							
							
							
							
							This patch adds support for a bfloat16 matrix multiplication kernel.
For architectures that don't support bfloat16, the type is defined as unsigned short
(2 bytes). Default unroll sizes can be changed per architecture, as is done for
SGEMM; for now 8 and 4 are used for M and N. The ncopy/tcopy size can likewise be
changed per architecture; for now, size 2 is used.
Added shgemm in kernel/power/KERNEL.POWER9 and tested on powerpc64le and
powerpc64. For reference, added a small test, compare_sgemm_shgemm.c, to compare
sgemm and shgemm output.
This patch does not cover the OpenBLAS test, benchmark, and LAPACK tests for shgemm.
A complex type implementation can be discussed and added once this is approved.
							
						 
						
							2020-04-14 14:55:08 -05:00  
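The commit above describes bfloat16 as falling back to an unsigned short (2 bytes) where no native type exists. A minimal sketch of that storage scheme follows, assuming IEEE-754 binary32 floats and conversion by plain truncation. The names `bf16_t`, `float_to_bf16` and `bf16_to_float` are hypothetical, and the library may round rather than truncate.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 2-byte bfloat16 storage type. */
typedef uint16_t bf16_t;

/* Keep the sign, the 8 exponent bits and the top 7 mantissa bits of a
   binary32 value, discarding the lower 16 bits (truncation, no rounding). */
static bf16_t float_to_bf16(float f)
{
    uint32_t bits;
    assert(sizeof(float) == sizeof(uint32_t));
    memcpy(&bits, &f, sizeof bits);
    return (bf16_t)(bits >> 16);
}

/* Widen back to float by placing the 16 stored bits in the upper half. */
static float bf16_to_float(bf16_t h)
{
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}
```

Values such as 1.0f or -2.5f survive the round trip unchanged, while 1.001f comes back as 1.0f because the discarded mantissa bits held the difference.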
				
					
						
							
							
								 
						
							
								2d8781b0dc 
								
							 
						 
						
							
							
								
								Do not attempt to run test without fortran  
							
							
							
						 
						
							2020-03-13 20:11:19 +01:00  
				
					
						
							
							
								 
						
							
								dbf9ad1f3d 
								
							 
						 
						
							
							
								
								tests: add windows compatibility  
							
							
							
						 
						
							2019-05-05 13:09:39 -05:00  
				
					
						
							
							
								 
						
							
								daf2fec12d 
								
							 
						 
						
							
							
								
								Misc. typo fixes  
							
							
							
							
							Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib` 
							
						 
						
							2019-04-29 17:03:56 -04:00  
				
					
						
							
							
								 
						
							
								6a5ab083b7 
								
							 
						 
						
							
							
								
								Handle special case of gfortran+clang+OpenMP  
							
							
							
						 
						
							2018-06-19 20:47:33 +02:00  
				
					
						
							
							
								 
						
							
								53026dc63a 
								
							 
						 
						
							
							
								
								Update single and double precision BLAS1 tests from LAPACK 3.8.0  
							
							
							
							
							adding tests for SROTMG, SROTM, SDSDOT, DROTMG, DROTM, DSDOT 
							
						 
						
							2018-02-18 12:44:14 +01:00  
				
					
						
							
							
								 
						
							
								4474465438 
								
							 
						 
						
							
							
								
								Remove _static usages for tests  
							
							
							
						 
						
							2017-08-20 00:13:46 +10:00  
				
					
						
							
							
								 
						
							
								d245caa49a 
								
							 
						 
						
							
							
								
								Support out-of-source build  
							
							
							
						 
						
							2017-08-01 15:16:14 +05:30  
				
					
						
							
							
								 
						
							
								053044ae4d 
								
							 
						 
						
							
							
								
								Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR  
							
							
							
							
							If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project. 
							
						 
						
							2016-05-25 09:13:28 +02:00  
				
					
						
							
							
								 
						
							
								3d50ccdc0d 
								
							 
						 
						
							
							
								
								allow building tests when CROSS compiling but don't run them  
							
							
							
						 
						
							2016-04-26 12:36:47 +03:00  
				
					
						
							
							
								 
						
							
								aca7d7e953 
								
							 
						 
						
							
							
								
								Detect cmake test result.  
							
							
							
						 
						
							2015-10-20 03:35:25 +08:00  
				
					
						
							
							
								 
						
							
								f8eba3d548 
								
							 
						 
						
							
							
								
								Fixed cmake build bugs on Linux.  
							
							
							
						 
						
							2015-08-11 16:25:16 -05:00  
				
					
						
							
							
								 
						
							
								9d7057366d 
								
							 
						 
						
							
							
								
								bugfix for GEMM3M functions  
							
							
							
						 
						
							2014-09-21 11:41:43 +02:00  
				
					
						
							
							
								 
						
							
								7f234f8ed1 
								
							 
						 
						
							
							
								
								added GEMM3M tests  
							
							
							
						 
						
							2014-09-21 10:55:08 +02:00  
				
					
						
							
							
								 
						
							
								d49fd33885 
								
							 
						 
						
							
							
								
								disabled SYMM3M and HEMM3M functions because of segmentation violations
							
							
							
						 
						
							2014-09-20 15:27:40 +02:00