4a553e8678 
								
							 
						 
						
							
							
								
								Merge pull request  #1713  from martin-frbg/issue1710  
							
							... 
							
							
							
							Introduce blasabs macro and use it to switch between abs and labs for INTERFACE64 
							
						 
						
							2018-08-04 23:51:31 +02:00  
				
					
						
							
							
								 
						
							
								165f00c159 
								
							 
						 
						
							
							
								
								fabs -> fabsl  
							
							
							
						 
						
							2018-08-04 20:14:51 +02:00  
				
					
						
							
							
								 
						
							
								933896a1d0 
								
							 
						 
						
							
							
								
								Use blasabs to switch between abs and labs as needed for INTERFACE64  
							
							
							
						 
						
							2018-08-04 20:06:49 +02:00  
				
					
						
							
							
								 
						
							
								a4e321400b 
								
							 
						 
						
							
							
								
								fabs -> fabsl  
							
							... 
							
							
							
							Fixes two calls that were using `fabs` on a `long double` argument rather than `fabsl`, which looks like it is doing an unintentional truncation to `double` precision. 
							
						 
						
							2018-08-03 13:00:10 -04:00  
				
					
						
							
							
								 
						
							
								9cf22b7d91 
								
							 
						 
						
							
							
								
								Build cblas_iXamin interfaces  
							
							
							
						 
						
							2018-06-23 13:27:30 +02:00  
				
					
						
							
							
								 
						
							
								c2545b0fd6 
								
							 
						 
						
							
							
								
								Fixed a few more unnecessary calls to num_cpu_avail.  
							
							... 
							
							
							
							I don't have as many benchmarks for these as for gemm, but it should still
make a difference for small matrices. 
							
						 
						
							2018-06-11 10:17:16 +01:00  
				
					
						
							
							
								 
						
							
								66316b9f4c 
								
							 
						 
						
							
							
								
								Improve performance of GEMM for small matrices when SMP is defined.  
							
							... 
							
							
							
							Always checking num_cpu_avail() regardless of whether threading will actually
be used adds noticeable overhead for small matrices.  Most other uses of
num_cpu_avail() do so only if threading will be used, so do the same here. 
							
						 
						
							2018-06-07 15:29:13 +01:00  
				
					
						
							
							
								 
						
							
								e8880c1699 
								
							 
						 
						
							
							
								
								Use a single thread for small input size  
							
							... 
							
							
							
							copies daxpy improvement from #27 , see #1560  
							
						 
						
							2018-06-07 10:26:55 +02:00  
				
					
						
							
							
								 
						
							
								1d27fa8507 
								
							 
						 
						
							
							
								
								Merge pull request  #1539  from martin-frbg/ztrmv-1332  
							
							... 
							
							
							
							Disable multithreading in ztrmv 
							
						 
						
							2018-04-27 23:10:21 +02:00  
				
					
						
							
							
								 
						
							
								a8ed428bab 
								
							 
						 
						
							
							
								
								Disable multithreading in ztrmv  
							
							... 
							
							
							
							BLAS-Tester shows that the same problem exists as with DTRMV (issue #1332 ) 
							
						 
						
							2018-04-25 22:35:46 +02:00  
				
					
						
							
							
								 
						
							
								809fd0d451 
								
							 
						 
						
							
							
								
								Rewrite ROTMG to address cases not covered by the netlib algorithm ( #1480 )  
							
							... 
							
							
							
							* Rewrite ROTMG based on the new implementation in GONUM based on the algorithm proposed by Tim Hopkins, see issue 1452 for the reference
* Correct ROTMG utest for issue1452 and add another from gonum, also correct transposition of expected and observed values in error messages 
							
						 
						
							2018-03-04 17:39:56 +01:00  
				
					
						
							
							
								 
						
							
								72f14a0363 
								
							 
						 
						
							
							
								
								Fix conditionals in the rescaling against GAMSQ  
							
							
							
						 
						
							2018-02-18 12:54:52 +01:00  
				
					
						
							
							
								 
						
							
								798f1595d5 
								
							 
						 
						
							
							
								
								Fix condition in both second scaling loops  
							
							
							
						 
						
							2018-02-18 12:37:09 +01:00  
				
					
						
							
							
								 
						
							
								0464aa6784 
								
							 
						 
						
							
							
								
								Remove debug printfs  
							
							
							
						 
						
							2018-02-09 23:06:50 +01:00  
				
					
						
							
							
								 
						
							
								55840f0bc9 
								
							 
						 
						
							
							
								
								Keep the flag handling separate from the scaling loops  
							
							... 
							
							
							
							Fixes  #1452  and is more in line with how ATLAS does it. The earlier fix from #356  only moved the bug elsewhere, but we will never want the iterative rescaling to change the dflag setting and variable associations with each cycle. 
						
							2018-02-09 23:00:03 +01:00  
				
					
						
							
							
								 
						
							
								47deec2c1a 
								
							 
						 
						
							
							
								
								fix couple of dead assignment warnings  
							
							
							
						 
						
							2017-12-22 00:56:35 +01:00  
				
					
						
							
							
								 
						
							
								38763ec4f3 
								
							 
						 
						
							
							
								
								Disable multithreading for trmv  
							
							... 
							
							
							
							as a (hopefully temporary) workaround for #1332  
							
						 
						
							2017-12-03 22:40:54 +01:00  
				
					
						
							
							
								 
						
							
								9251a2efde 
								
							 
						 
						
							
							
								
								Merge pull request  #1359  from brada4/develop  
							
							... 
							
							
							
							Eliminate mode variable where not needed in syrk interface 
							
						 
						
							2017-11-18 23:47:17 +01:00  
				
					
						
							
							
								 
						
							
								b46e2b57cc 
								
							 
						 
						
							
							
								
								Make return parameter of cblas_Xdotc_sub, cblas_Xdotu_sub a void pointer as well  
							
							
							
						 
						
							2017-11-18 20:28:02 +01:00  
				
					
						
							
							
								 
						
							
								3ce401f51b 
								
							 
						 
						
							
							
								
								Make last parameter of cblas_Xdotc_sub/cblas_Xdotu_sub a void pointer as well  
							
							
							
						 
						
							2017-11-18 18:58:40 +01:00  
				
					
						
							
							
								 
						
							
								27575d200a 
								
							 
						 
						
							
							
								
								Eliminate mode variable where not needed  
							
							
							
						 
						
							2017-11-15 15:32:38 +01:00  
				
					
						
							
							
								 
						
							
								2c222f1faa 
								
							 
						 
						
							
							
								
								Modify complex CBLAS functions to take void pointers  
							
							... 
							
							
							
							Modify complex CBLAS functions to take void pointers instead of float or double arguments (to bring the prototypes in line with netlib and other implementations' cblas.h) 
							
						 
						
							2017-11-05 15:53:14 +01:00  
				
					
						
							
							
								 
						
							
								742f54c235 
								
							 
						 
						
							
							
								
								Merge pull request  #1303  from martin-frbg/imatcopy-rowscols  
							
							... 
							
							
							
							Fix cols/rows mixup in omatcopy 2nd step for BlasTrans cases 
							
						 
						
							2017-09-14 21:46:26 +02:00  
				
					
						
							
							
								 
						
							
								d674fbb4c7 
								
							 
						 
						
							
							
								
								Fix cols/rows mixup in omatcopy 2nd step for BlasTrans cases  
							
							... 
							
							
							
							Equivalent of #1244  (issue #899 ) for the non-complex cases. Fixes  #1289  
							
						 
						
							2017-09-14 19:59:05 +02:00  
				
					
						
							
							
								 
						
							
								46c9357c72 
								
							 
						 
						
							
							
								
								Merge pull request  #1288  from quickwritereader/develop  
							
							... 
							
							
							
							Optimized standard Blas Level-1,2 (excluding nrm2 functions) for z13 (double precision). Issue 884 
							
						 
						
							2017-09-09 23:47:17 +02:00  
				
					
						
							
							
								 
						
							
								1cfdb2295d 
								
							 
						 
						
							
							
								
								Optimized standard Blas Level-1,2 (excluding nrm2 functions) for z13 (double precision)  
							
							
							
						 
						
							2017-09-06 16:41:08 +04:00  
				
					
						
							
							
								 
						
							
								00740c0e34 
								
							 
						 
						
							
							
								
								Merge pull request  #1290  from martin-frbg/imatcopy  
							
							... 
							
							
							
							Use in-place transform shortcut only if matrix is square 
							
						 
						
							2017-09-03 13:02:10 +02:00  
				
					
						
							
							
								 
						
							
								254db9bd7c 
								
							 
						 
						
							
							
								
								Use in-place transform shortcut only if matrix is square  
							
							
							
						 
						
							2017-09-03 09:52:55 +02:00  
				
					
						
							
							
								 
						
							
								d245caa49a 
								
							 
						 
						
							
							
								
								Support out-of-source build  
							
							
							
						 
						
							2017-08-01 15:16:14 +05:30  
				
					
						
							
							
								 
						
							
								376048156b 
								
							 
						 
						
							
							
								
								Use in-place transform shortcut only if matrix is square  
							
							
							
						 
						
							2017-07-21 11:20:15 +02:00  
				
					
						
							
							
								 
						
							
								d1c5b8f913 
								
							 
						 
						
							
							
								
								Add files via upload  
							
							
							
						 
						
							2017-07-20 20:51:06 +02:00  
				
					
						
							
							
								 
						
							
								91bde7d315 
								
							 
						 
						
							
							
								
								Exchange rows and cols in final omatcopy with BlasTrans  
							
							... 
							
							
							
							This is MicMuc's patch from #899  
							
						 
						
							2017-07-15 22:02:53 +02:00  
				
					
						
							
							
								 
						
							
								1e06b49854 
								
							 
						 
						
							
							
								
								Update xerbla.c  
							
							
							
						 
						
							2017-04-26 20:29:30 +02:00  
				
					
						
							
							
								 
						
							
								7f546f54fa 
								
							 
						 
						
							
							
								
								Add cblas_xerbla  
							
							
							
						 
						
							2017-04-26 20:01:34 +02:00  
				
					
						
							
							
								 
						
							
								a809431e34 
								
							 
						 
						
							
							
								
								Add cblas_xerbla()  
							
							
							
						 
						
							2017-04-26 19:58:59 +02:00  
				
					
						
							
							
								 
						
							
								99880f7906 
								
							 
						 
						
							
							
								
								Address unlikely memleak in zimatcopy interface ( #1129 )  
							
							... 
							
							
							
							* fix unlikely memleak in zimatcopy interface
* fix only unlikely memleak in zimatcopy interface
* fix only unlikely memleak in zimatcopy interface 
							
						 
						
							2017-03-16 13:13:31 +01:00  
				
					
						
							
							
								 
						
							
								211d2eceb5 
								
							 
						 
						
							
							
								
								Update zdot.c  
							
							
							
						 
						
							2017-03-13 18:08:00 +01:00  
				
					
						
							
							
								 
						
							
								5813ed095b 
								
							 
						 
						
							
							
								
								Update zdot.c  
							
							
							
						 
						
							2017-03-13 17:49:07 +01:00  
				
					
						
							
							
								 
						
							
								e44b028fe5 
								
							 
						 
						
							
							
								
								Replace gnu _real_, _imag_ extensions in initializers  
							
							
							
						 
						
							2017-03-13 00:40:11 +01:00  
				
					
						
							
							
								 
						
							
								071a830e8b 
								
							 
						 
						
							
							
								
								THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations  
							
							
							
						 
						
							2017-02-03 03:55:06 -08:00  
				
					
						
							
							
								 
						
							
								dd6212e684 
								
							 
						 
						
							
							
								
								updated some level1 funcions, that are not thread save  
							
							
							
						 
						
							2017-01-10 14:05:07 +01:00  
				
					
						
							
							
								 
						
							
								84b8170bfb 
								
							 
						 
						
							
							
								
								Adding multi-threading for copy, dot, rot, and asum funcitons  
							
							
							
						 
						
							2017-01-10 11:48:58 +08:00  
				
					
						
							
							
								 
						
							
								ae4ac6f984 
								
							 
						 
						
							
							
								
								removed obj-files, that are moved to lapack 3.7.0  
							
							
							
						 
						
							2017-01-06 16:14:53 +01:00  
				
					
						
							
							
								 
						
							
								d346c533b1 
								
							 
						 
						
							
							
								
								Fix z/ctrmv stack allocation on AMD bulldozer and barcelona target  
							
							... 
							
							
							
							* Hopefully, because this was found by error and trial (dark magic)
* Ref #786  
							
						 
						
							2016-06-07 16:11:09 +02:00  
				
					
						
							
							
								 
						
							
								f04af36ad0 
								
							 
						 
						
							
							
								
								Merge pull request  #898  from wernsaar/develop  
							
							... 
							
							
							
							added experimental support for optimized lapack fortran functions 
							
						 
						
							2016-05-31 14:13:52 +02:00  
				
					
						
							
							
								 
						
							
								41000c8443 
								
							 
						 
						
							
							
								
								added directory for optimized lapack fortan codes and added dlaqr5.f  
							
							
							
						 
						
							2016-05-31 12:53:07 +02:00  
				
					
						
							
							
								 
						
							
								053044ae4d 
								
							 
						 
						
							
							
								
								Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR  
							
							... 
							
							
							
							If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project. 
							
						 
						
							2016-05-25 09:13:28 +02:00  
				
					
						
							
							
								 
						
							
								40af513669 
								
							 
						 
						
							
							
								
								Disable multi-threading in swap  
							
							... 
							
							
							
							* Close  #873  
							
						 
						
							2016-05-16 13:07:55 +00:00  
				
					
						
							
							
								 
						
							
								16ec5323c9 
								
							 
						 
						
							
							
								
								Fix zgemv.c compilation when stack allocation is disabled  
							
							
							
						 
						
							2016-02-08 12:05:02 +01:00  
				
					
						
							
							
								 
						
							
								5fc2203d8a 
								
							 
						 
						
							
							
								
								zgemv: Add a workaround for  #746  
							
							
							
						 
						
							2016-02-08 11:25:15 +01:00