8668571040 
								
							 
						 
						
							
							
								
								add openblas_getaffinity()  
							
							
							
						 
						
							2022-07-27 19:14:36 +02:00  
				
					
						
							
							
								 
						
							
								2e44ca0136 
								
							 
						 
						
							
							
								
								sbgemm: add missing cblas_sbgemm definition  
							
							
							
						 
						
							2021-08-30 17:40:30 +08:00  
				
					
						
							
							
								 
						
							
								25c986db5a 
								
							 
						 
						
							
							
								
								Add prototypes for CBLAS_CROTG and CBLAS_ZROTG  
							
							
							
						 
						
							2021-01-13 00:30:27 +01:00  
				
					
						
							
							
								 
						
							
								9ccb12b031 
								
							 
						 
						
							
							
								
								Add prototypes for cblas_csrot and cblas_zdrot  
							
							
							
						 
						
							2021-01-12 23:20:07 +01:00  
				
					
						
							
							
								 
						
							
								a7b1f9b1bb 
								
							 
						 
						
							
							
								
								Implementation of BF16 based gemv  
							
							... 
							
							
							
							1. Add a new API -- sbgemv to support bfloat16 based gemv
2. Implement a generic kernel for sbgemv
3. Implement an avx512-bf16 based kernel for sbgemv
Signed-off-by: Chen, Guobing <guobing.chen@intel.com> 
							
						 
						
							2020-10-29 02:08:23 +08:00  
				
					
						
							
							
								 
						
							
								b5d30b390d 
								
							 
						 
						
							
							
								
								Fix build issues with bfloat16  
							
							... 
							
							
							
							This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16. 
							
						 
						
							2020-10-13 11:00:22 -05:00  
				
					
						
							
							
								 
						
							
								ca31c32693 
								
							 
						 
						
							
							
								
								Rename "HALF" and "sh" to "BFLOAT16" and "sb"  
							
							
							
						 
						
							2020-10-11 23:49:22 +02:00  
				
					
						
							
							
								 
						
							
								deaeb6c5b8 
								
							 
						 
						
							
							
								
								Add bfloat16 based dot and conversion with single/double  
							
							... 
							
							
							
							1. Added bfloat16 based dot as new API: shdot
2. Implemented generic kernel and cooperlake-specific (AVX512-BF16) kernel for shdot
3. Added 4 conversion APIs for bfloat16 data type <=> single/double: shstobf16 shdtobf16 sbf16tos dbf16tod
     shstobf16 -- convert single float array to bfloat16 array
     shdtobf16 -- convert double float array to bfloat16 array
     sbf16tos  -- convert bfloat16 array to single float array
     dbf16tod  -- convert bfloat16 array to double float array
4. Implemented generic kernels for all 4 conversion APIs, and cooperlake-specific kernel for shstobf16 and shdtobf16
5. Update level1 thread facilitate functions and macros to support multi-threading for these new APIs
6. Fix Cooperlake platform detection/specify issue when under dynamic-arch building
7. Change the typedef of bfloat16 from unsigned short to more strict uint16_t
Signed-off-by: Chen, Guobing <guobing.chen@intel.com> 
							
						 
						
							2020-09-04 02:31:25 +08:00  
				
					
						
							
							
								 
						
							
								7b4773b24d 
								
							 
						 
						
							
							
								
								Add API to set thread affinity on Linux.  
							
							... 
							
							
							
							Issue: #2545  
							
						 
						
							2020-04-08 12:49:35 -07:00  
				
					
						
							
							
								 
						
							
								5c42287c4f 
								
							 
						 
						
							
							
								
								Add declarations for ?sum and cblas_?sum  
							
							
							
						 
						
							2019-03-30 21:58:03 +01:00  
				
					
						
							
							
								 
						
							
								4f9d3e4b28 
								
							 
						 
						
							
							
								
								Expose CBLAS interfaces for I?MIN and I?MAX  
							
							
							
						 
						
							2019-03-30 12:37:13 +01:00  
				
					
						
							
							
								 
						
							
								b57af93792 
								
							 
						 
						
							
							
								
								just make CBLAS_LAYOUT an alias of the existing CBLAS_ORDER  
							
							... 
							
							
							
							to avoid having to change all instances of enum CBLAS_ORDER in this file 
							
						 
						
							2018-09-06 16:54:31 +02:00  
				
					
						
							
							
								 
						
							
								8aeab0601e 
								
							 
						 
						
							
							
								
								Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT  
							
							... 
							
							
							
							fixes  #1754  
						
							2018-09-06 16:39:52 +02:00  
				
					
						
							
							
								 
						
							
								eb71d61c7c 
								
							 
						 
						
							
							
								
								Expose CBLAS interface to BLAS extensions iXamin  
							
							
							
						 
						
							2018-06-23 13:31:09 +02:00  
				
					
						
							
							
								 
						
							
								1c9f4422b5 
								
							 
						 
						
							
							
								
								Fix declaration of cblas_Xdotc_sub and cblas_Xdotu_sub  
							
							... 
							
							
							
							last parameter of cblas_(c,z)dotc_sub and cblas_(c,z)dotu_sub should be void* as well 
							
						 
						
							2017-11-18 18:56:30 +01:00  
				
					
						
							
							
								 
						
							
								66ac898f64 
								
							 
						 
						
							
							
								
								Change prototypes of all complex functions to use void*  
							
							... 
							
							
							
							Change prototypes of complex functions to use void pointers like the other implementations of CBLAS 
							
						 
						
							2017-11-05 15:42:33 +01:00  
				
					
						
							
							
								 
						
							
								39cc6b21d3 
								
							 
						 
						
							
							
								
								Add ATLAS-style ?geadd function  
							
							
							
						 
						
							2015-02-16 13:46:20 +01:00  
				
					
						
							
							
								 
						
							
								65a847cd36 
								
							 
						 
						
							
							
								
								Introduce openblas_get_num_threads and openblas_get_num_procs  
							
							
							
						 
						
							2015-02-03 12:23:41 -05:00  
				
					
						
							
							
								 
						
							
								f446d2368a 
								
							 
						 
						
							
							
								
								updated cblas.h and cblas_noconst.h  
							
							
							
						 
						
							2014-09-21 13:39:15 +02:00  
				
					
						
							
							
								 
						
							
								7f234f8ed1 
								
							 
						 
						
							
							
								
								added GEMM3M tests  
							
							
							
						 
						
							2014-09-21 10:55:08 +02:00  
				
					
						
							
							
								 
						
							
								a057e5434d 
								
							 
						 
						
							
							
								
								add CBLAS interface for s/d/c/zimatcopy  
							
							
							
						 
						
							2014-09-09 09:52:13 +02:00  
				
					
						
							
							
								 
						
							
								7794766d3c 
								
							 
						 
						
							
							
								
								Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.  
							
							
							
						 
						
							2014-09-08 17:57:44 +02:00  
				
					
						
							
							
								 
						
							
								552119c484 
								
							 
						 
						
							
							
								
								Fixed   #407 . Support outputing the CPU corename on runtime.  
							
							... 
							
							
							
							The user can use char * openblas_get_config() or char * openblas_get_corename(). 
							
						 
						
							2014-07-08 12:48:08 +08:00  
				
					
						
							
							
								 
						
							
								6c2ead30f0 
								
							 
						 
						
							
							
								
								Remove all trailing whitespace except lapack-netlib  
							
							... 
							
							
							
							Signed-off-by: Timothy Gu <timothygu99@gmail.com> 
							
						 
						
							2014-06-27 12:05:18 -07:00  
				
					
						
							
							
								 
						
							
								faf3ac0aad 
								
							 
						 
						
							
							
								
								Ref  #285 : added axpby kernels  
							
							
							
						 
						
							2014-06-08 11:54:24 +02:00  
				
					
						
							
							
								 
						
							
								212463dce9 
								
							 
						 
						
							
							
								
								get rid of the generated cblas_noconst.h file  
							
							
							
						 
						
							2013-08-28 16:52:24 +02:00  
				
					
						
							
							
								 
						
							
								c19a488af2 
								
							 
						 
						
							
							
								
								create openblas_get_parallel to retrieve information which  
							
							... 
							
							
							
							parallelization model is used by OpenBLAS. 
							
						 
						
							2013-07-11 21:39:19 +08:00  
				
					
						
							
							
								 
						
							
								875d520ccf 
								
							 
						 
						
							
							
								
								Refs  #193 . cblas: move #include out of extern "C" block.  
							
							... 
							
							
							
							Standard headers may contain C++ templates which are not permitted inside an
extern "C" block. This might be the case when we include <complex.h>. 
							
						 
						
							2013-01-31 08:48:27 +01:00  
				
					
						
							
							
								 
						
							
								4db6660de4 
								
							 
						 
						
							
							
								
								Refs  #185 . Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey!  
							
							... 
							
							
							
							The 'const' modifications were done automatically using this scripts:
https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas  
							
						 
						
							2013-01-20 22:52:51 +01:00  
				
					
						
							
							
								 
						
							
								200e4acf15 
								
							 
						 
						
							
							
								
								cblas: typedef enums for improved compatibility with Intel MKL.  
							
							... 
							
							
							
							Netlib style:
    enum CBLAS_XYZ {X=1, Y=2, Z=3};
Intel MKL style:
    typedef enum {X=1, Y=2, Z=3} CBLAS_XYZ;
With this hybrid style, code written in the latter form won't need any
modifications to be built with OpenBLAS.  This change should not affect existing
code, although a warning may be emitted for C code which does the following
(does not occur with C++):
    typedef enum CBLAS_XYZ CBLAS_XYZ;
    warning: redefinition of typedef 'CBLAS_XYZ' [-pedantic] 
							
						 
						
							2013-01-19 22:57:13 +01:00  
				
					
						
							
							
								 
						
							
								bdf8d9411e 
								
							 
						 
						
							
							
								
								Refs  #163 . Obtain the build configure on runtime.  
							
							... 
							
							
							
							openblas_get_config function returns the configure string.
So far, it supports USE64BITINT, NO_CBLAS, NO_LAPACK, NO_LAPACKE,
DYNAMIC_ARCH, NO_AFFINITY.
Example:
 #include <stdio.h>
extern char * openblas_get_config();
void main()
{
  printf("%s\n",openblas_get_config());
  return;
} 
							
						 
						
							2012-12-10 15:52:51 +08:00  
				
					
						
							
							
								 
						
							
								422359d09a 
								
							 
						 
						
							
							
								
								Export openblas_set_num_threads in shared library.  
							
							
							
						 
						
							2012-06-23 11:32:43 +08:00  
				
					
						
							
							
								 
						
							
								f93318a6c8 
								
							 
						 
						
							
							
								
								Refs  #95  cblas: compatibility for compilers without C99 complex number support (e.g. Visual Studio)  
							
							
							
						 
						
							2012-04-24 11:59:58 +08:00  
				
					
						
							
							
								 
						
							
								0a45e5495f 
								
							 
						 
						
							
							
								
								Fixed   #21 . Added extern C to support C++. Thank Tasio for the patch.  
							
							
							
						 
						
							2011-04-20 13:41:38 +08:00  
				
					
						
							
							
								 
						
							
								552f31dbbd 
								
							 
						 
						
							
							
								
								Fixed   #13 . Fixed blasint undefined bug in <cblas.h> file.  
							
							
							
						 
						
							2011-04-07 14:48:10 +08:00  
				
					
						
							
							
								 
						
							
								342bbc3871 
								
							 
						 
						
							
							
								
								Import GotoBLAS2 1.13 BSD version codes.  
							
							
							
						 
						
							2011-01-24 14:54:24 +00:00