OpenBLAS

Commit Graph

Author	SHA1	Message	Date
Chen, Guobing	a7b1f9b1bb	Implementation of BF16 based gemv 1. Add a new API -- sbgemv to support bfloat16 based gemv 2. Implement a generic kernel for sbgemv 3. Implement an avx512-bf16 based kernel for sbgemv Signed-off-by: Chen, Guobing <guobing.chen@intel.com>	2020-10-29 02:08:23 +08:00
Rajalakshmi Srinivasaraghavan	b5d30b390d	Fix build issues with bfloat16 This patch fixes compilation errors due to recent renaming from SH to SB with BUILD_BFLOAT16.	2020-10-13 11:00:22 -05:00
Martin Kroeker	ca31c32693	Rename "HALF" and "sh" to "BFLOAT16" and "sb"	2020-10-11 23:49:22 +02:00
Chen, Guobing	deaeb6c5b8	Add bfloat16 based dot and conversion with single/double 1. Added bfloat16 based dot as new API: shdot 2. Implemented generic kernel and cooperlake-specific (AVX512-BF16) kernel for shdot 3. Added 4 conversion APIs for bfloat16 data type <=> single/double: shstobf16 shdtobf16 sbf16tos dbf16tod shstobf16 -- convert single float array to bfloat16 array shdtobf16 -- convert double float array to bfloat16 array sbf16tos -- convert bfloat16 array to single float array dbf16tod -- convert bfloat16 array to double float array 4. Implemented generic kernels for all 4 conversion APIs, and cooperlake-specific kernel for shstobf16 and shdtobf16 5. Update level1 thread facilitate functions and macros to support multi-threading for these new APIs 6. Fix Cooperlake platform detection/specify issue when under dynamic-arch building 7. Change the typedef of bfloat16 from unsigned short to more strict uint16_t Signed-off-by: Chen, Guobing <guobing.chen@intel.com>	2020-09-04 02:31:25 +08:00
Sharvil Nanavati	7b4773b24d	Add API to set thread affinity on Linux. Issue: #2545	2020-04-08 12:49:35 -07:00
Martin Kroeker	5c42287c4f	Add declarations for ?sum and cblas_?sum	2019-03-30 21:58:03 +01:00
Martin Kroeker	4f9d3e4b28	Expose CBLAS interfaces for I?MIN and I?MAX	2019-03-30 12:37:13 +01:00
Martin Kroeker	b57af93792	just make CBLAS_LAYOUT an alias of the existing CBLAS_ORDER to avoid having to change all instances of enum CBLAS_ORDER in this file	2018-09-06 16:54:31 +02:00
Martin Kroeker	8aeab0601e	Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT fixes #1754	2018-09-06 16:39:52 +02:00
Martin Kroeker	eb71d61c7c	Expose CBLAS interface to BLAS extensions iXamin	2018-06-23 13:31:09 +02:00
Martin Kroeker	1c9f4422b5	Fix declaration of cblas_Xdotc_sub and cblas_Xdotu_sub last parameter of cblas_(c,z)dotc_sub and cblas_(c,z)dotu_sub should be void* as well	2017-11-18 18:56:30 +01:00
Martin Kroeker	66ac898f64	Change prototypes of all complex functions to use void* Change prototypes of complex functions to use void pointers like the other implementations of CBLAS	2017-11-05 15:42:33 +01:00
Martin Koehler	39cc6b21d3	Add ATLAS-style ?geadd function	2015-02-16 13:46:20 +01:00
Erik Schnetter	65a847cd36	Introduce openblas_get_num_threads and openblas_get_num_procs	2015-02-03 12:23:41 -05:00
wernsaar	f446d2368a	updated cblas.h and cblas_noconst.h	2014-09-21 13:39:15 +02:00
wernsaar	7f234f8ed1	added GEMM3M tests	2014-09-21 10:55:08 +02:00
Martin Koehler	a057e5434d	add CBLAS interface for s/d/c/zimatcopy	2014-09-09 09:52:13 +02:00
Martin Köhler	7794766d3c	Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.	2014-09-08 17:57:44 +02:00
Zhang Xianyi	552119c484	Fixed #407 . Support outputing the CPU corename on runtime. The user can use char * openblas_get_config() or char * openblas_get_corename().	2014-07-08 12:48:08 +08:00
Timothy Gu	6c2ead30f0	Remove all trailing whitespace except lapack-netlib Signed-off-by: Timothy Gu <timothygu99@gmail.com>	2014-06-27 12:05:18 -07:00
wernsaar	faf3ac0aad	Ref #285 : added axpby kernels	2014-06-08 11:54:24 +02:00
Lars Buitinck	212463dce9	get rid of the generated cblas_noconst.h file	2013-08-28 16:52:24 +02:00
grisuthedragon	c19a488af2	create openblas_get_parallel to retrieve information which parallelization model is used by OpenBLAS.	2013-07-11 21:39:19 +08:00
Zaheer Chothia	875d520ccf	Refs #193 . cblas: move #include out of extern "C" block. Standard headers may contain C++ templates which are not permitted inside an extern "C" block. This might be the case when we include <complex.h>.	2013-01-31 08:48:27 +01:00
Zaheer Chothia	4db6660de4	Refs #185 . Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey! The 'const' modifications were done automatically using this scripts: https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas	2013-01-20 22:52:51 +01:00
Zaheer Chothia	200e4acf15	cblas: typedef enums for improved compatibility with Intel MKL. Netlib style: enum CBLAS_XYZ {X=1, Y=2, Z=3}; Intel MKL style: typedef enum {X=1, Y=2, Z=3} CBLAS_XYZ; With this hybrid style, code written in the latter form won't need any modifications to be built with OpenBLAS. This change should not affect existing code, although a warning may be emitted for C code which does the following (does not occur with C++): typedef enum CBLAS_XYZ CBLAS_XYZ; warning: redefinition of typedef 'CBLAS_XYZ' [-pedantic]	2013-01-19 22:57:13 +01:00
Zhang Xianyi	bdf8d9411e	Refs #163 . Obtain the build configure on runtime. openblas_get_config function returns the configure string. So far, it supports USE64BITINT, NO_CBLAS, NO_LAPACK, NO_LAPACKE, DYNAMIC_ARCH, NO_AFFINITY. Example: #include <stdio.h> extern char * openblas_get_config(); void main() { printf("%s\n",openblas_get_config()); return; }	2012-12-10 15:52:51 +08:00
Zhang Xianyi	422359d09a	Export openblas_set_num_threads in shared library.	2012-06-23 11:32:43 +08:00
Zaheer Chothia	f93318a6c8	Refs #95 cblas: compatibility for compilers without C99 complex number support (e.g. Visual Studio)	2012-04-24 11:59:58 +08:00
Xianyi Zhang	0a45e5495f	Fixed #21 . Added extern C to support C++. Thank Tasio for the patch.	2011-04-20 13:41:38 +08:00
Xianyi Zhang	552f31dbbd	Fixed #13 . Fixed blasint undefined bug in <cblas.h> file.	2011-04-07 14:48:10 +08:00
Xianyi Zhang	342bbc3871	Import GotoBLAS2 1.13 BSD version codes.	2011-01-24 14:54:24 +00:00

32 Commits