8da6b6ae52 
								
							 
						 
						
							
							
								
								Allow building on OpenBSD  
							
							... 
							
							
							
							With this change, OpenBLAS builds and all tests pass on OpenBSD 6.2
using Clang. Tested on x86-64 only, with and without DYNAMIC_ARCH=1. 
							
						 
						
							2018-04-02 10:48:22 -07:00  
				
					
						
							
							
								 
						
							
								01c4b82f04 
								
							 
						 
						
							
							
								
								Update memory.c  
							
							
							
						 
						
							2018-03-31 22:32:06 +02:00  
				
					
						
							
							
								 
						
							
								93db123f7e 
								
							 
						 
						
							
							
								
								Update memory.c  
							
							
							
						 
						
							2018-03-29 13:13:49 +02:00  
				
					
						
							
							
								 
						
							
								752fdb5dd8 
								
							 
						 
						
							
							
								
								Add workaround for old gcc and clang versions  
							
							... 
							
							
							
							Old gcc and clang do not handle constructor arguments, finally fix  #875  as discussed there, using the fedora patch 
							
						 
						
							2018-03-29 11:56:56 +02:00  
				
					
						
							
							
								 
						
							
								7646974227 
								
							 
						 
						
							
							
								
								Limit the additional locking from PRs 1052,1299 to non-OpenMP multithreading  
							
							
							
						 
						
							2018-02-21 11:45:33 +01:00  
				
					
						
							
							
								 
						
							
								8866e393a2 
								
							 
						 
						
							
							
								
								Revert "Add locks only for non-OPENMP multithreading"  
							
							
							
						 
						
							2018-02-20 17:17:12 +01:00  
				
					
						
							
							
								 
						
							
								3119b2ab4c 
								
							 
						 
						
							
							
								
								Add locks only for non-OPENMP multithreading  
							
							... 
							
							
							
							to migitate performance problems caused by #1052  and #1299  as seen in #1461  
							
						 
						
							2018-02-20 12:17:18 +01:00  
				
					
						
							
							
								 
						
							
								8f5f614615 
								
							 
						 
						
							
							
								
								On Cygwin use mmap instead of Windows native allocation functions, which are not fork-safe.  
							
							
							
						 
						
							2018-02-06 12:23:27 +01:00  
				
					
						
							
							
								 
						
							
								f5fc109fbd 
								
							 
						 
						
							
							
								
								Perform blas_thread_shutdown with pthread_atfork() on Cygwin  
							
							... 
							
							
							
							Even if we're directly using the win32 threading driver and not pthreads,
pthread_atfork still works fine to register a pre-fork handler, and is
necessary to restore the threading server to a pre-initialized state. 
							
						 
						
							2018-02-06 12:23:27 +01:00  
				
					
						
							
							
								 
						
							
								e5752ff9b3 
								
							 
						 
						
							
							
								
								take out unused variables  
							
							
							
						 
						
							2018-01-20 11:42:31 +01:00  
				
					
						
							
							
								 
						
							
								bfc2a88594 
								
							 
						 
						
							
							
								
								remove unused buffer  
							
							
							
						 
						
							2017-12-22 00:55:40 +01:00  
				
					
						
							
							
								 
						
							
								ba1f91f17b 
								
							 
						 
						
							
							
								
								Convert another caller of "allocation" to LOCK_COMMAND  
							
							... 
							
							
							
							... as the "allocation" code jumped to now does UNLOCK_COMMAND instead of blas_unlock 
							
						 
						
							2017-09-09 20:30:33 +02:00  
				
					
						
							
							
								 
						
							
								e882f3d6f3 
								
							 
						 
						
							
							
								
								Fix thread data race in memory.c  
							
							
							
						 
						
							2017-09-09 18:58:38 +02:00  
				
					
						
							
							
								 
						
							
								63cfa32691 
								
							 
						 
						
							
							
								
								Rework __GLIBC_PREREQ checks to avoid breaking non-glibc builds  
							
							
							
						 
						
							2017-07-31 21:02:43 +02:00  
				
					
						
							
							
								 
						
							
								c4af196a2d 
								
							 
						 
						
							
							
								
								Honor cgroup/cpuset limits when enumerating cpus  
							
							
							
						 
						
							2017-07-25 22:47:34 +02:00  
				
					
						
							
							
								 
						
							
								480e697681 
								
							 
						 
						
							
							
								
								Revert "Honor cgroup/cpuset limits when enumerating cpus" ( #1246 )  
							
							
							
						 
						
							2017-07-24 16:17:50 +02:00  
				
					
						
							
							
								 
						
							
								731c518cff 
								
							 
						 
						
							
							
								
								Add files via upload  
							
							
							
						 
						
							2017-07-11 18:42:39 +02:00  
				
					
						
							
							
								 
						
							
								29fc429d9a 
								
							 
						 
						
							
							
								
								Honor cgroup/cpuset constraints when enumerating cpus  
							
							
							
						 
						
							2017-07-11 18:27:33 +02:00  
				
					
						
							
							
								 
						
							
								65e56cb29d 
								
							 
						 
						
							
							
								
								Add 64bit support for Microsoft Visual Studio  
							
							
							
						 
						
							2017-06-21 13:38:22 -07:00  
				
					
						
							
							
								 
						
							
								59c97cfee4 
								
							 
						 
						
							
							
								
								memory: Fix buffer overflow when position == NUM_BUFFERS  
							
							
							
						 
						
							2017-05-05 17:47:03 +01:00  
				
					
						
							
							
								 
						
							
								5fecfe0f42 
								
							 
						 
						
							
							
								
								memory: switch loop condition around in blas_memory_free  
							
							... 
							
							
							
							Before this commit, the "position < NUM_BUFFERS" loop condition from
blas_memory_free will be completely optimized away by GCC. This is
because the condition can only be false after undefined behavior has
already been invoked (reading past the end of an array). As a
consequence of this bug, GCC also removes the subsequent if statement
and all the code after the error label because all of it is dead.
This commit switches the loop condition around so it works as intended. 
							
						 
						
							2017-05-05 16:01:58 +01:00  
				
					
						
							
							
								 
						
							
								4713e7c47f 
								
							 
						 
						
							
							
								
								ARM64: Add the VULCAN Target  
							
							
							
						 
						
							2017-01-10 15:01:17 +05:30  
				
					
						
							
							
								 
						
							
								51aa157e64 
								
							 
						 
						
							
							
								
								Relocate declaration of alloc_lock outside ifdef block  
							
							
							
						 
						
							2017-01-09 01:10:43 +01:00  
				
					
						
							
							
								 
						
							
								87c7d10b34 
								
							 
						 
						
							
							
								
								Fix thread data races detected by helgrind 3.12  
							
							... 
							
							
							
							Ref. #995 , may possibly help solve issues seen in 660,883 
							
						 
						
							2017-01-08 23:33:51 +01:00  
				
					
						
							
							
								 
						
							
								66c9a9b33d 
								
							 
						 
						
							
							
								
								Merge pull request  #981  from howard0su/develop  
							
							... 
							
							
							
							USE NPROCESSOR_CONF instaed of NPORCESSOR_ONLN 
							
						 
						
							2016-10-17 11:32:57 +08:00  
				
					
						
							
							
								 
						
							
								ff1da01476 
								
							 
						 
						
							
							
								
								USE NPROCESSOR_CONF instaed of NPORCESSOR_ONLN  
							
							... 
							
							
							
							to determine the number of CPU. In ARM platform,
online CPU will increasing when there is more workload.
while configure cpu is the max number of CPU. 
							
						 
						
							2016-10-13 12:37:50 +00:00  
				
					
						
							
							
								 
						
							
								ef52a9266b 
								
							 
						 
						
							
							
								
								Fixed   #979 . Patch for NetBSD.  
							
							
							
						 
						
							2016-10-13 10:17:07 +08:00  
				
					
						
							
							
								 
						
							
								aa744dfa59 
								
							 
						 
						
							
							
								
								Update memory.c  
							
							
							
						 
						
							2016-03-22 20:02:37 +08:00  
				
					
						
							
							
								 
						
							
								61cf8f74d9 
								
							 
						 
						
							
							
								
								Fix access violation on Windows while static linking  
							
							
							
						 
						
							2016-03-22 19:14:54 +08:00  
				
					
						
							
							
								 
						
							
								05196a8497 
								
							 
						 
						
							
							
								
								Refs  #716 . Only call getenv at init function.  
							
							
							
						 
						
							2016-03-09 12:50:07 -05:00  
				
					
						
							
							
								 
						
							
								6b85dbb6dc 
								
							 
						 
						
							
							
								
								Refs  #696 . Turn off stack limit setting on Linux.  
							
							... 
							
							
							
							I cannot reproduce SEGFAULT of lapack-test with default stack size
on ARM Linux. 
							
						 
						
							2016-02-24 14:21:42 -05:00  
				
					
						
							
							
								 
						
							
								f5df444ceb 
								
							 
						 
						
							
							
								
								Merge pull request  #762  from jeromerobert/bug760  
							
							... 
							
							
							
							Let openblas_get_num_threads return the number of active threads 
							
						 
						
							2016-01-26 08:45:16 -06:00  
				
					
						
							
							
								 
						
							
								aaa8551c57 
								
							 
						 
						
							
							
								
								Merge pull request  #749  from lotheac/illumos_fixes  
							
							... 
							
							
							
							illumos fixes 
							
						 
						
							2016-01-26 08:42:20 -06:00  
				
					
						
							
							
								 
						
							
								0d87c1ffb6 
								
							 
						 
						
							
							
								
								Let openblas_get_num_threads return the number of active threads  
							
							... 
							
							
							
							... not the number of allocated threads.
Close  #760  
							
						 
						
							2016-01-26 13:04:16 +01:00  
				
					
						
							
							
								 
						
							
								97cd4b8aee 
								
							 
						 
						
							
							
								
								illumos fixes to memory.c  
							
							
							
						 
						
							2016-01-22 18:55:43 +02:00  
				
					
						
							
							
								 
						
							
								0d22551a6b 
								
							 
						 
						
							
							
								
								increase the stack size limit in the constructor  
							
							
							
						 
						
							2015-11-20 09:23:01 +01:00  
				
					
						
							
							
								 
						
							
								fbc21266e6 
								
							 
						 
						
							
							
								
								Minor C code fixes in driver/  
							
							
							
						 
						
							2015-11-09 14:15:49 +05:30  
				
					
						
							
							
								 
						
							
								2feef49fa8 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into cmake  
							
							... 
							
							
							
							Conflicts:
	driver/others/memory.c 
							
						 
						
							2015-10-26 14:54:34 -05:00  
				
					
						
							
							
								 
						
							
								1ce054fcb3 
								
							 
						 
						
							
							
								
								Refs  #669 . Fixed the build bug with gcc on Mac OS X.  
							
							
							
						 
						
							2015-10-22 11:07:35 -05:00  
				
					
						
							
							
								 
						
							
								94b125255f 
								
							 
						 
						
							
							
								
								Merge branch 'develop' into cmake  
							
							... 
							
							
							
							Conflicts:
	driver/others/memory.c 
							
						 
						
							2015-10-13 04:46:08 +08:00  
				
					
						
							
							
								 
						
							
								11ac4665c8 
								
							 
						 
						
							
							
								
								Fixed   #654 . Make sure the gotoblas_init function is run before all other static initializations.  
							
							
							
						 
						
							2015-10-05 14:14:32 -05:00  
				
					
						
							
							
								 
						
							
								d3e2f0a1af 
								
							 
						 
						
							
							
								
								add missing barriers  
							
							... 
							
							
							
							should fix issue #597  
							
						 
						
							2015-08-16 15:37:02 +02:00  
				
					
						
							
							
								 
						
							
								dcd5ba4443 
								
							 
						 
						
							
							
								
								Merge branch 'cmake' of  https://github.com/hpanderson/OpenBLAS  into hpanderson_cmake  
							
							
							
						 
						
							2015-07-22 04:06:39 +08:00  
				
					
						
							
							
								 
						
							
								a11555c715 
								
							 
						 
						
							
							
								
								Support Android NDK armeabi-v7a-hard ABI. (-mfloat-abi=hard)  
							
							... 
							
							
							
							e.g.
make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7
In Android NDK, it uses armeabi-v7a-hard ABI.
TARGET_CFLAGS += -mhard-float -D_NDK_MATH_NO_SOFTFP=1
TARGET_LDFLAGS += -Wl,--no-warn-mismatch -lm_hard
For more information, please check hard-float example at
android_ndk/tests/device/hard-float/jni/. 
							
						 
						
							2015-05-20 21:57:27 -05:00  
				
					
						
							
							
								 
						
							
								ebb9eba987 
								
							 
						 
						
							
							
								
								Fix build with ALLOC_SHM=0 (Android NDK)  
							
							... 
							
							
							
							Refactor such that you can build with ALLOC_SHM=0. HughTLB
implicity depends on ALLOC_SHM=1. This patch allows
building for Android NDK r10d. 
							
						 
						
							2015-05-10 00:10:26 -07:00  
				
					
						
							
							
								 
						
							
								9798481979 
								
							 
						 
						
							
							
								
								Refs  #478 ,  #482 . Fix segfault bug for gemv_t with MAX_ALLOC_STACK flag.  
							
							... 
							
							
							
							For gemv_t, directly use malloc to create the buffer. 
							
						 
						
							2015-04-13 19:45:27 -05:00  
				
					
						
							
							
								 
						
							
								b6438dedea 
								
							 
						 
						
							
							
								
								Fix issue  #508  
							
							... 
							
							
							
							Fix race condition during shutdown causing a crash in
gotoblas_set_affinity(). 
							
						 
						
							2015-03-18 13:22:43 +01:00  
				
					
						
							
							
								 
						
							
								5ae8993752 
								
							 
						 
						
							
							
								
								Added intrinsics for MSVC.  
							
							
							
						 
						
							2015-02-25 11:52:51 -06:00  
				
					
						
							
							
								 
						
							
								84d90d6ed8 
								
							 
						 
						
							
							
								
								Fixed some compiler errors/warnings for clang.  
							
							
							
						 
						
							2015-02-25 11:52:25 -06:00  
				
					
						
							
							
								 
						
							
								cfa9392ffa 
								
							 
						 
						
							
							
								
								Fix openblas_get_num_threads and openblas_get_num_procs bug with single thread.  
							
							
							
						 
						
							2015-02-08 01:30:23 -06:00  
				
					
						
							
							
								 
						
							
								65a847cd36 
								
							 
						 
						
							
							
								
								Introduce openblas_get_num_threads and openblas_get_num_procs  
							
							
							
						 
						
							2015-02-03 12:23:41 -05:00  
				
					
						
							
							
								 
						
							
								2fb02626da 
								
							 
						 
						
							
							
								
								Update organization info.  
							
							
							
						 
						
							2014-11-25 15:28:58 +08:00  
				
					
						
							
							
								 
						
							
								f7eb81a846 
								
							 
						 
						
							
							
								
								Fix link error on Linux/musl.  
							
							... 
							
							
							
							get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.) 
							
						 
						
							2014-08-03 15:06:30 -07:00  
				
					
						
							
							
								 
						
							
								7a8949e0ce 
								
							 
						 
						
							
							
								
								Merge branch 'develop' of  https://github.com/TimothyGu/OpenBLAS  into TimothyGu-develop  
							
							... 
							
							
							
							Conflicts:
	driver/others/memory.c 
							
						 
						
							2014-06-28 20:51:31 +08:00  
				
					
						
							
							
								 
						
							
								6c2ead30f0 
								
							 
						 
						
							
							
								
								Remove all trailing whitespace except lapack-netlib  
							
							... 
							
							
							
							Signed-off-by: Timothy Gu <timothygu99@gmail.com> 
							
						 
						
							2014-06-27 12:05:18 -07:00  
				
					
						
							
							
								 
						
							
								f41f03ab83 
								
							 
						 
						
							
							
								
								fix   #394 . this cleans up some handles after using them, and doesn't disable ALL process privileges upon success  
							
							
							
						 
						
							2014-06-27 12:16:57 -04:00  
				
					
						
							
							
								 
						
							
								2c556f093a 
								
							 
						 
						
							
							
								
								Add cast to function pointer to remove warning  
							
							
							
						 
						
							2014-02-25 11:08:32 +01:00  
				
					
						
							
							
								 
						
							
								3b027d2528 
								
							 
						 
						
							
							
								
								Do not reference pthread_atfork in non-SMP_SERVER mode  
							
							
							
						 
						
							2014-02-25 11:08:32 +01:00  
				
					
						
							
							
								 
						
							
								49bd98f410 
								
							 
						 
						
							
							
								
								Do not reference pthread_atfork under windows  
							
							
							
						 
						
							2014-02-19 19:25:48 +01:00  
				
					
						
							
							
								 
						
							
								138a841390 
								
							 
						 
						
							
							
								
								FIX   #294 : make OpenBLAS thread-pool resilient to fork via pthread_atfork  
							
							
							
						 
						
							2014-02-19 19:01:15 +01:00  
				
					
						
							
							
								 
						
							
								046e4013cb 
								
							 
						 
						
							
							
								
								Revert "Refs  #294 . Used pthread_atfork to avoid hang after a Unix fork."  
							
							... 
							
							
							
							This reverts commit 3617c22a56 
							
						 
						
							2014-02-19 18:32:54 +01:00  
				
					
						
							
							
								 
						
							
								3617c22a56 
								
							 
						 
						
							
							
								
								Refs  #294 . Used pthread_atfork to avoid hang after a Unix fork.  
							
							... 
							
							
							
							The problem is the mutex we used in blas_server. Thus, we must clear
the mutex before the fork and re-init them at parent and child process.
If you used OpenMP, GOMP has the same problem by now. Please try other OpenMP
implemantation. 
							
						 
						
							2014-02-18 15:36:04 +08:00  
				
					
						
							
							
								 
						
							
								5048a80032 
								
							 
						 
						
							
							
								
								Refs  #283 . Fixed the incorrect usage of long data type for Windows 64.  
							
							
							
						 
						
							2013-11-14 13:46:42 +08:00  
				
					
						
							
							
								 
						
							
								f54f5bac9e 
								
							 
						 
						
							
							
								
								Refs  #248 . Fixed the LSB compatiable issue for BLAS only.  
							
							... 
							
							
							
							For example, make CC=lsbcc NO_LAPACK=1. 
							
						 
						
							2013-07-09 15:38:03 +08:00  
				
					
						
							
							
								 
						
							
								5d3312142a 
								
							 
						 
						
							
							
								
								Refs  #221   #246 . Fixed the overflowing stack bug in mutlithreading BLAS3.  
							
							... 
							
							
							
							When NUM_THREADS(MAX_CPU_NUNBERS) is very large ,e.g. 256.
typedef struct {
  volatile BLASLONG working[MAX_CPU_NUMBER][CACHE_LINE_SIZE * DIVIDE_RATE];
} job_t;
job_t          job[MAX_CPU_NUMBER];
The job array is equal 8MB.
Thus, We use malloc instead of stack allocation. 
							
						 
						
							2013-07-08 01:07:05 +08:00  
				
					
						
							
							
								 
						
							
								32dbeb636d 
								
							 
						 
						
							
							
								
								Refs  #221 . Set stack limit to 16MB to prevent a SEGFAULT bug on Mac OS X with DYNAMIC_ARCH=1 & NUM_THREADS=256.  
							
							
							
						 
						
							2013-07-02 14:17:55 +08:00  
				
					
						
							
							
								 
						
							
								6751f7b9a7 
								
							 
						 
						
							
							
								
								Fixed   #157 . Only detect the number of physical CPU cores on Mac OSX.  
							
							
							
						 
						
							2012-11-13 15:48:57 +08:00  
				
					
						
							
							
								 
						
							
								801383effe 
								
							 
						 
						
							
							
								
								Fixed a hang bug when shutdown blas threads server on Windows. Added the feature about dynamic changing the number of threads on Windows.  
							
							
							
						 
						
							2012-08-14 18:34:32 +08:00  
				
					
						
							
							
								 
						
							
								a55821a2ec 
								
							 
						 
						
							
							
								
								Refs  #132 . Kill the threads when unload the library.  
							
							
							
						 
						
							2012-08-11 21:33:15 +08:00  
				
					
						
							
							
								 
						
							
								3692b4d631 
								
							 
						 
						
							
							
								
								Improve the detection of sparc  
							
							
							
						 
						
							2012-07-02 02:51:38 +02:00  
				
					
						
							
							
								 
						
							
								a431042475 
								
							 
						 
						
							
							
								
								Fix inconsistent case for OS_* macros (Refs pull request  #111 )  
							
							
							
						 
						
							2012-05-23 00:01:14 +02:00  
				
					
						
							
							
								 
						
							
								4e29b6ffc0 
								
							 
						 
						
							
							
								
								FreeBSD: fix OS_FreeBSD -> OS_FREEBSD typos  
							
							
							
						 
						
							2012-05-21 16:57:19 -04:00  
				
					
						
							
							
								 
						
							
								0b89a7a92d 
								
							 
						 
						
							
							
								
								Ref  #82 . Disable outputing debug information in alloc_mmap.  
							
							
							
						 
						
							2012-03-23 18:17:12 +08:00  
				
					
						
							
							
								 
						
							
								ef6f7f32ae 
								
							 
						 
						
							
							
								
								Fixed mbind bug on Loongson 3B. Check the return value of my_mbind function.  
							
							
							
						 
						
							2011-11-23 17:17:41 +00:00  
				
					
						
							
							
								 
						
							
								4727fe8abf 
								
							 
						 
						
							
							
								
								Refs  #47 . On Loongson 3A, set DGEMM_R parameter depending on different number of threads. It would improve double precision BLAS3 on multi-threads.  
							
							
							
						 
						
							2011-09-05 15:13:52 +00:00  
				
					
						
							
							
								 
						
							
								f7a5e049e2 
								
							 
						 
						
							
							
								
								Enable Debug flags in memory alloc and init functions.  
							
							
							
						 
						
							2011-02-26 11:51:39 +08:00  
				
					
						
							
							
								 
						
							
								128418f49b 
								
							 
						 
						
							
							
								
								Fixed   #10 . Supported GOTO_NUM_THREADS & GOTO_THREADS_TIMEOUT environment variables.  
							
							
							
						 
						
							2011-02-24 16:32:13 +08:00  
				
					
						
							
							
								 
						
							
								e6c13e2b3c 
								
							 
						 
						
							
							
								
								changed library name to openblas and modified environment variable.  
							
							
							
						 
						
							2011-01-24 17:58:05 +00:00  
				
					
						
							
							
								 
						
							
								342bbc3871 
								
							 
						 
						
							
							
								
								Import GotoBLAS2 1.13 BSD version codes.  
							
							
							
						 
						
							2011-01-24 14:54:24 +00:00