Commit Graph

2557 Commits

Author SHA1 Message Date
Abdelrauf afd514c25d small fix inside ifdef z13mvc . (z13mvc code is not used in production) 2018-01-31 18:30:59 -05:00
the mslm f946a89432 zscal (case: real alpha=0 ) mikrokernel shift&mem fix , da_i as input reg. small typo fixes 2018-01-26 19:25:27 -08:00
the mslm 2619ad7ea5 Blas1 mikrokernels can be inlined by gcc. Refactoring ( symbolic operan
names). Some fixes and tunings
2018-01-19 19:24:35 -08:00
Abdelrauf 87669d1c0a small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels can be inlined 2018-01-10 20:36:53 -05:00
Martin Kroeker def146efed Merge pull request #1308 from sebastien-villemot/develop
Add support for TARGET=ZARCH_GENERIC and TARGET=Z13
2017-09-19 14:04:37 +02:00
Sébastien Villemot 7543e578a4 Add support for TARGET=ZARCH_GENERIC and TARGET=Z13 2017-09-19 12:16:42 +02:00
Martin Kroeker 601c71fe54 Merge pull request #1304 from martin-frbg/aix-build-fixes
(Plain make) build system fixes for AIX
2017-09-18 10:16:40 +02:00
Martin Kroeker 3810a6fd99 (Plain make) build system fixes for AIX
- retry fortran compiler test with aix-specific option if generic -m32/-m64 fails
- pass any custom ARFLAGS to lapack
- no addition of -m32/-m64 to the CFLAGS and FFLAGS on AIX
2017-09-18 01:29:21 +02:00
Martin Kroeker 742f54c235 Merge pull request #1303 from martin-frbg/imatcopy-rowscols
Fix cols/rows mixup in omatcopy 2nd step for BlasTrans cases
2017-09-14 21:46:26 +02:00
Martin Kroeker d674fbb4c7 Fix cols/rows mixup in omatcopy 2nd step for BlasTrans cases
Equivalent of #1244 (issue #899) for the non-complex cases. Fixes #1289
2017-09-14 19:59:05 +02:00
Martin Kroeker 2922c15f36 Merge pull request #1302 from martin-frbg/nofortran-fix
Remove default FEXTRALIBS in NOFORTRAN case
2017-09-14 11:54:20 +02:00
Martin Kroeker 3a245a376f Remove default FEXTRALIBS in NOFORTRAN case 2017-09-14 09:21:04 +02:00
Martin Kroeker 46c9357c72 Merge pull request #1288 from quickwritereader/develop
Optimized standard Blas Level-1,2 (excluding nrm2 functions) for z13 (double precision). Issue 884
2017-09-09 23:47:17 +02:00
Martin Kroeker 1c3e2d3dd5 Merge pull request #1293 from embray/cygwin/install
More canonical installation on Cygwin
2017-09-09 23:46:27 +02:00
Martin Kroeker f66d908282 Merge pull request #1299 from martin-frbg/race_fixes
Fix thread data races uncovered by gcc thread sanitizer
2017-09-09 23:41:53 +02:00
Martin Kroeker ba1f91f17b Convert another caller of "allocation" to LOCK_COMMAND
... as the "allocation" code jumped to now does UNLOCK_COMMAND instead of blas_unlock
2017-09-09 20:30:33 +02:00
Martin Kroeker f460776f0f Fix thread data races 2017-09-09 19:07:06 +02:00
Martin Kroeker e882f3d6f3 Fix thread data race in memory.c 2017-09-09 18:58:38 +02:00
Erik M. Bray dddedbab5d More canonical installation on Cygwin:
* The DLL is named cygopenblas.dll, not libopenblas.dll
* The import lib (still called libopenblas.dll.a) is installed
2017-09-07 14:18:56 +02:00
Abdurrauf 1cfdb2295d Optimized standard Blas Level-1,2 (excluding nrm2 functions) for z13 (double precision) 2017-09-06 16:41:08 +04:00
Martin Kroeker 00740c0e34 Merge pull request #1290 from martin-frbg/imatcopy
Use in-place transform shortcut only if matrix is square
2017-09-03 13:02:10 +02:00
Martin Kroeker 254db9bd7c Use in-place transform shortcut only if matrix is square 2017-09-03 09:52:55 +02:00
Martin Kroeker f2074f9ac1 Merge pull request #1286 from martin-frbg/baytrail
Fix coretype detection for Bay Trail Atom
2017-08-27 13:23:57 +02:00
Martin Kroeker aece65ea29 Fix coretype detection for Bay Trail Atom
My earlier PR #982 appears to have been incomplete in this regard - fixes #1285
2017-08-27 13:06:54 +02:00
Martin Kroeker 50715e8945 Merge pull request #1281 from sharkcz/armv8
fix detection of generic ARMv8 CPUs
2017-08-19 20:37:19 +02:00
Dan Horák 1763e01567 fix detection of generic ARMv8 CPUs 2017-08-18 14:53:29 +02:00
Martin Kroeker a8a342ccc4 Merge pull request #1277 from cconrads-scicomp/fix-installation-instructions
Make: fix installation instructions
2017-08-10 23:42:23 +02:00
Martin Kroeker 9e9a9553db Merge pull request #1276 from cconrads-scicomp/android_-lm_fix
ARM: do not add linker flag `-lm` unconditionally
2017-08-10 21:35:32 +02:00
Martin Kroeker be7c1b6324 Merge pull request #1275 from cconrads-scicomp/recognize-gfortran-on-arm
ARM: recognize gfortran pre-releases
2017-08-10 21:32:09 +02:00
Christoph Conrads be8e8228ae Make: show installation instructions after build 2017-08-10 14:22:26 -04:00
Christoph Conrads 77b197bc93 Make: fix installation instructions
The installation instructions as shown after successfully compiling
OpenBLAS are wrong because this arguments used during compilation have
to be provided to Make again.
2017-08-10 12:47:18 -04:00
Christoph Conrads 9cc6599056 ARM: do not add linker flag `-lm` unconditionally
On ARM the required math library depends on whether the soft floating
point ABI is used or not but this is already handled in
`Makefile.system`, lines 499-505.
2017-08-10 11:58:25 -04:00
Christoph Conrads da611dd81a ARM: recognize gfortran pre-releases
Without proper recognition of gfortran versions such as
> GNU Fortran (GCC) 4.9.x 20150123 (prerelease)
OpenBLAS assumes the presence of the G77 compiler. Consequently,
`-lgfortran` is missing from the pkg-config file.

The aforementioned compiler is the gfortran compiler in the Android repo
in a commit tagged as `ndk-r14`, cf. Paul Mustière's gfortran build
instructions for Android at https://github.com/buffer51/android-gfortran
2017-08-10 11:48:29 -04:00
Martin Kroeker e6b9b660c3 Merge pull request #1264 from isuruf/dyn
Support DYNAMIC_ARCH with CMake
2017-08-08 23:47:47 +02:00
Martin Kroeker 62635b1fb5 Merge pull request #1268 from jirutka/travis-2
Travis: Add jobs building with clang and disable job `LINUX64_MUSL USE_OPENMP=1`
2017-08-08 21:54:35 +02:00
Martin Kroeker 75ddfbd687 Change travis back to sudo true
,,,to see if this has any influence on the recent ld SIGKILLS
2017-08-08 16:39:13 +02:00
Isuru Fernando 2f12ea017b No strncasecmp with MSVC 2017-08-08 00:07:25 +05:30
Isuru Fernando d3b677fe87 Add commonobjs 2017-08-07 23:12:40 +05:30
Isuru Fernando 6f83004252 Test DYNAMIC_ARCH on appveyor 2017-08-06 19:17:31 +05:30
Isuru Fernando 505b218829 Merge remote-tracking branch 'upstream/develop' into dyn 2017-08-06 19:07:00 +05:30
Martin Kroeker 719fcc56b0 Merge pull request #1262 from martin-frbg/xmv_thread-splitting
Make sure that range limit of last thread never exceeds data size
2017-08-06 14:11:44 +02:00
Martin Kroeker 1296c89688 Merge pull request #1256 from isuruf/develop
Support compiling with clang on windows
2017-08-06 14:10:18 +02:00
Jakub Jirutka dfb91f2a31 Travis: Add jobs building with clang 2017-08-06 12:40:59 +02:00
Jakub Jirutka 2ec76011f9 Travis: Disable job "LINUX64_MUSL USE_OPENMP=1"
https://github.com/xianyi/OpenBLAS/pull/1255#issuecomment-320494610
2017-08-06 11:06:03 +02:00
Isuru Fernando 7abbe40980 Build all branches so that appveyor works in forks 2017-08-04 08:04:16 +05:30
Isuru Fernando ca32b66a1c New utest for clang 2017-08-04 07:58:04 +05:30
Isuru Fernando d9346930dd Merge remote-tracking branch 'upstream/develop' into develop 2017-08-04 07:57:55 +05:30
Martin Kroeker 0bfe5c6822 Merge pull request #1266 from ashwinyes/develop_thunderx2t99_fix_clang_compilation
THUDNERX2T99: Fix clang compilation
2017-08-03 15:33:28 +02:00
Ashwin Sekhar T K 4899d67f7d THUDNERX2T99: Fix clang compilation 2017-08-02 11:28:45 -07:00
Isuru Fernando 1d1854032b Add missing EXCAVATOR 2017-08-02 19:03:04 +05:30