AbdelRauf
|
47f892198c
|
conflict resolve
|
2019-05-01 19:36:22 +00:00 |
Martin Kroeker
|
b43c8382c8
|
Correct argument of CPU_ISSET for glibc <2.5
fixes #2104
|
2019-05-01 10:46:46 +02:00 |
luz.paz
|
daf2fec12d
|
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
|
2019-04-29 17:03:56 -04:00 |
Martin Kroeker
|
4f8143b098
|
Increment version to 0.3.7.dev
|
2019-04-29 19:25:32 +02:00 |
Martin Kroeker
|
bfeb9c16b0
|
Increment version to 0.3.7.dev
|
2019-04-29 19:24:53 +02:00 |
Martin Kroeker
|
97d5034ed3
|
Merge branch 'release-0.3.0' into develop
|
2019-04-29 19:21:54 +02:00 |
Martin Kroeker
|
9763f872fc
|
Update Changelog with changes from 0.3.6
|
2019-04-29 19:18:26 +02:00 |
AbdelRauf
|
628b335e83
|
Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop
|
2019-04-29 08:57:44 +00:00 |
AbdelRauf
|
0f105dd8a5
|
sgemm/strmm
|
2019-04-29 08:49:50 +00:00 |
Martin Kroeker
|
9c4edd38f2
|
Merge pull request #2099 from martin-frbg/rela-gbtrf
Disable repeated recursion on Ab_BR in ReLAPACK xGBTRF
|
2019-04-29 09:25:19 +02:00 |
Martin Kroeker
|
1036299da0
|
Disable repeated recursion on Ab_BR in ReLAPACK xGBTRF
due to crashes in LAPACK tests
|
2019-04-29 00:12:37 +02:00 |
Martin Kroeker
|
5b0398186e
|
Merge pull request #2098 from martin-frbg/rela-malloc
Disable reallocation of work array in ReLAPACK xSYTRF
|
2019-04-28 19:31:01 +02:00 |
Martin Kroeker
|
452859f4e1
|
Merge pull request #2097 from martin-frbg/rela-getrf
Correct INFO=4 condition in ReLAPACK xGETRF
|
2019-04-28 19:28:57 +02:00 |
Martin Kroeker
|
2cd463eabd
|
Disable reallocation of work array in xSYTRF
as it appears to cause memory management problems (seen in the LAPACK tests)
|
2019-04-28 10:02:28 +02:00 |
Martin Kroeker
|
11530b76f7
|
Correct INFO=4 condition
|
2019-04-28 09:58:56 +02:00 |
Martin Kroeker
|
91943b7325
|
Merge pull request #2096 from martin-frbg/eig-testing
Avoid out-of-bounds accesses in LAPACK EIG tests
|
2019-04-28 09:55:42 +02:00 |
Martin Kroeker
|
268c28db7d
|
Merge pull request #2095 from martin-frbg/trsm
Correct length of name string in xerbla call
|
2019-04-28 09:55:25 +02:00 |
Martin Kroeker
|
2aad88d5b9
|
Avoid out-of-bounds accesses in LAPACK EIG tests
see https://github.com/Reference-LAPACK/lapack/issues/333
|
2019-04-27 23:01:49 +02:00 |
Martin Kroeker
|
0bd956fd21
|
Correct length of name string in xerbla call
|
2019-04-27 22:49:04 +02:00 |
Martin Kroeker
|
bbd9d98664
|
Merge pull request #2094 from martin-frbg/issue2066
Fix ReLAPACK integration problems
|
2019-04-27 22:45:47 +02:00 |
Martin Kroeker
|
798c448b0c
|
Add support for INTERFACE64 and fix XERBLA calls
1. Replaced all instances of "int" with "blasint"
2. Added string length as "hidden" third parameter in calls to fortran XERBLA
|
2019-04-27 19:06:00 +02:00 |
Martin Kroeker
|
9a19616a28
|
Support INTERFACE64=1
|
2019-04-27 18:55:47 +02:00 |
Martin Kroeker
|
6b41eb9c0c
|
Merge pull request #2092 from jeffbaylor/snprintf_with_MSC_VER
snprintf define consolidated to common.h
|
2019-04-23 20:12:06 +02:00 |
Martin Kroeker
|
ccfb7ead15
|
Merge pull request #2072 from martin-frbg/sum
Add (C)BLAS extension ?sum
|
2019-04-23 20:11:36 +02:00 |
Jeff Baylor
|
40e53e52d6
|
snprintf define consolidated to common.h
|
2019-04-22 17:01:34 -07:00 |
Martin Kroeker
|
744779d335
|
Merge pull request #2084 from RashmicaG/develop
Add in runtime CPU detection for POWER.
|
2019-04-14 21:40:07 +02:00 |
Rashmica Gupta
|
bcdf1d4917
|
Add in runtime CPU detection for POWER.
|
2019-04-09 14:20:16 +10:00 |
Martin Kroeker
|
e06b8438b4
|
Merge pull request #2080 from martin-frbg/issue2075
Add -lm and disable EXPRECISION support on *BSD
|
2019-04-02 21:40:58 +02:00 |
Martin Kroeker
|
9229d6859b
|
Add -lm and disable EXPRECISION support on *BSD
fixes #2075
|
2019-04-02 09:38:18 +02:00 |
Martin Kroeker
|
21d146a8de
|
Add declarations for ?sum
|
2019-03-31 22:12:23 +02:00 |
Martin Kroeker
|
7f4e36d219
|
Merge pull request #2073 from martin-frbg/issue2056-2
Detect 32bit environment on 64bit ARM hardware
|
2019-03-31 13:56:08 +02:00 |
Martin Kroeker
|
c04a729081
|
Add ?sum definitions for generic kernel
|
2019-03-31 13:55:49 +02:00 |
Martin Kroeker
|
100d94f94e
|
Add ?sum
|
2019-03-31 13:55:05 +02:00 |
Martin Kroeker
|
d17da6c6a4
|
Add cmake defaults for ?sum kernels
|
2019-03-31 11:57:01 +02:00 |
Martin Kroeker
|
1679de5e59
|
Detect 32bit environment on 64bit ARM hardware
for #2056, using same approach as #2058
|
2019-03-31 10:50:43 +02:00 |
Martin Kroeker
|
246ca29679
|
Add ZARCH implementation of ?sum
as trivial copies of the respective ?asum kernels with the ABS and vflpsb calls removed
|
2019-03-30 22:49:05 +01:00 |
Martin Kroeker
|
9d717cb5ee
|
Add x86_64 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
|
2019-03-30 22:27:04 +01:00 |
Martin Kroeker
|
e3bc83f2a8
|
Add x86 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
|
2019-03-30 22:26:10 +01:00 |
Martin Kroeker
|
70f2a4e0d7
|
Add SPARC implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmov to preserve code structure
|
2019-03-30 22:25:06 +01:00 |
Martin Kroeker
|
706dfe263b
|
Add POWER implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmr to preserve code structure
|
2019-03-30 22:23:42 +01:00 |
Martin Kroeker
|
688fa9201c
|
Add MIPS64 implementation of ?sum
as trivial copy of ?asum with the fabs replaced by mov to preserve code structure
|
2019-03-30 22:22:15 +01:00 |
Martin Kroeker
|
cdbe0f0235
|
Add MIPS implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
|
2019-03-30 22:20:14 +01:00 |
Martin Kroeker
|
f8b82bc6dc
|
Add ia64 implementation of ?sum
as trivial copy of asum with the fabs calls removed
|
2019-03-30 22:18:03 +01:00 |
Martin Kroeker
|
3e3ccb9011
|
Add ARM64 implementations of ?sum
as trivial copies of the respective ?asum kernels with the fabs calls removed
|
2019-03-30 22:13:36 +01:00 |
Martin Kroeker
|
94ab4e6fb2
|
Add ARM implementations of ?sum
(trivial copies of the respective ?asum with the fabs calls removed)
|
2019-03-30 22:11:38 +01:00 |
Martin Kroeker
|
c3cfc6986b
|
Add implementations of ssum/dsum and csum/zsum
as trivial copies of asum/zsasum with the fabs calls replaced by fmov to preserve code structure
|
2019-03-30 22:05:11 +01:00 |
Martin Kroeker
|
b9f4943a14
|
Add ?sum
|
2019-03-30 22:01:13 +01:00 |
Martin Kroeker
|
79cfc24a62
|
Add interface for ?sum (derived from ?asum)
|
2019-03-30 21:59:18 +01:00 |
Martin Kroeker
|
5c42287c4f
|
Add declarations for ?sum and cblas_?sum
|
2019-03-30 21:58:03 +01:00 |
Martin Kroeker
|
32c7063cb0
|
Merge pull request #2061 from martin-frbg/martin-frbg-patch-1
Disable the AVX512 DGEMM kernel (again)
|
2019-03-30 21:21:38 +01:00 |