Commit Graph

2663 Commits

Author SHA1 Message Date
Martin Kroeker
db00a51e6b Merge pull request #1371 from martin-frbg/develop
Add trivially optimized DSDOT for POWER8
2017-11-29 19:55:21 +01:00
martin
7a4b3cfbf8 Add trivially optimized DSDOT for POWER8 2017-11-28 18:38:07 +01:00
Martin Kroeker
6c77b5f267 Merge pull request #1369 from martin-frbg/dsdot
Add optimized dsdot to all other x86_64 kernels that use sdot.c
2017-11-28 18:15:31 +01:00
Martin Kroeker
d8b3c3c7db Merge pull request #1368 from brada4/develop
Eliminate warnings
2017-11-28 18:15:04 +01:00
Martin Kroeker
beb18492fd Merge pull request #1366 from martin-frbg/develop
Update LAPACK to 3.8.0
2017-11-26 19:12:00 +01:00
Andrew
441a9c8385 more dead increments clang4 scan-build deadcode.deadstores 2017-11-26 17:24:08 +01:00
Andrew
1236dbe5a6 Eliminate 2-8 dead increments code 2017-11-26 13:26:11 +01:00
Andrew
ef95cd471f elminate unread variable, after reiteration 3 of them (clang4) 2017-11-25 02:54:37 +01:00
Martin Kroeker
c92cd6d162 Add trivially optimized dsdot based on sdot 2017-11-24 20:05:27 +01:00
Martin Kroeker
cae5d9a20b Add trivially optimized dsdot based on sdot 2017-11-24 20:04:29 +01:00
Martin Kroeker
3d891c3106 Add trivially optimized dsdot based on sdot 2017-11-24 20:03:40 +01:00
Martin Kroeker
4fbdcfa823 Add trivially optimized dsdot based on sdot 2017-11-24 20:02:28 +01:00
Martin Kroeker
1bb6a96ebc Add trivially optimized dsdot based on sdot 2017-11-24 20:01:42 +01:00
Martin Kroeker
6bd163f37a Add trivially optimized dsdot based on sdot 2017-11-24 20:00:23 +01:00
Martin Kroeker
f0333333d1 Add trivially optimized dsdot based on sdot 2017-11-24 19:59:28 +01:00
Andrew
e14d50d86e eliminate Wunused-const gcc7 warning 2017-11-24 19:13:24 +01:00
Andrew
e89b979b2c fix spurious compiler warning fix (no code change) 2017-11-24 18:39:04 +01:00
Andrew
7e9b29b9b8 fix spurious compiler warning (no code change) 2017-11-24 18:36:37 +01:00
martin
5056a044b2 fix location of lapacke_nancheck 2017-11-24 09:15:20 +01:00
martin
4054d32def update cmake files 2017-11-24 08:15:40 +01:00
martin
2d52f0f4c3 update cmakefiles for lapack 3.8.0 2017-11-23 21:22:01 +01:00
martin
3be5c3d343 Update LAPACK to 3.8.0 2017-11-23 18:13:35 +01:00
Martin Kroeker
b18730f9e1 Merge pull request #1365 from xoviat/patch-1
[appveyor] use cmake from conda forge
2017-11-22 21:13:41 +01:00
xoviat
be9e63c021 [appveyor] use cmake from conda forge 2017-11-21 18:44:02 -06:00
Martin Kroeker
21844238ca Merge pull request #1364 from martin-frbg/shmem-init
Handle shmem init failures in cpu affinity setup code
2017-11-19 12:50:16 +01:00
Martin Kroeker
07e7c36dac Handle shmem init failures in cpu affinity setup code
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
2017-11-18 23:57:44 +01:00
Martin Kroeker
9251a2efde Merge pull request #1359 from brada4/develop
Eliminate mode variable where not needed in syrk interface
2017-11-18 23:47:17 +01:00
Martin Kroeker
06adbf2e41 Merge pull request #1347 from martin-frbg/issue1322
Change CBLAS complex functions to take void pointers
2017-11-18 23:46:57 +01:00
Martin Kroeker
b46e2b57cc Make return parameter of cblas_Xdotc_sub, cblas_Xdotu_sub a void pointer as well 2017-11-18 20:28:02 +01:00
Martin Kroeker
3ce401f51b Make last parameter of cblas_Xdotc_sub/cblas_Xdotu_sub a void pointer as well 2017-11-18 18:58:40 +01:00
Martin Kroeker
1c9f4422b5 Fix declaration of cblas_Xdotc_sub and cblas_Xdotu_sub
last parameter of cblas_(c,z)dotc_sub and cblas_(c,z)dotu_sub should be void* as well
2017-11-18 18:56:30 +01:00
Andrew
27575d200a Eliminate mode variable where not needed 2017-11-15 15:32:38 +01:00
Martin Kroeker
6157d0902a Merge pull request #1358 from martin-frbg/unused_vars
Clean up spurious unused variables in the kernels
2017-11-15 11:31:43 +01:00
Martin Kroeker
3fea849bbf Remove unused variables from Haswell dtrmm and Bulldozer dtrsm 2017-11-14 23:35:10 +01:00
Martin Kroeker
8f177621bc Remove unused variables at0...at3 from ?symv_U 2017-11-14 23:32:25 +01:00
Martin Kroeker
5f402b7759 Remove unused (loop?) variable j from the gemv_n_4 implementations 2017-11-14 23:29:42 +01:00
Martin Kroeker
65bf0a343c Remove unused variable btpr 2017-11-14 23:25:50 +01:00
Martin Kroeker
acf3d34bc5 Silence an unused variable warning with a cast
l2 cache size is not universally needed to assign default unrolling limits, but neither putting its declaration inside an ifdef nor cloning it into all ifdef sections that need it really makes sense here.
2017-11-14 23:23:44 +01:00
Martin Kroeker
8e75f7dcb4 Merge pull request #1353 from xoviat/patch-1
[appveyor] use flang from conda-forge
2017-11-10 22:16:31 +01:00
Martin Kroeker
bd3546704c Merge pull request #1356 from martin-frbg/lapack-issue196
Break out of potentially infinite rescaling loop after 1000 iterations
2017-11-10 22:15:27 +01:00
Martin Kroeker
2df1e3372d Break out of potentially infinite rescaling loop after 1000 iterations
Inf values in the input vector will survive rescaling, causing an infinite loop. The value of 1000 is arbitrarily chosen as a large but finite value with the intention to never interfere with regular calculations.
2017-11-10 20:02:21 +01:00
Martin Kroeker
4271b2b158 Merge pull request #1354 from martin-frbg/shmem
Try to handle shmget or shmat failing
2017-11-10 09:11:03 +01:00
Martin Kroeker
148493df89 Merge branch 'develop' into shmem 2017-11-09 23:25:15 +01:00
Martin Kroeker
415555a9c1 Merge branch 'develop' into shmem 2017-11-09 23:20:54 +01:00
Martin Kroeker
2a6fef9a55 Try to handle shmget or shmat failing
also replaces one verbatim sched_yield with the YIELDING macro for consistency as suggested in #1351
2017-11-09 23:16:13 +01:00
xoviat
307305aeb5 [appeyor] use flang from conda-forge
This flang will be updated in the future. We leave cmake because it's
not yet released with fortran support
2017-11-09 15:10:02 -06:00
Martin Kroeker
cc26cdce0c Merge pull request #1352 from martin-frbg/issue1351
Output an error message when shmat() fails
2017-11-09 21:08:16 +01:00
Martin Kroeker
d8576826c4 Output an error message when shmat() fails
Observed in #1351 with SELinux as the likely culprit. Without the message, the user saw a segfault with no apparent reason
2017-11-09 17:31:44 +01:00
Martin Kroeker
c6968edec4 Merge pull request #1350 from insertinterestingnamehere/flang
WIP: Support for Flang on Windows
2017-11-08 11:40:13 +01:00
Isuru Fernando
9268314290 Fix gensymbol script 2017-11-06 21:12:38 -06:00