Commit Graph

2686 Commits

Author SHA1 Message Date
Martin Kroeker
c49c6b237d Merge pull request #1382 from martin-frbg/dtrmv-1332
Work around errors in multithreaded dtrmv
2017-12-05 19:53:23 +01:00
Martin Kroeker
e2469a9ebc Merge pull request #1386 from martin-frbg/bignuma
Limit MAX_CPU to 1024 for now
2017-12-05 19:52:52 +01:00
Martin Kroeker
5b71f3a8e4 Merge pull request #1387 from martin-frbg/cmakeandroid
Explicitly link against libm on Android with cmake as well
2017-12-05 19:52:03 +01:00
Martin Kroeker
9381ac2748 Explicitly link against libm on Android with cmake as well
Patch from #1384
2017-12-05 13:02:48 +01:00
Martin Kroeker
28ae3ca76f Limit MAX_CPU to 1024 for now
Some Linux distributions (notably SuSE) have raised CPU_SETSIZE to 4096, apparently disregarding API limitations.
From #1348, the highest value to survive array initialization (on a desktop system) is 3232, and 1024 - which is the 
more usual CPU_SETSIZE limit, was demonstrated to work fine on an actual bignuma system.
2017-12-05 12:54:15 +01:00
Martin Kroeker
b414283f48 Disable gemv unrolling
as a (hopefully temporary) workaround for #1332
2017-12-03 22:41:54 +01:00
Martin Kroeker
38763ec4f3 Disable multithreading for trmv
as a (hopefully temporary) workaround for #1332
2017-12-03 22:40:54 +01:00
Martin Kroeker
452fbef0bf Merge pull request #1381 from martin-frbg/ctest-warnings
Fix compiler warnings in ctest
2017-12-03 21:35:20 +01:00
Martin Kroeker
8c8313983b Fix compiler warnings in ctest
Various fixes for const correctness, stray tab characters and unused labels
2017-12-03 18:19:30 +01:00
Martin Kroeker
881a50c093 Merge pull request #1379 from martin-frbg/warnfix
Work around compiler warnings for unused variables
2017-12-03 13:04:02 +01:00
Martin Kroeker
8213385ab8 Work around compiler warnings for unused variables in the generic zgemm3m_Xcopy kernels 2017-12-02 22:51:58 +01:00
Martin Kroeker
bede1c4fb4 Merge pull request #1372 from martin-frbg/param
Correct zgeadd_k prototype
2017-12-02 16:49:47 +01:00
Martin Kroeker
1d2da67841 Prefix make jobs with travis_wait (#1378)
* Prefix make with travis_wait to prevent it getting killed for producing no output

* Extend travis_wait to 30mins for the windows build

* Trying 45 mins wait time

* Increase travis_wait time to 45 minutes for linux builds as well
2017-12-02 12:59:27 +01:00
Martin Kroeker
0dc291d3fa Merge pull request #1377 from isuruf/threads
Allow overriding NUM_THREADS in cmake
2017-12-01 16:22:35 +01:00
Isuru Fernando
e0ddd7d124 Allow overriding NUM_THREADS 2017-12-01 01:42:45 -06:00
Martin Kroeker
adf4316f0e Merge pull request #1376 from xoviat/patch-2
[appveyor] fix test directory
2017-12-01 08:11:12 +01:00
xoviat
7fce11a5b8 [appveyor] fix test directory 2017-11-30 16:31:09 -06:00
Martin Kroeker
c40f01ccea Merge pull request #1375 from xoviat/patch-1
[appveyor] Use out-of-tree build and cache
2017-11-30 22:43:54 +01:00
xoviat
c567e34e6b [appveyor] fix syntax 2017-11-30 15:33:32 -06:00
xoviat
c917278d23 [appveyor] Use out-of-tree build and cache 2017-11-30 15:30:10 -06:00
Martin Kroeker
0639ed1258 Merge pull request #1373 from mc10/patch-1
README: Use the SVG Travis badge
2017-11-30 12:54:52 +01:00
Kevin Ji
f017e169dc README: Use the SVG Travis badge 2017-11-29 15:21:12 -08:00
Martin Kroeker
7e860acd38 Correct zgeadd_k prototype 2017-11-29 19:57:35 +01:00
Martin Kroeker
db00a51e6b Merge pull request #1371 from martin-frbg/develop
Add trivially optimized DSDOT for POWER8
2017-11-29 19:55:21 +01:00
martin
7a4b3cfbf8 Add trivially optimized DSDOT for POWER8 2017-11-28 18:38:07 +01:00
Martin Kroeker
6c77b5f267 Merge pull request #1369 from martin-frbg/dsdot
Add optimized dsdot to all other x86_64 kernels that use sdot.c
2017-11-28 18:15:31 +01:00
Martin Kroeker
d8b3c3c7db Merge pull request #1368 from brada4/develop
Eliminate warnings
2017-11-28 18:15:04 +01:00
Martin Kroeker
beb18492fd Merge pull request #1366 from martin-frbg/develop
Update LAPACK to 3.8.0
2017-11-26 19:12:00 +01:00
Andrew
441a9c8385 more dead increments clang4 scan-build deadcode.deadstores 2017-11-26 17:24:08 +01:00
Andrew
1236dbe5a6 Eliminate 2-8 dead increments code 2017-11-26 13:26:11 +01:00
Andrew
ef95cd471f elminate unread variable, after reiteration 3 of them (clang4) 2017-11-25 02:54:37 +01:00
Martin Kroeker
c92cd6d162 Add trivially optimized dsdot based on sdot 2017-11-24 20:05:27 +01:00
Martin Kroeker
cae5d9a20b Add trivially optimized dsdot based on sdot 2017-11-24 20:04:29 +01:00
Martin Kroeker
3d891c3106 Add trivially optimized dsdot based on sdot 2017-11-24 20:03:40 +01:00
Martin Kroeker
4fbdcfa823 Add trivially optimized dsdot based on sdot 2017-11-24 20:02:28 +01:00
Martin Kroeker
1bb6a96ebc Add trivially optimized dsdot based on sdot 2017-11-24 20:01:42 +01:00
Martin Kroeker
6bd163f37a Add trivially optimized dsdot based on sdot 2017-11-24 20:00:23 +01:00
Martin Kroeker
f0333333d1 Add trivially optimized dsdot based on sdot 2017-11-24 19:59:28 +01:00
Andrew
e14d50d86e eliminate Wunused-const gcc7 warning 2017-11-24 19:13:24 +01:00
Andrew
e89b979b2c fix spurious compiler warning fix (no code change) 2017-11-24 18:39:04 +01:00
Andrew
7e9b29b9b8 fix spurious compiler warning (no code change) 2017-11-24 18:36:37 +01:00
martin
5056a044b2 fix location of lapacke_nancheck 2017-11-24 09:15:20 +01:00
martin
4054d32def update cmake files 2017-11-24 08:15:40 +01:00
martin
2d52f0f4c3 update cmakefiles for lapack 3.8.0 2017-11-23 21:22:01 +01:00
martin
3be5c3d343 Update LAPACK to 3.8.0 2017-11-23 18:13:35 +01:00
Martin Kroeker
b18730f9e1 Merge pull request #1365 from xoviat/patch-1
[appveyor] use cmake from conda forge
2017-11-22 21:13:41 +01:00
xoviat
be9e63c021 [appveyor] use cmake from conda forge 2017-11-21 18:44:02 -06:00
Martin Kroeker
21844238ca Merge pull request #1364 from martin-frbg/shmem-init
Handle shmem init failures in cpu affinity setup code
2017-11-19 12:50:16 +01:00
Martin Kroeker
07e7c36dac Handle shmem init failures in cpu affinity setup code
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
2017-11-18 23:57:44 +01:00
Martin Kroeker
9251a2efde Merge pull request #1359 from brada4/develop
Eliminate mode variable where not needed in syrk interface
2017-11-18 23:47:17 +01:00