Commit Graph

2686 Commits

Author SHA1 Message Date
Martin Kroeker c49c6b237d
Merge pull request #1382 from martin-frbg/dtrmv-1332
Work around errors in multithreaded dtrmv
2017-12-05 19:53:23 +01:00
Martin Kroeker e2469a9ebc
Merge pull request #1386 from martin-frbg/bignuma
Limit MAX_CPU to 1024 for now
2017-12-05 19:52:52 +01:00
Martin Kroeker 5b71f3a8e4
Merge pull request #1387 from martin-frbg/cmakeandroid
Explicitly link against libm on Android with cmake as well
2017-12-05 19:52:03 +01:00
Martin Kroeker 9381ac2748
Explicitly link against libm on Android with cmake as well
Patch from #1384
2017-12-05 13:02:48 +01:00
Martin Kroeker 28ae3ca76f
Limit MAX_CPU to 1024 for now
Some Linux distributions (notably SuSE) have raised CPU_SETSIZE to 4096, apparently disregarding API limitations.
From #1348, the highest value to survive array initialization (on a desktop system) is 3232, and 1024, which is the more usual CPU_SETSIZE limit, was demonstrated to work fine on an actual bignuma system.
2017-12-05 12:54:15 +01:00
Martin Kroeker b414283f48
Disable gemv unrolling
as a (hopefully temporary) workaround for #1332
2017-12-03 22:41:54 +01:00
Martin Kroeker 38763ec4f3
Disable multithreading for trmv
as a (hopefully temporary) workaround for #1332
2017-12-03 22:40:54 +01:00
Martin Kroeker 452fbef0bf
Merge pull request #1381 from martin-frbg/ctest-warnings
Fix compiler warnings in ctest
2017-12-03 21:35:20 +01:00
Martin Kroeker 8c8313983b
Fix compiler warnings in ctest
Various fixes for const correctness, stray tab characters and unused labels
2017-12-03 18:19:30 +01:00
Martin Kroeker 881a50c093
Merge pull request #1379 from martin-frbg/warnfix
Work around compiler warnings for unused variables
2017-12-03 13:04:02 +01:00
Martin Kroeker 8213385ab8
Work around compiler warnings for unused variables in the generic zgemm3m_Xcopy kernels 2017-12-02 22:51:58 +01:00
Martin Kroeker bede1c4fb4
Merge pull request #1372 from martin-frbg/param
Correct zgeadd_k prototype
2017-12-02 16:49:47 +01:00
Martin Kroeker 1d2da67841
Prefix make jobs with travis_wait (#1378)
* Prefix make with travis_wait to prevent it getting killed for producing no output

* Extend travis_wait to 30mins for the windows build

* Trying 45 mins wait time

* Increase travis_wait time to 45 minutes for linux builds as well
2017-12-02 12:59:27 +01:00
Martin Kroeker 0dc291d3fa
Merge pull request #1377 from isuruf/threads
Allow overriding NUM_THREADS in cmake
2017-12-01 16:22:35 +01:00
Isuru Fernando e0ddd7d124 Allow overriding NUM_THREADS 2017-12-01 01:42:45 -06:00
Martin Kroeker adf4316f0e
Merge pull request #1376 from xoviat/patch-2
[appveyor] fix test directory
2017-12-01 08:11:12 +01:00
xoviat 7fce11a5b8
[appveyor] fix test directory 2017-11-30 16:31:09 -06:00
Martin Kroeker c40f01ccea
Merge pull request #1375 from xoviat/patch-1
[appveyor] Use out-of-tree build and cache
2017-11-30 22:43:54 +01:00
xoviat c567e34e6b
[appveyor] fix syntax 2017-11-30 15:33:32 -06:00
xoviat c917278d23
[appveyor] Use out-of-tree build and cache 2017-11-30 15:30:10 -06:00
Martin Kroeker 0639ed1258
Merge pull request #1373 from mc10/patch-1
README: Use the SVG Travis badge
2017-11-30 12:54:52 +01:00
Kevin Ji f017e169dc
README: Use the SVG Travis badge 2017-11-29 15:21:12 -08:00
Martin Kroeker 7e860acd38
Correct zgeadd_k prototype 2017-11-29 19:57:35 +01:00
Martin Kroeker db00a51e6b
Merge pull request #1371 from martin-frbg/develop
Add trivially optimized DSDOT for POWER8
2017-11-29 19:55:21 +01:00
martin 7a4b3cfbf8 Add trivially optimized DSDOT for POWER8 2017-11-28 18:38:07 +01:00
Martin Kroeker 6c77b5f267
Merge pull request #1369 from martin-frbg/dsdot
Add optimized dsdot to all other x86_64 kernels that use sdot.c
2017-11-28 18:15:31 +01:00
Martin Kroeker d8b3c3c7db
Merge pull request #1368 from brada4/develop
Eliminate warnings
2017-11-28 18:15:04 +01:00
Martin Kroeker beb18492fd
Merge pull request #1366 from martin-frbg/develop
Update LAPACK to 3.8.0
2017-11-26 19:12:00 +01:00
Andrew 441a9c8385 more dead increments clang4 scan-build deadcode.deadstores 2017-11-26 17:24:08 +01:00
Andrew 1236dbe5a6 Eliminate 2-8 dead increments code 2017-11-26 13:26:11 +01:00
Andrew ef95cd471f eliminate unread variable, after reiteration 3 of them (clang4) 2017-11-25 02:54:37 +01:00
Martin Kroeker c92cd6d162
Add trivially optimized dsdot based on sdot 2017-11-24 20:05:27 +01:00
Martin Kroeker cae5d9a20b
Add trivially optimized dsdot based on sdot 2017-11-24 20:04:29 +01:00
Martin Kroeker 3d891c3106
Add trivially optimized dsdot based on sdot 2017-11-24 20:03:40 +01:00
Martin Kroeker 4fbdcfa823
Add trivially optimized dsdot based on sdot 2017-11-24 20:02:28 +01:00
Martin Kroeker 1bb6a96ebc
Add trivially optimized dsdot based on sdot 2017-11-24 20:01:42 +01:00
Martin Kroeker 6bd163f37a
Add trivially optimized dsdot based on sdot 2017-11-24 20:00:23 +01:00
Martin Kroeker f0333333d1
Add trivially optimized dsdot based on sdot 2017-11-24 19:59:28 +01:00
Andrew e14d50d86e eliminate Wunused-const gcc7 warning 2017-11-24 19:13:24 +01:00
Andrew e89b979b2c fix spurious compiler warning fix (no code change) 2017-11-24 18:39:04 +01:00
Andrew 7e9b29b9b8 fix spurious compiler warning (no code change) 2017-11-24 18:36:37 +01:00
martin 5056a044b2 fix location of lapacke_nancheck 2017-11-24 09:15:20 +01:00
martin 4054d32def update cmake files 2017-11-24 08:15:40 +01:00
martin 2d52f0f4c3 update cmakefiles for lapack 3.8.0 2017-11-23 21:22:01 +01:00
martin 3be5c3d343 Update LAPACK to 3.8.0 2017-11-23 18:13:35 +01:00
Martin Kroeker b18730f9e1
Merge pull request #1365 from xoviat/patch-1
[appveyor] use cmake from conda forge
2017-11-22 21:13:41 +01:00
xoviat be9e63c021
[appveyor] use cmake from conda forge 2017-11-21 18:44:02 -06:00
Martin Kroeker 21844238ca
Merge pull request #1364 from martin-frbg/shmem-init
Handle shmem init failures in cpu affinity setup code
2017-11-19 12:50:16 +01:00
Martin Kroeker 07e7c36dac
Handle shmem init failures in cpu affinity setup code
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
2017-11-18 23:57:44 +01:00
Martin Kroeker 9251a2efde
Merge pull request #1359 from brada4/develop
Eliminate mode variable where not needed in syrk interface
2017-11-18 23:47:17 +01:00