Martin Kroeker
c49c6b237d
Merge pull request #1382 from martin-frbg/dtrmv-1332
...
Work around errors in multithreaded dtrmv
2017-12-05 19:53:23 +01:00
Martin Kroeker
e2469a9ebc
Merge pull request #1386 from martin-frbg/bignuma
...
Limit MAX_CPU to 1024 for now
2017-12-05 19:52:52 +01:00
Martin Kroeker
5b71f3a8e4
Merge pull request #1387 from martin-frbg/cmakeandroid
...
Explicitly link against libm on Android with cmake as well
2017-12-05 19:52:03 +01:00
Martin Kroeker
9381ac2748
Explicitly link against libm on Android with cmake as well
...
Patch from #1384
2017-12-05 13:02:48 +01:00
Martin Kroeker
28ae3ca76f
Limit MAX_CPU to 1024 for now
...
Some Linux distributions (notably SuSE) have raised CPU_SETSIZE to 4096, apparently disregarding API limitations.
According to #1348, the highest value to survive array initialization (on a desktop system) is 3232, and 1024, which is
the more usual CPU_SETSIZE limit, was demonstrated to work fine on an actual bignuma system.
2017-12-05 12:54:15 +01:00
Martin Kroeker
b414283f48
Disable gemv unrolling
...
as a (hopefully temporary) workaround for #1332
2017-12-03 22:41:54 +01:00
Martin Kroeker
38763ec4f3
Disable multithreading for trmv
...
as a (hopefully temporary) workaround for #1332
2017-12-03 22:40:54 +01:00
Martin Kroeker
452fbef0bf
Merge pull request #1381 from martin-frbg/ctest-warnings
...
Fix compiler warnings in ctest
2017-12-03 21:35:20 +01:00
Martin Kroeker
8c8313983b
Fix compiler warnings in ctest
...
Various fixes for const correctness, stray tab characters and unused labels
2017-12-03 18:19:30 +01:00
Martin Kroeker
881a50c093
Merge pull request #1379 from martin-frbg/warnfix
...
Work around compiler warnings for unused variables
2017-12-03 13:04:02 +01:00
Martin Kroeker
8213385ab8
Work around compiler warnings for unused variables in the generic zgemm3m_Xcopy kernels
2017-12-02 22:51:58 +01:00
Martin Kroeker
bede1c4fb4
Merge pull request #1372 from martin-frbg/param
...
Correct zgeadd_k prototype
2017-12-02 16:49:47 +01:00
Martin Kroeker
1d2da67841
Prefix make jobs with travis_wait (#1378)
...
* Prefix make with travis_wait to prevent it from getting killed for producing no output
* Extend travis_wait to 30 minutes for the Windows build
* Try a 45-minute wait time
* Increase travis_wait time to 45 minutes for Linux builds as well
2017-12-02 12:59:27 +01:00
Martin Kroeker
0dc291d3fa
Merge pull request #1377 from isuruf/threads
...
Allow overriding NUM_THREADS in cmake
2017-12-01 16:22:35 +01:00
Isuru Fernando
e0ddd7d124
Allow overriding NUM_THREADS
2017-12-01 01:42:45 -06:00
Martin Kroeker
adf4316f0e
Merge pull request #1376 from xoviat/patch-2
...
[appveyor] fix test directory
2017-12-01 08:11:12 +01:00
xoviat
7fce11a5b8
[appveyor] fix test directory
2017-11-30 16:31:09 -06:00
Martin Kroeker
c40f01ccea
Merge pull request #1375 from xoviat/patch-1
...
[appveyor] Use out-of-tree build and cache
2017-11-30 22:43:54 +01:00
xoviat
c567e34e6b
[appveyor] fix syntax
2017-11-30 15:33:32 -06:00
xoviat
c917278d23
[appveyor] Use out-of-tree build and cache
2017-11-30 15:30:10 -06:00
Martin Kroeker
0639ed1258
Merge pull request #1373 from mc10/patch-1
...
README: Use the SVG Travis badge
2017-11-30 12:54:52 +01:00
Kevin Ji
f017e169dc
README: Use the SVG Travis badge
2017-11-29 15:21:12 -08:00
Martin Kroeker
7e860acd38
Correct zgeadd_k prototype
2017-11-29 19:57:35 +01:00
Martin Kroeker
db00a51e6b
Merge pull request #1371 from martin-frbg/develop
...
Add trivially optimized DSDOT for POWER8
2017-11-29 19:55:21 +01:00
martin
7a4b3cfbf8
Add trivially optimized DSDOT for POWER8
2017-11-28 18:38:07 +01:00
Martin Kroeker
6c77b5f267
Merge pull request #1369 from martin-frbg/dsdot
...
Add optimized dsdot to all other x86_64 kernels that use sdot.c
2017-11-28 18:15:31 +01:00
Martin Kroeker
d8b3c3c7db
Merge pull request #1368 from brada4/develop
...
Eliminate warnings
2017-11-28 18:15:04 +01:00
Martin Kroeker
beb18492fd
Merge pull request #1366 from martin-frbg/develop
...
Update LAPACK to 3.8.0
2017-11-26 19:12:00 +01:00
Andrew
441a9c8385
more dead increments clang4 scan-build deadcode.deadstores
2017-11-26 17:24:08 +01:00
Andrew
1236dbe5a6
Eliminate 2-8 dead increments code
2017-11-26 13:26:11 +01:00
Andrew
ef95cd471f
eliminate unread variable, after reiteration 3 of them (clang4)
2017-11-25 02:54:37 +01:00
Martin Kroeker
c92cd6d162
Add trivially optimized dsdot based on sdot
2017-11-24 20:05:27 +01:00
Martin Kroeker
cae5d9a20b
Add trivially optimized dsdot based on sdot
2017-11-24 20:04:29 +01:00
Martin Kroeker
3d891c3106
Add trivially optimized dsdot based on sdot
2017-11-24 20:03:40 +01:00
Martin Kroeker
4fbdcfa823
Add trivially optimized dsdot based on sdot
2017-11-24 20:02:28 +01:00
Martin Kroeker
1bb6a96ebc
Add trivially optimized dsdot based on sdot
2017-11-24 20:01:42 +01:00
Martin Kroeker
6bd163f37a
Add trivially optimized dsdot based on sdot
2017-11-24 20:00:23 +01:00
Martin Kroeker
f0333333d1
Add trivially optimized dsdot based on sdot
2017-11-24 19:59:28 +01:00
Andrew
e14d50d86e
eliminate Wunused-const gcc7 warning
2017-11-24 19:13:24 +01:00
Andrew
e89b979b2c
fix spurious compiler warning fix (no code change)
2017-11-24 18:39:04 +01:00
Andrew
7e9b29b9b8
fix spurious compiler warning (no code change)
2017-11-24 18:36:37 +01:00
martin
5056a044b2
fix location of lapacke_nancheck
2017-11-24 09:15:20 +01:00
martin
4054d32def
update cmake files
2017-11-24 08:15:40 +01:00
martin
2d52f0f4c3
update cmakefiles for lapack 3.8.0
2017-11-23 21:22:01 +01:00
martin
3be5c3d343
Update LAPACK to 3.8.0
2017-11-23 18:13:35 +01:00
Martin Kroeker
b18730f9e1
Merge pull request #1365 from xoviat/patch-1
...
[appveyor] use cmake from conda forge
2017-11-22 21:13:41 +01:00
xoviat
be9e63c021
[appveyor] use cmake from conda forge
2017-11-21 18:44:02 -06:00
Martin Kroeker
21844238ca
Merge pull request #1364 from martin-frbg/shmem-init
...
Handle shmem init failures in cpu affinity setup code
2017-11-19 12:50:16 +01:00
Martin Kroeker
07e7c36dac
Handle shmem init failures in cpu affinity setup code
...
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
2017-11-18 23:57:44 +01:00
Martin Kroeker
9251a2efde
Merge pull request #1359 from brada4/develop
...
Eliminate mode variable where not needed in syrk interface
2017-11-18 23:47:17 +01:00