QWR QWR
|
28ca97015d
|
power8:Added initial zgemv_(t|n) ,i(d|z)amax,i(d|z)amin,dgemv_t(transposed),zrot
z13: improved zgemv_(t|n)_4,zscal,zaxpy
|
2018-03-27 14:54:41 +00:00 |
the mslm
|
2c0a008281
|
dgemm_ncopy_4_ save/restore
|
2018-02-18 01:30:17 +00:00 |
the mslm
|
c5425daa6b
|
power8 ?gemm_tcopy save/restore
|
2018-02-16 23:36:46 +00:00 |
Abdelrauf
|
60596a1abc
|
Merge branch 'develop' into develop
|
2018-01-31 16:17:04 -08:00 |
Abdelrauf
|
afd514c25d
|
small fix inside ifdef z13mvc . (z13mvc code is not used in production)
|
2018-01-31 18:30:59 -05:00 |
Martin Kroeker
|
f45776ec1f
|
Merge pull request #1440 from quickwritereader/develop
small corrections
|
2018-01-31 23:48:47 +01:00 |
Martin Kroeker
|
e388459a27
|
Merge pull request #1419 from brada4/develop
Initialize unitialized values for repeated calls
|
2018-01-31 23:48:34 +01:00 |
Abdelrauf
|
f653e7a18d
|
small fix
small fix inside ifdef z13mvc . (z13mvc code is not used in production)
|
2018-01-31 07:49:38 -08:00 |
the mslm
|
f946a89432
|
zscal (case: real alpha=0 ) mikrokernel shift&mem fix , da_i as input reg. small typo fixes
|
2018-01-26 19:25:27 -08:00 |
Martin Kroeker
|
485df77612
|
Make USE_TRMM depend on TARGET_CORE not TARGET
Fixes #1432 (and possibly other DTRMM-related failures on Haswell and related architectures when built with cmake)
|
2018-01-26 23:20:00 +01:00 |
Martin Kroeker
|
e4c71a799a
|
Merge pull request #1426 from quickwritereader/develop
(Z13 ) Blas1 mikrokernels can be inlined by gcc. Refactoring,fixes,tunings
|
2018-01-20 17:34:54 +01:00 |
the mslm
|
2619ad7ea5
|
Blas1 mikrokernels can be inlined by gcc. Refactoring ( symbolic operan
names). Some fixes and tunings
|
2018-01-19 19:24:35 -08:00 |
Andrew
|
e5cc3d72c0
|
core.IdenticalExpr clang501 checker
|
2018-01-19 23:17:43 +01:00 |
Andrew
|
4938faa822
|
core.IdenticalExpr clang501 checker
|
2018-01-19 23:15:58 +01:00 |
Andrew
|
9fa986337d
|
add missing brackets to silence indentation warnings gcc721
|
2018-01-19 23:11:12 +01:00 |
Andrew
|
3eed97f6b9
|
Initialize values to silence cppcheck
|
2018-01-12 22:35:00 +01:00 |
Andrew
|
13e137fbc9
|
Initialize uninitialized variables (cppcheck)
|
2018-01-12 22:33:41 +01:00 |
Martin Kroeker
|
3d23f45107
|
Merge pull request #1415 from quickwritereader/develop
(Z systems Z13) small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels…
|
2018-01-11 08:35:02 +01:00 |
Abdelrauf
|
87669d1c0a
|
small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels can be inlined
|
2018-01-10 20:36:53 -05:00 |
Martin Kroeker
|
42285d8e70
|
Merge pull request #1410 from brada4/develop
Address warnings #1357
|
2018-01-06 20:02:46 +01:00 |
Andrew
|
d602b99386
|
LAPACK helpers in C that need care too
|
2018-01-02 14:38:50 +01:00 |
Andrew
|
4d0b005e5b
|
Eliminate remaining unused results in kernels (clang5 analyzer)
|
2018-01-01 20:54:39 +01:00 |
Martin Kroeker
|
b81656936f
|
Merge pull request #1409 from martin-frbg/issue1292-2
Tag %1 and %2 as both input and output operands
|
2017-12-31 20:18:48 +01:00 |
Martin Kroeker
|
b973990df2
|
Tag %1 and %2 as both input and output operands
fix from #1292 extended to the other gemv microkernels
|
2017-12-31 18:03:36 +01:00 |
Martin Kroeker
|
1e31124eb0
|
Merge pull request #1406 from martin-frbg/issue1292
Tag %1 and %2 as both input and output
|
2017-12-30 14:52:03 +01:00 |
Martin Kroeker
|
cc9500db41
|
Merge pull request #1403 from brada4/develop
Address few more warnings
|
2017-12-30 14:51:34 +01:00 |
Martin Kroeker
|
723f396a20
|
Tag %1 and %2 as both input and output
The inline assembly modifies its input operands, so mark them as output to avoid surprises with optimization. Fixes #1292
|
2017-12-29 23:56:41 +01:00 |
Andrew
|
03e5ff0687
|
initialize potentially unitialized variables (clang5)
|
2017-12-26 09:24:24 +01:00 |
Andrew
|
47deec2c1a
|
fix couple of dead assignment warnings
|
2017-12-22 00:56:35 +01:00 |
Martin Kroeker
|
43c0622e7b
|
Retire Piledriver/Steamroller/Excavator daxpy microkernels as well
related to issue #1332
|
2017-12-13 18:40:39 +01:00 |
Martin Kroeker
|
0623636c98
|
Use Sandybridge daxpy kernel on Haswell and Zen for now
The testcase from #1332 exposes a problem in daxpy_microk_haswell-2.c that is not seen with
any of the other Intel x86_64 microkernels.
|
2017-12-10 19:24:31 +01:00 |
Andrew
|
281a2b952f
|
warning cleanup (#1380)
* dead increments in driver/level2
* dead increments in kernel/generic
* part dead increments in kernel/x86_64
|
2017-12-05 19:54:10 +01:00 |
Martin Kroeker
|
8213385ab8
|
Work around compiler warnings for unused variables in the generic zgemm3m_Xcopy kernels
|
2017-12-02 22:51:58 +01:00 |
Martin Kroeker
|
db00a51e6b
|
Merge pull request #1371 from martin-frbg/develop
Add trivially optimized DSDOT for POWER8
|
2017-11-29 19:55:21 +01:00 |
martin
|
7a4b3cfbf8
|
Add trivially optimized DSDOT for POWER8
|
2017-11-28 18:38:07 +01:00 |
Martin Kroeker
|
6c77b5f267
|
Merge pull request #1369 from martin-frbg/dsdot
Add optimized dsdot to all other x86_64 kernels that use sdot.c
|
2017-11-28 18:15:31 +01:00 |
Andrew
|
441a9c8385
|
more dead increments clang4 scan-build deadcode.deadstores
|
2017-11-26 17:24:08 +01:00 |
Andrew
|
1236dbe5a6
|
Eliminate 2-8 dead increments code
|
2017-11-26 13:26:11 +01:00 |
Martin Kroeker
|
c92cd6d162
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:05:27 +01:00 |
Martin Kroeker
|
cae5d9a20b
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:04:29 +01:00 |
Martin Kroeker
|
3d891c3106
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:03:40 +01:00 |
Martin Kroeker
|
4fbdcfa823
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:02:28 +01:00 |
Martin Kroeker
|
1bb6a96ebc
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:01:42 +01:00 |
Martin Kroeker
|
6bd163f37a
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 20:00:23 +01:00 |
Martin Kroeker
|
f0333333d1
|
Add trivially optimized dsdot based on sdot
|
2017-11-24 19:59:28 +01:00 |
Andrew
|
e89b979b2c
|
fix spurious compiler warning fix (no code change)
|
2017-11-24 18:39:04 +01:00 |
Andrew
|
7e9b29b9b8
|
fix spurious compiler warning (no code change)
|
2017-11-24 18:36:37 +01:00 |
Martin Kroeker
|
6157d0902a
|
Merge pull request #1358 from martin-frbg/unused_vars
Clean up spurious unused variables in the kernels
|
2017-11-15 11:31:43 +01:00 |
Martin Kroeker
|
3fea849bbf
|
Remove unused variables from Haswell dtrmm and Bulldozer dtrsm
|
2017-11-14 23:35:10 +01:00 |
Martin Kroeker
|
8f177621bc
|
Remove unused variables at0...at3 from ?symv_U
|
2017-11-14 23:32:25 +01:00 |