Commit Graph

163 Commits

Author SHA1 Message Date
Martin Kroeker 9f5a74f3c7
Merge pull request #2486 from qqqil/develop
add benchmark for trsv
2020-03-06 14:30:09 +01:00
Martin Kroeker 2afb10975d
Merge pull request #2485 from Darkness303/develop
Add syr2 benchmark
2020-03-06 14:29:27 +01:00
chenxuqiang 32c847df45 benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
Signed-off-by: Xuqiang Chen chenxuqiang3@hisilicon.com
2020-03-06 01:02:02 -05:00
shengyang e0df9485d4 Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
modified:   benchmark/Makefile
	new file:   benchmark/rotm.c
2020-03-05 10:05:59 +08:00
s00527847 0f1a2b12f9 add benchmark for spr/spr2 2020-03-04 15:50:19 -05:00
q00437336 233838b4bc change clock to CLOCK_PROCESS_CPUTIME_ID 2020-03-04 03:54:40 -05:00
q00437336 de74e11641 add benchmark for trsv 2020-03-04 03:23:22 -05:00
Darkness303 114dbec947 1.Add syr2 benchmark
2.Fixed some errors
2020-03-04 14:09:10 +08:00
wuanjun 00447568 87baf9cfe6 Merge branch 'develop' of https://github.com/aaawuanjun/OpenBLAS into develop 2020-03-03 19:03:28 +08:00
wuanjun 00447568 f682d19ed4 [OpenBlas]: add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv 2020-03-03 17:37:33 +08:00
wuanjun 00447568 790d50fbba [OpenBlas]: add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv 2020-03-03 17:13:49 +08:00
Martin Kroeker 59243d49ab
Merge pull request #2479 from Darkness303/develop
Fix potential index overflows at large matrix sizes in the benchmark codes
2020-03-03 08:46:49 +01:00
Martin Kroeker e326c89ae8
Merge pull request #2478 from MacChen02/develop
Update benchmark statistical time function
2020-03-02 21:20:51 +01:00
jianghesong 0f65c05cd1 fix core dumped error 2020-03-02 19:13:45 +08:00
MacChen02 917d243580
Update benchmark statistical time function
The function gettimeofday does not count the time,when testing the axpy small data volume use case.
Use the function clock_gettime to replace the gettimeofday function to count the time.
2020-03-02 14:36:27 +08:00
j00520245 e1062400c4 New add syr benchmark 2020-02-28 16:36:53 +08:00
wuanjun 00447568 c93ae92579 [OpenBlas]:benchmark/copy.c has time,x,y data loop problems 2020-02-24 11:23:39 +08:00
w00421467 ce9ea8f826 Fix another branch 2020-02-05 15:07:18 +08:00
w00421467 0b909203cb Fix bugs in benchmark of gemv 2020-02-05 14:53:37 +08:00
AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 2019-06-04 07:11:30 +00:00
Andrew 3e601bd419 disable NaN checks before BLAS calls dgemm.R 2019-01-16 11:54:22 +02:00
Andrew 478d3c4569 disable NaN checks before BLAS calls deig.R (shorten matrix def) 2019-01-16 11:41:46 +02:00
Andrew 3afceb6c2a disable NaN checks before BLAS calls deig.R 2019-01-16 11:38:14 +02:00
Andrew 7af8b21dbb disable NaN checks before BLAS calls dsolve.R (shorter formula) 2019-01-16 11:34:46 +02:00
Andrew 2777a7f506 disable NaN checks before BLAS calls dsolve.R (shorter config part) 2019-01-16 11:23:51 +02:00
Andrew b70fd23836 disable NaN checks before BLAS calls dsolve.R 2019-01-16 11:18:54 +02:00
maamountki 33043f563f
Disable scal to benchmark zgemv separately by default 2018-08-10 01:54:18 +03:00
Martin Kroeker 35c5a32309
Correct index variables used in MFlops calculation
Fixes #1474
2018-03-27 21:52:29 +02:00
Tim Moon a89d6711c6 Increasing flexibility of GEMM benchmark.
m, n, and k can be set to arbitrary constants. A and B matrices can be transposed independently.
2017-09-28 12:56:29 -07:00
Ashwin Sekhar T K efda640723 Benchmark: Add MFlops print in iamax benchmark 2017-01-24 23:13:47 -08:00
Ashwin Sekhar T K 1530e78cfe Benchmarks: Avoid building lapack benchmarks when NO_LAPACK=1 2017-01-24 20:50:23 -08:00
Ashwin Sekhar T K ca0b36b012 THUNDERX2T99: Add Optimized SNRM2 Implementation 2017-01-24 10:23:21 +05:30
Andrew 8379550076 anti GC and reflow 2017-01-07 19:01:42 +01:00
Ashwin Sekhar T K 8d86d14d3f Add time prints in benchmark output 2016-07-14 13:48:13 +05:30
Ashwin Sekhar T K 925d4e1dc6 Add IAMAX and NRM2 benchmarks 2016-07-14 13:46:01 +05:30
Werner Saar 318cad9c37 added trsm bencharks for POWER8 to benchmark/Makefile 2016-05-22 13:51:47 +02:00
Werner Saar dd2b897795 added bugfixes for some make files and smallscaling.c 2016-04-21 12:54:32 +02:00
Werner Saar 1ca750471a added cholesky benchmarks to Makefile for ESSL 2016-04-10 11:28:20 +02:00
Werner Saar 08bddde3f3 updated benchmark Makefile for ESSL 2016-04-08 10:37:59 +02:00
Werner Saar 12540cedb5 added ESSL to Makefile for benchmarks 2016-04-03 07:21:48 +02:00
Werner Saar 7a92c1538e added benchmark test for srot and drot 2016-03-26 07:14:13 +01:00
Jerome Robert 323c237e7b Fix smallscaling compilation
Also revert 0bbca5e
2016-03-10 20:24:41 +01:00
Werner Saar 0bbca5e803 removed build of smallscaling, because build on arm, arm64 and power fails 2016-03-06 11:54:41 +01:00
Werner Saar 8d652f11e7 updated smallscaling.c to build without C99 or C11
increased the threshold value of nep.in to 40
2016-03-06 08:40:51 +01:00
Jerome Robert 73397faf68 Add benchmark/smallscaling.c
* Bench small matrices with multi-threading
* Close #727
2016-02-08 11:25:27 +01:00
John Kirkham a48b247e9e benchmark/scripts/SCIPY/dsyrk.py: Overwrite will work on a Fortran array of the correct type. 2016-01-19 15:32:28 -05:00
John Kirkham b1b115ecd6 benchmark/scripts/SCIPY/ssyrk.py: Overwrite will work on a Fortran array of the correct type. 2016-01-19 15:31:37 -05:00
John Kirkham 07bba933ff benchmark/scripts/SCIPY/dsyrk.py: Arrays should be Fortran order. 2016-01-19 15:29:43 -05:00
John Kirkham e85f8af519 benchmark/scripts/SCIPY/ssyrk.py: Arrays should be Fortran order. 2016-01-19 15:28:22 -05:00
John Kirkham adfa0ab878 benchmark/scripts/SCIPY/ssyrk.py: Fix PEP8 issues. 2016-01-19 15:06:17 -05:00
John Kirkham cbb6649e97 benchmark/scripts/SCIPY/dsyrk.py: Fix PEP8 issues. 2016-01-19 15:05:18 -05:00
John Kirkham 77abc9b280 benchmark/scripts/SCIPY/ssyrk.py: Write values into `C`. 2016-01-19 15:00:54 -05:00
John Kirkham 81e8690763 benchmark/scripts/SCIPY/dsyrk.py: Write values into `C`. 2016-01-19 15:00:23 -05:00
John Kirkham dd04a8ac22 benchmark/scripts/SCIPY/ssyrk.py: Use the environment python. 2016-01-19 14:05:14 -05:00
John Kirkham cb554b3a9c benchmark/scripts/SCIPY/dsyrk.py: Use the environment python. 2016-01-19 14:04:55 -05:00
John Kirkham 1153459d1b benchmark/scripts/SCIPY/ssyrk.py: Drop unneeded semicolons. 2016-01-19 14:00:51 -05:00
John Kirkham 1a73390ffe benchmark/scripts/SCIPY/dsyrk.py: Drop unneeded semicolons. 2016-01-19 14:00:51 -05:00
John Kirkham 8b981e41a1 benchmark/scripts/SCIPY/ssyrk.py: Allocate `C` using zeros instead of randomly generating it. 2016-01-19 14:00:48 -05:00
John Kirkham c10b1f555d benchmark/scripts/SCIPY/dsyrk.py: Allocate `C` using zeros instead of randomly generating it. 2016-01-19 14:00:26 -05:00
Werner Saar c6a27bbe64 added benchmark tests for ssyrk and dsyrk 2016-01-10 12:19:03 +01:00
Ashwin Sekhar T K 67874468a6 Fix bug in benchmark/gemm.c 2015-11-09 14:15:54 +05:30
Werner Saar 8614057ea9 added benchmark scripts for numpy, octave and R 2015-06-08 14:06:38 +02:00
Werner Saar 7f375f9e8f updated geev benchmark 2015-06-08 12:58:38 +02:00
Werner Saar e19948baa1 small modification of gemm.c 2015-06-03 09:11:51 +02:00
Werner Saar 6a13a94e71 added gesv benchmark 2015-06-02 13:35:49 +02:00
Werner Saar 9c4817d07b bugfix for Makefile on mac 2015-05-31 14:16:51 +02:00
Werner Saar 02c7766f68 bugfixes, to build benchmarks with mingw on Windows OS 2015-05-29 12:56:22 +02:00
Werner Saar c47c8e8cf5 added blas level1 swap benchmark 2015-05-21 08:51:42 +02:00
Werner Saar 23fbc5728e added blas level1 copy benchmark 2015-05-20 11:05:00 +02:00
Andreas Noack cda29f183b Add vecLib benchmarks 2015-05-06 21:52:34 -04:00
Werner Saar be3c843700 added loops to trsm.c 2015-05-06 09:21:19 +02:00
Werner Saar e77db2af31 add benchmarks for zgeru and cgeru 2015-04-25 14:53:07 +02:00
Werner Saar 46c7b4d5c8 added asum benchmark 2015-04-19 11:24:07 +02:00
Werner Saar 8e05d291b5 added scal benchmark 2015-04-18 08:41:41 +02:00
Werner Saar ba926e807c added cdot- and zdot benchmark 2015-04-07 11:56:06 +02:00
Werner Saar ddf983d643 added optimizations for steamroller 2014-12-30 20:14:45 +08:00
Werner Saar 887aed634d modified sources for OS Darwin 2014-12-19 12:40:46 +01:00
Werner Saar 1e566223ed added code for the size of n 2014-12-17 15:02:11 +01:00
Werner Saar 113b48ca22 modified makefile for acml6.1 2014-12-17 14:12:21 +01:00
wernsaar dab4edd069 added benchmark for gemm3m functions 2014-09-21 12:00:41 +02:00
wernsaar 210bec9111 added plot-header to compare multithreading 2014-09-02 14:11:42 +02:00
wernsaar d7f91f8b4f extended gemv.c benchmark 2014-09-01 15:07:36 +02:00
wernsaar 53f1277b6b modified benchmark/gemv.c 2014-08-31 15:38:18 +02:00
wernsaar 29125864b3 updated gemm.c 2014-08-23 17:28:01 +02:00
wernsaar 55e81da379 added axpy benchmark-test 2014-08-23 13:12:44 +02:00
wernsaar 53ec5789e2 bugfix for Makefile 2014-08-22 17:02:55 +02:00
wernsaar 7e404de3de bugfix in Makefile 2014-08-22 11:51:30 +02:00
wernsaar e4472ad850 added sdot and ddot benchmarks 2014-08-22 11:42:07 +02:00
wernsaar fb0b4552a5 added hemv benchmark 2014-08-22 10:00:09 +02:00
wernsaar 6f73ffc114 added benchmarks for csymv and zsymv 2014-08-21 19:33:57 +02:00
wernsaar f5f50b3563 added benchmarks for lapack potrf, potrs and potri functions 2014-08-01 21:08:37 +02:00
wernsaar 651dd22d7d added benchmark program for lapack ?getri functions 2014-08-01 08:55:20 +02:00
wernsaar 7c611a2f95 bugfix for zgeev 2014-07-31 12:35:38 +02:00
wernsaar 296564e369 added lapack geev benchmark 2014-07-31 10:35:25 +02:00
wernsaar 799a0eabbd bugfix in cholesky.c 2014-07-30 14:00:19 +02:00
wernsaar ca63503e61 extented plot-filter.sh for linpack and cholesky benchmarks 2014-07-30 13:03:42 +02:00
wernsaar e213a42cde added a sample plot-filter scripts and a header file for gnuplot 2014-07-21 14:50:24 +02:00
wernsaar e4663be46a added symv benchmark 2014-07-21 07:50:54 +02:00
wernsaar 11637b6926 add benchmark for ger 2014-07-21 06:25:42 +02:00
wernsaar bcb115b55b added benchmark for gemv 2014-07-15 13:35:36 +02:00