s00548429
bec7923a0d
Fix the functional bugs for zamax.
2020-03-09 15:36:50 +08:00
s00548429
c5bdd21352
Add benchmark for ?amax, ?max, ?amin, ?min, i?max, i?amin and i?min.
2020-03-09 14:59:03 +08:00
njutcz
d2d16d091e
Merge pull request #1 from xianyi/develop
...
update
2020-03-09 10:39:40 +08:00
Martin Kroeker
b6a6ccbbea
Merge pull request #2495 from ZuoQ3/develop
...
add benchmark for axpby test
2020-03-08 08:09:58 +01:00
Martin Kroeker
8b720f7365
Merge pull request #2494 from shengyang-3390/develop
...
add benchmark for csrot and zdrot
2020-03-07 23:04:21 +01:00
Martin Kroeker
14df234edb
Merge pull request #2489 from jijiwawa/brightness
...
Remove redundant code
2020-03-07 22:26:00 +01:00
s00527847
bbeda55b7b
add trmm.c
2020-03-07 13:09:19 -05:00
s00527847
efcf89aec7
Remove redundant code
2020-03-07 12:03:05 -05:00
Martin Kroeker
37d456f7e0
Merge pull request #2493 from martin-frbg/plainmake
...
Fix use of make vs $(MAKE) in building lapack-testing
2020-03-07 16:55:53 +01:00
Martin Kroeker
0b9e96922b
Merge pull request #2488 from liujingjue/develop
...
Modify the main Makefile in OpenBLAS
2020-03-07 16:52:29 +01:00
zq
0c8162eba6
Add benchmark file axpby.c and modify benchmark/Makefile to test s/d/c/zaxpby
2020-03-07 17:48:55 +08:00
zq
9a94a30132
Merge pull request #1 from xianyi/develop
...
update
2020-03-07 17:04:59 +08:00
shengyang
09c7a191bd
add benchmark for csrot and zdrot
...
modified: benchmark/Makefile
modified: benchmark/rot.c
2020-03-07 15:17:49 +08:00
l00546269
8a8df530e2
[OpenBLAS]:modifed the Makefile
...
[Description]: check the compiler version and show the detail info
2020-03-07 10:14:33 +08:00
Martin Kroeker
37f46f2fa0
Fix another spot where make was used instead of $(MAKE)
...
Broke lapack-testing on BSD as their default "make" does not support GNU Makefile syntax
2020-03-06 15:37:26 +01:00
Martin Kroeker
9afc561be4
Merge pull request #36 from xianyi/develop
...
rebase
2020-03-06 15:32:27 +01:00
Martin Kroeker
dca3e0cf20
Merge pull request #2491 from chenxuqiang/hbmv_benchmark
...
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
2020-03-06 15:06:42 +01:00
Martin Kroeker
c9f8db979b
Merge pull request #2490 from shengyang-3390/develop
...
Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
2020-03-06 15:05:55 +01:00
Martin Kroeker
18099de976
Merge pull request #2487 from jijiwawa/develop
...
add benchmark for spr/spr2
2020-03-06 14:42:25 +01:00
Martin Kroeker
97c36ca58c
Merge branch 'develop' into develop
2020-03-06 14:41:40 +01:00
Martin Kroeker
9f5a74f3c7
Merge pull request #2486 from qqqil/develop
...
add benchmark for trsv
2020-03-06 14:30:09 +01:00
Martin Kroeker
2afb10975d
Merge pull request #2485 from Darkness303/develop
...
Add syr2 benchmark
2020-03-06 14:29:27 +01:00
Martin Kroeker
dbef479227
Merge pull request #2469 from AGSaidi/acq-rel-2
...
Use acq/rel semantics to pass flags/pointers in getrf_parallel.
2020-03-06 14:28:58 +01:00
Ali Saidi
208c7e7ca5
Use acq/rel semantics to pass flags/pointers in getrf_parallel.
...
The current implementation has locks, but the locks each only
have a critical section of one variable so atomic reads/writes
with barriers can be used to achieve the same behavior.
Like the previous patch, pthread_mutex_lock isn't fair, so in a
tight loop the previous thread that has the lock can keep it
starving another thread, even if that thread is about to write
the data that will stop the current thread from spinning.
On a 64c Arm system this improves performance by 20x on sgesv.goto.
2020-03-06 06:22:31 +00:00
chenxuqiang
32c847df45
benchmark/hpmv&hbmv: add benchmark/hpmv.c and benchmark/hbmv.c
...
Signed-off-by: Xuqiang Chen chenxuqiang3@hisilicon.com
2020-03-06 01:02:02 -05:00
shengyang
e0df9485d4
Add benchmark file rotm.c and modify benchmark/Makefile to test s/drotm
...
modified: benchmark/Makefile
new file: benchmark/rotm.c
2020-03-05 10:05:59 +08:00
s00527847
0f1a2b12f9
add benchmark for spr/spr2
2020-03-04 15:50:19 -05:00
q00437336
233838b4bc
change clock to CLOCK_PROCESS_CPUTIME_ID
2020-03-04 03:54:40 -05:00
l00546269
13f9afbd99
[OpenBLAS]:modifed the Makefile
...
[Description]:add c/fortran compiler version information in final note
2020-03-04 16:47:23 +08:00
q00437336
de74e11641
add benchmark for trsv
2020-03-04 03:23:22 -05:00
Martin Kroeker
ad9e53154d
Merge pull request #2484 from RajalakshmiSR/power-dynamic
...
Fix DYNAMIC_ARCH build for POWER9
2020-03-04 08:06:06 +01:00
Martin Kroeker
6e70621b0d
Merge pull request #2483 from aaawuanjun/develop
...
Add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv
2020-03-04 07:59:56 +01:00
Martin Kroeker
e6edb7431f
Merge pull request #2466 from AGSaidi/acq-rel-1
...
Switch blas_server to use acq/rel semantics
2020-03-04 07:59:31 +01:00
Darkness303
114dbec947
1.Add syr2 benchmark
...
2.Fixed some errors
2020-03-04 14:09:10 +08:00
Martin Kroeker
d68e4ba59b
Fix cut/paste glitch
2020-03-03 21:37:48 +01:00
Martin Kroeker
635c9e4e09
Restore initializers for mutex and conditional
2020-03-03 21:04:12 +01:00
Rajalakshmi Srinivasaraghavan
2afc074803
Fix DYNAMIC_ARCH build for POWER9
...
Setting DYNAMIC_ARCH=1 on POWER9 does not build POWER9 files due to some
compiler version checks. This patch fixes some of the macros that are used
to check compiler version. On fixing those checks, there are some new make
failures related to icamin, icamax, isamin, isamax and caxpy files on POWER9.
This patch fixes those failures as well.
2020-03-03 12:35:10 -06:00
wuanjun 00447568
5d6c688a7e
Merge branch 'develop' of https://github.com/aaawuanjun/OpenBLAS into develop
2020-03-03 19:03:57 +08:00
wuanjun 00447568
87baf9cfe6
Merge branch 'develop' of https://github.com/aaawuanjun/OpenBLAS into develop
2020-03-03 19:03:28 +08:00
wuanjun 00447568
c0ca7d6258
Merge branch 'develop' of https://github.com/aaawuanjun/OpenBLAS into develop
2020-03-03 17:39:26 +08:00
wuanjun 00447568
f682d19ed4
[OpenBlas]: add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv
2020-03-03 17:37:33 +08:00
wuanjun 00447568
790d50fbba
[OpenBlas]: add benchmark file trmv.c and modify benchmark/Makefile to test s/d/c/ztrmv
2020-03-03 17:13:49 +08:00
Martin Kroeker
59243d49ab
Merge pull request #2479 from Darkness303/develop
...
Fix potential index overflows at large matrix sizes in the benchmark codes
2020-03-03 08:46:49 +01:00
Martin Kroeker
d41f83e128
Merge pull request #2436 from marxin/improve-utest-coverage
...
Improve test coverage for utests.
2020-03-03 08:43:00 +01:00
Martin Kroeker
ee4ca7ca6b
Merge pull request #2481 from ChinouneMehdi/fix2480
...
Fix #2480
2020-03-02 21:21:29 +01:00
Martin Kroeker
e326c89ae8
Merge pull request #2478 from MacChen02/develop
...
Update benchmark statistical time function
2020-03-02 21:20:51 +01:00
مهدي شينون (Mehdi Chinoune)
21f6c4b5a9
fixes #2480
2020-03-02 17:22:28 +01:00
Martin Liska
7ca4ffdbdd
Improve test coverage for utests.
2020-03-02 13:38:17 +01:00
jianghesong
0f65c05cd1
fix core dumped error
2020-03-02 19:13:45 +08:00
MacChen02
917d243580
Update benchmark statistical time function
...
The function gettimeofday does not count the time,when testing the axpy small data volume use case.
Use the function clock_gettime to replace the gettimeofday function to count the time.
2020-03-02 14:36:27 +08:00