Zhang Xianyi
458e3af5b1
Merge pull request #1555 from WestAlgo/develop
...
Change _STDC_VERSION__ to __STDC_VERSION__
2018-05-11 12:25:24 +08:00
Zhiyong Dang
3716267124
Change _STDC_VERSION__ to __STDC_VERSION__
...
Change-Id: Id3fa4e8d9eedd4ef7230df69b611e7f397301a42
2018-05-11 12:15:08 +08:00
Zhang Xianyi
50acc40613
Merge pull request #1536 from WestAlgo/develop
...
Fix race condition in blas_server_omp.c
2018-05-11 10:09:14 +08:00
Martin Kroeker
c720f1f019
Merge pull request #1554 from martin-frbg/lapack-249
...
LAPACKE fixes from lapack PR249
2018-05-10 15:32:08 +02:00
Martin Kroeker
d7d950fcf2
LAPACKE fixes from lapack PR249
...
Copied from Reference-LAPACK/lapack#249 , this fixes out-of-bounds memory accesses
in the nancheck calls of the LAPACKE lacgv, lassq,larfg,larfb,larfx and mtr functions
2018-05-10 13:15:42 +02:00
Martin Kroeker
12398e53ce
Merge pull request #1553 from martin-frbg/ifort-openmpflag
...
Change -openmp to -fopenmp for ifort entry as well
2018-05-09 14:39:52 +02:00
Martin Kroeker
193f835662
Change -openmp to -fopenmp for ifort entry as well
2018-05-09 12:34:09 +02:00
Martin Kroeker
7e3151ead7
Merge pull request #1551 from martin-frbg/f_check_fix
...
Fixes for ifort 2018
2018-05-09 09:02:52 +02:00
Martin Kroeker
e3a069f108
Merge pull request #1550 from martin-frbg/ifort-openmpflag
...
Update compiler flag for openmp use with ICC
2018-05-09 09:02:38 +02:00
Martin Kroeker
6fff8c626a
Merge pull request #1549 from martin-frbg/fix_ompcheck
...
Drop C-style "L" suffx from OPENMP version number tests in the LAPACK source
2018-05-08 23:52:55 +02:00
Martin Kroeker
d2b9389f1b
Fixes for ifort 2018
...
1. the already deprecated -openmp option was removed in 2018, switch to -fopenmp
2. add leading blank in search for "zho_ge__" symbol to work around misleading tags in the 2018 assembly
Expected to fix #1548
2018-05-08 21:55:37 +02:00
Martin Kroeker
65b8a5c5d8
Update compiler flag for openmp use with ICC
...
The deprecated -openmp option was finally removed in favor of -qopenmp or -fopenmp, picking the latter to stay compatible with Intel compiler versions before 2015 (when -q options were introduced). Fixes #1546
2018-05-08 21:47:10 +02:00
Martin Kroeker
9795adc7ef
Drop C-style "L" suffix from OPENMP version number in check
2018-05-08 21:39:42 +02:00
Martin Kroeker
1a8e487c4a
Drop C-style "L" suffix from OPENMP version number in check
2018-05-08 21:38:25 +02:00
Martin Kroeker
5966fd52a2
Drop C-style "L" suffix from OPENMP version number in check
2018-05-08 21:36:56 +02:00
Martin Kroeker
dbafe6357b
Merge pull request #1543 from martin-frbg/mips32
...
Fix MIPS32 build and add MIPS 1004K cpu (MT7621 SOC)
2018-05-02 22:47:45 +02:00
Martin Kroeker
71051259e0
Restore compiler options for mips P5600 target
2018-05-02 20:37:06 +02:00
Martin Kroeker
73cc321190
Add MIPS 1004K target
2018-05-02 20:27:56 +02:00
Martin Kroeker
018f2dad27
Switch mips32 target to USE_TRMM to fix complex TRMM
2018-05-02 20:25:32 +02:00
Martin Kroeker
9d5098dbc9
Add MIPS 1004K target (Mediatek MT7621 SOC)
2018-05-02 20:20:44 +02:00
Martin Kroeker
d94d7baf7e
Add mips32r2 api target
2018-05-02 20:17:26 +02:00
Martin Kroeker
3af1b5c805
Make cpuid_mips compile again and add 1004K cpu
2018-05-02 20:12:25 +02:00
Martin Kroeker
88e224f4c0
Merge pull request #1542 from martin-frbg/quickdiv64
...
Avoid out-of-bounds accesses in blas_quickdivide on big X86 systems
2018-05-02 18:11:50 +02:00
Martin Kroeker
d0c0506588
Omit the divide table overflow check on small systems
2018-05-02 14:44:50 +02:00
Martin Kroeker
e93355e5e1
Omit the table overflow check when building for small systems
2018-05-02 14:43:08 +02:00
Martin Kroeker
c1eb06e102
Update common_x86_64.h
2018-04-29 14:40:12 +02:00
Martin Kroeker
8145ecd70b
Avoid out-of-bounds reads from blas_quick_divide_table on big systems
2018-04-29 14:38:55 +02:00
Martin Kroeker
26ce518d46
Avoid out of bounds reads from blas_quick_divide_table on big systems
...
Should fix #1541
2018-04-29 14:34:33 +02:00
Martin Kroeker
1d27fa8507
Merge pull request #1539 from martin-frbg/ztrmv-1332
...
Disable multithreading in ztrmv
2018-04-27 23:10:21 +02:00
Martin Kroeker
802cf6b22d
Merge pull request #1486 from martin-frbg/atomic
...
Use _Atomic instead of volatile for thread safety where C11 is supported
2018-04-27 23:09:57 +02:00
Martin Kroeker
894433a7c7
Update Makefile.rule
2018-04-27 12:08:06 +02:00
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
...
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
2018-04-27 17:00:42 +08:00
Martin Kroeker
954f1832de
Merge pull request #1540 from martin-frbg/mips32-zasum
...
Fix typo in MIPS P5600 complex ASUM code selection
2018-04-25 23:23:00 +02:00
Martin Kroeker
941ad280a8
Fix typo in MIPS P5600 complex ASUM code selection
2018-04-25 22:50:10 +02:00
Martin Kroeker
a8ed428bab
Disable multithreading in ztrmv
...
BLAS-Tester shows that the same problem exists as with DTRMV (issue #1332 )
2018-04-25 22:35:46 +02:00
Martin Kroeker
1da365312a
Merge pull request #1538 from martin-frbg/arm7utest
...
Fix handling of zero INCX, INCY in ArmV7 AXPY and ROT
2018-04-25 08:38:58 +02:00
Martin Kroeker
2d0929fa7c
Move the test for zero incx,incy in ARMV7 ROT
...
to pass the related utest (see #1469 )
2018-04-24 22:43:00 +02:00
Martin Kroeker
125343cc88
Drop test for zero incx,incy in armv7 AXPY
...
...to pass the related utest (see #1469 )
2018-04-24 22:39:50 +02:00
Martin Kroeker
8a3b6fa108
Use generic zrot.c on ppc64/POWER6 to work around utest failure from … ( #1535 )
...
* Use generic C implementation of zrot on ppc64/POWER6 to work around utest failure from #1469
2018-04-23 19:05:49 +02:00
Martin Kroeker
78694f1b7e
Merge pull request #1534 from xianyi/revert-1333-haswell32
...
Revert "Fix 32bit HASWELL builds"
2018-04-22 23:34:17 +02:00
Martin Kroeker
9c5518319a
Revert "Fix 32bit HASWELL builds"
2018-04-22 20:20:04 +02:00
Martin Kroeker
86f49c529d
Merge pull request #1532 from martin-frbg/utest-cblas
...
Do not try to build the fork utest when NO_CBLAS=1
2018-04-20 23:44:15 +02:00
Martin Kroeker
625c74a38f
fork utest depends on CBLAS
2018-04-20 15:43:59 +02:00
Martin Kroeker
5fcaca6438
fork utest depends on CBLAS
2018-04-20 15:42:13 +02:00
Martin Kroeker
4fcdd24459
Merge pull request #1530 from ashwinyes/develop_20180419_Tx2AutoDetect
...
ARM64: Enable Auto Detection of ThunderX2T99
2018-04-19 14:10:57 +02:00
Ashwin Sekhar T K
68a3c4fca6
ARM64: Enable Auto Detection of ThunderX2T99
2018-04-19 09:05:25 +00:00
Martin Kroeker
0c4718c57a
Merge pull request #1523 from martin-frbg/utest_waith
...
Include sys/types.h for proper typedefs related to wait()
2018-04-15 13:09:30 +02:00
Martin Kroeker
f29389c7ac
Merge pull request #1520 from martin-frbg/cpucounts
...
Catch invalid cpu count returned by CPU_COUNT_S
2018-04-14 22:24:34 +02:00
Martin Kroeker
734d7c6a93
Include sys/types.h for proper typedefs related to wait()
...
Should fix #1519
2018-04-14 18:59:46 +02:00
Martin Kroeker
7c861605b2
Catch invalid cpu count returned by CPU_COUNT_S
...
mips32 was seen to return zero here, driving nthreads to zero with subsequent fpe in blas_quickdivide
2018-04-14 18:29:10 +02:00