Commit Graph

60 Commits

Author SHA1 Message Date
Martin Kroeker 2a9981a244
Add -lgomp when IBM xlf is combined with gcc in OPENMP builds 2023-09-24 10:19:11 +02:00
Martin Kroeker 44e6e5479b
Use the C compiler for the C SBGEMM test source 2023-09-21 23:01:21 +02:00
Aiden Grossman b209915121 Fix build with clang
There are two instances when building the tests where OpenBLAS fails to
build with OpenMP and clang due to library paths getting reset as flags
are set rather than appended. This seems to only affect certain
clang/libomp installations, but if it's already grabbing the correct
library paths we might as well use them.
2023-07-28 12:59:44 -07:00
Martin Kroeker 3d338b57de
remove spurious loops 2022-08-08 17:09:45 +02:00
Martin Kroeker d9dc015cfc
Use blasint for INTERFACE64 compatibility 2022-08-08 14:52:10 +02:00
Rajalakshmi Srinivasaraghavan 1d97405c02 POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
2022-07-28 07:43:53 -05:00
Martin Kroeker 74dba0c90a
Fix missing braces from previous commit (PR3600) 2022-04-09 20:03:36 +02:00
Martin Kroeker bef3871833
Disable flang (over-)optimizations in BLAS tests (#3600)
* limit flang optimizations to -O2
2022-04-09 17:14:24 +02:00
Martin Kroeker 145ccf8943
Prevent powershell from adding a BOM to test input (#3595)
* Prevent addition of a BOM to test input (which would distort the names of output files)
2022-04-07 14:24:19 +02:00
Martin Kroeker 2408315d10
Skip tests if Windows powershell added a BOM 2022-03-22 21:37:55 +01:00
Martin Kroeker 5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:46:19 +02:00
Martin Kroeker b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 2021-07-30 14:21:08 +02:00
Martin Kroeker 3cfdb1770c
Remove code that disabled EXTRALIB on RISCV C910V 2021-07-06 20:21:07 +02:00
Martin Kroeker 30d835168a
Merge pull request #3088 from xoviat/msvc
add misc fixes.
2021-03-14 17:14:28 +01:00
Martin Kroeker 6ae7af78a3
Support compilation with nagfor 2021-03-11 11:53:51 +01:00
xoviat 3165c915b6 fix test helpers 2021-01-27 15:24:49 -06:00
xoviat 457ccc42c9
Merge branch 'develop' into msvc 2021-01-27 14:15:59 -06:00
Martin Kroeker 2b114c3f30
Restore proper Makefile 2020-11-22 17:16:22 +01:00
Martin Kroeker 65eb7afaf4
Change ifndef CROSS to ifneq 2020-11-22 16:25:36 +01:00
damonyu ef8e7d0279 Add the support for RISC-V Vector.
Change-Id: Iae7800a32f5af3903c330882cdf6f292d885f266
2020-10-15 16:09:02 +08:00
Martin Kroeker c1f4f5d4e7
Replace Makefile with simplified version again 2020-10-14 01:08:50 +02:00
Martin Kroeker bc5c7f9578
Cleanup 2020-10-13 19:56:09 +02:00
Rajalakshmi Srinivasaraghavan b5d30b390d Fix build issues with bfloat16
This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16.
2020-10-13 11:00:22 -05:00
Martin Kroeker 5800758b43
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 2020-10-11 23:44:38 +02:00
Martin Kroeker 924fd806d0
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 2020-10-11 23:43:36 +02:00
Martin Kroeker 4db09c6cec
Rename compare_sgemm_shgemm.c to compare_sgemm_sbgemm.c 2020-10-11 23:42:45 +02:00
Martin Kroeker 5f23bdf437
remove debug output 2020-10-11 17:23:08 +02:00
Martin Kroeker 08f4749eb4
Adapt tests to having only a subset of types in the build 2020-10-11 14:25:24 +02:00
Martin Kroeker 0eaae30e8c
Adapt tests to having only a subset of types in the build 2020-09-22 23:28:03 +02:00
Martin Kroeker 593ce9e237
Make building individual tests depend on BUILD_SINGLE etc defines 2020-09-13 21:50:12 +02:00
Martin Kroeker 18a11137f1
Update BLAS tests to correspond to Reference-LAPACK 3.9.0
replaces calculation of machine precision with call to epsilon intrinsic and removes the requirement for previous output files to be removed before rerunning tests
2020-06-14 10:26:25 +02:00
Martin Kroeker 13c28889a2
Update "cosmetic fixes for non-C99 compilers" 2020-06-06 15:22:27 +02:00
Martin Kroeker 28915eed72
Cosmetic fixes for non-C99 compilers 2020-06-05 10:05:34 +02:00
Rajalakshmi Srinivasaraghavan 8efba9b7c0 Improve shgemm test
This patch adds another check to test shgemm results.
2020-05-11 17:15:10 -05:00
Rajalakshmi Srinivasaraghavan 564b0d39ef Add test for shgemm
This patch has Makefile changes to add test for shgemm which
compares sgemm and shgemm result.
2020-04-29 13:40:34 -05:00
Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS
This patch adds support for bfloat16 data type matrix multiplication kernel.
For architectures that don't support bfloat16, it is defined as unsigned short
(2 bytes).  Default unroll sizes can be changed as per architecture as done for
SGEMM and for now 8 and 4 are used for M and N.  Size of ncopy/tcopy can be
changed as per architecture requirement and for now, size 2 is used.

Added shgemm in kernel/power/KERNEL.POWER9 and tested in powerpc64le and
powerpc64.  For reference, added a small test compare_sgemm_shgemm.c to compare
sgemm and shgemm output.

This patch does not cover OpenBLAS test, benchmark and lapack tests for shgemm.
Complex type implementation can be discussed and added once this is approved.
2020-04-14 14:55:08 -05:00
Martin Kroeker 2d8781b0dc
Do not attempt to run test without fortran 2020-03-13 20:11:19 +01:00
xoviat dbf9ad1f3d tests: add windows compatibility 2019-05-05 13:09:39 -05:00
luz.paz daf2fec12d Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
2019-04-29 17:03:56 -04:00
Martin Kroeker 6a5ab083b7
Handle special case of gfortran+clang+OpenMP 2018-06-19 20:47:33 +02:00
Martin Kroeker 53026dc63a
Update single and double precision BLAS1 tests from LAPACK 3.8.0
adding tests for SROTMG, SROTM, SDSDOT, DROTMG, DROTM, DSDOT
2018-02-18 12:44:14 +01:00
Sacha Refshauge 4474465438 Remove _static usages for tests 2017-08-20 00:13:46 +10:00
Isuru Fernando d245caa49a Support out-of-source build 2017-08-01 15:16:14 +05:30
John Biddiscombe 053044ae4d Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
2016-05-25 09:13:28 +02:00
Aleksey Kuleshov 3d50ccdc0d allow building tests when CROSS compiling but don't run them 2016-04-26 12:36:47 +03:00
Zhang Xianyi aca7d7e953 Detect cmake test result. 2015-10-20 03:35:25 +08:00
Zhang Xianyi f8eba3d548 Fixed cmake build bugs on Linux. 2015-08-11 16:25:16 -05:00
wernsaar 9d7057366d bugfix for GEMM3M functions 2014-09-21 11:41:43 +02:00
wernsaar 7f234f8ed1 added GEMM3M tests 2014-09-21 10:55:08 +02:00
wernsaar d49fd33885 disabled SYMM3M and HEMM3M functions because segment violations 2014-09-20 15:27:40 +02:00