Rajalakshmi Srinivasaraghavan
71d733e5f7
POWER: Avoid m4 conversions for C files
...
This patch removes intermediate m4 conversions used in sbgemm
compilation as it is not needed for .c files.
Tested on AIX with gcc and IBM Open XL C.
2023-10-11 17:18:42 -05:00
Rajalakshmi Srinivasaraghavan
82fc29a57a
POWER10: Fallback to POWER8 functions
...
As cgemm and zgemm kernels are not optimized for big endian falling
back to POWER8 versions. Tested on AIX using gcc and Open XL C.
2023-10-11 17:04:42 -05:00
Martin Kroeker
bf3183d31d
Merge pull request #4253 from martin-frbg/issue4239-2
...
Require "classic ld" with XCODE 15.x on Mac
2023-10-10 18:44:08 +02:00
Martin Kroeker
103d6f4e42
Require "classic ld" with XCODE 15.x on Mac
2023-10-10 16:15:52 +02:00
Martin Kroeker
4a0f86397b
Merge pull request #4235 from angsch/develop
...
Fix division by zero in [z]rotg
2023-10-09 08:43:42 +02:00
Martin Kroeker
617294b9e4
Merge pull request #4251 from martin-frbg/4142-2
...
Correct function prototypes in f2c-converted sources (lapack-netlib and ctest)
2023-10-08 18:11:12 +02:00
Martin Kroeker
c5e7339c9e
correct prototypes for INTERFACE64 builds
2023-10-08 16:13:37 +02:00
Martin Kroeker
d8126c76e7
fix prototype
2023-10-08 13:38:39 +02:00
Martin Kroeker
769a58e9d1
fix prototypes of stest and itest1 for INTERFACE64
2023-10-08 12:51:41 +02:00
Martin Kroeker
c30b530878
fix prototypes of ctest and itest for INTERFACE64
2023-10-08 11:59:19 +02:00
Martin Kroeker
2b865da730
fix prototypes of stest and ctest for INTERFACE64
2023-10-08 11:55:10 +02:00
Martin Kroeker
65bfe1a06d
fix prototype of itest1 for INTERFACE64
2023-10-08 11:36:06 +02:00
Martin Kroeker
1806cfecbc
fix function prototypes in f2c-converted files
2023-10-07 22:38:30 +02:00
Martin Kroeker
281f1e4432
fix function prototypes in f2c-converted files
2023-10-07 22:36:29 +02:00
Martin Kroeker
4041b7fb42
fix function prototypes in f2c-converted files
2023-10-07 22:33:08 +02:00
Martin Kroeker
b626544ca3
complete function prototypes and remove unused functions
2023-10-07 22:31:03 +02:00
Martin Kroeker
974cd11834
Merge pull request #4249 from RajalakshmiSR/clang_fix
...
powerpc: Fix build errors with Open XL C
2023-10-05 13:25:13 +02:00
Rajalakshmi Srinivasaraghavan
db0805906b
powerpc: Fix build errors with Open XL C
...
This patch fixes errors when using Open XL C compiler on AIX.
Tested with gcc/xlf and ibm-clang/xlf compiler combinations.
2023-10-04 14:04:03 -05:00
Martin Kroeker
6287a23fc5
Merge pull request #4246 from martin-frbg/issue4241
...
Improve function prototypes to fix strict-prototype warnings
2023-10-01 12:39:48 +02:00
Martin Kroeker
90f890ee67
fix improper function prototypes (empty parentheses) (USE_TLS branch)
2023-09-30 23:12:36 +02:00
Martin Kroeker
cf2174fb69
fix improper function prototypes (empty parentheses)
2023-09-30 17:04:39 +02:00
Martin Kroeker
2dba455d2e
revert accidental changes
2023-09-30 15:33:55 +02:00
Martin Kroeker
60ff5872af
fix improper function prototypes (empty parentheses)
2023-09-30 13:01:44 +02:00
Martin Kroeker
1d4aa8d7d5
fix improper function prototypes (empty parentheses)
2023-09-30 13:00:51 +02:00
Martin Kroeker
f4f31fb53b
fix improper function prototypes (empty parentheses)
2023-09-30 12:59:44 +02:00
Martin Kroeker
cd8ac192a9
fix improper function prototypes (empty parentheses)
2023-09-30 12:58:30 +02:00
Martin Kroeker
675cd551da
fix improper function prototypes (empty parentheses)
2023-09-30 12:56:38 +02:00
Martin Kroeker
13ba4edf43
fix function prototypes (empty parentheses)
2023-09-30 12:53:35 +02:00
Martin Kroeker
c6b1d8e7a3
fix improper function prototypes (empty parentheses)
2023-09-30 12:52:06 +02:00
Martin Kroeker
c4bd4a2e5d
fix improper function prototypes (empty parentheses)
2023-09-30 12:49:24 +02:00
Martin Kroeker
ccbb91e4a7
fix improper function prototypes (empty parentheses)
2023-09-30 12:46:34 +02:00
Martin Kroeker
189627709e
Merge pull request #4244 from XiWeiGu/loongarch64_fixed_dynamic_arch
...
Loongarch64 fixed dynamic arch
2023-09-30 10:16:43 +02:00
Martin Kroeker
5ac0ec70a1
Merge pull request #4245 from RajalakshmiSR/xlf_fix
...
powerpc: Fix build errors with xlf
2023-09-29 23:51:53 +02:00
Rajalakshmi Srinivasaraghavan
a11e1e10f4
powerpc: Fix build errors with xlf
...
This patch fixes errors when using xlf as fortran compiler on Linux.
Tested with gcc/xlf and clang/xlf compiler combinations.
2023-09-29 10:32:34 -05:00
gxw
211bb35dee
gh-actions: Adding DYNAMIC_ARCH test for LoongArch64
2023-09-27 11:08:03 +08:00
gxw
d15e0a055c
LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH
2023-09-27 10:05:27 +08:00
Martin Kroeker
e2ca22f8d8
Merge pull request #4242 from XiWeiGu/loongarch64_dtrsm
...
LoongArch64: Add dtrsm kernel
2023-09-26 19:21:48 +02:00
gxw
4670eb1462
LoongArch64: Add dtrsm kernel
2023-09-26 15:45:14 +08:00
Martin Kroeker
138ed79fe7
Merge pull request #4238 from martin-frbg/issue4237
...
Fix building on POWER with IBM XLF 16.1.1
2023-09-24 14:31:33 +02:00
Martin Kroeker
2a9981a244
Add -lgomp when IBM xlf is combined with gcc in OPENMP builds
2023-09-24 10:19:11 +02:00
Martin Kroeker
7a96908d0c
Add -lgomp when IBM xlf is combined with gcc in OPENMP builds
2023-09-24 10:18:24 +02:00
Martin Kroeker
4de963dc17
Enforce trailing underscores on symbols when IBM xlf is combined with gcc
2023-09-24 10:16:37 +02:00
Martin Kroeker
8012afcabb
Avoid using some gcc-specific flags with IBM xlf
2023-09-24 10:15:12 +02:00
Martin Kroeker
bb47183222
Force -qextname for trailing underscore generation when IBM xlf is used with gcc
2023-09-24 10:13:47 +02:00
Martin Kroeker
b926e70ebd
Fix typo in build rule of "profiled" sbgemm
2023-09-21 23:07:32 +02:00
Martin Kroeker
2390e0bfbc
Quote the BU (underscore) option as it may not be set
2023-09-21 23:04:25 +02:00
Martin Kroeker
44e6e5479b
Use the C compiler for the C SBGEMM test source
2023-09-21 23:01:21 +02:00
Martin Kroeker
48b1b7cbc9
Merge pull request #4233 from martin-frbg/issue4216
...
Fix handling of additional buffer structures in case of overflow
2023-09-21 11:12:52 +02:00
Martin Kroeker
bb90b6dfcd
Merge pull request #4157 from steppi/cirun
...
Set up arm64 graviton CI using cirun
2023-09-21 07:28:40 +02:00
Angelika Schwarz
db3a43c8ed
Simplify rotg
...
* The check da != ZERO is no longer necessary since there
is a special case ada == ZERO, where ada = |da|.
* Add the missing check c != ZERO before the division.
Note that with these two changes the long double code
follows the float/double version of the code.
2023-09-20 19:43:00 +02:00