Commit Graph

7695 Commits

Author SHA1 Message Date
Martin Kroeker
425bcc1f8b Merge pull request #4256 from ChipKerchner/fixBfloat16BitsStruct
Fix bfloat16_bits union so that it always the sizeof unsigned short for AIX.
2023-10-12 22:01:50 +02:00
Martin Kroeker
789cdcc94f Merge pull request #4259 from martin-frbg/azureosxclang
AzureCI: move OSX-Clang jobs to macOS-12 to resolve setup/build timeouts
2023-10-12 20:04:28 +02:00
Rajalakshmi Srinivasaraghavan
9f42570e33 POWER: Increase macro size limit for AIX
This patch increases the macro size limit from 4096 to 16384 to
allow compiling larger assembly files in AIX.
Tested with GCC and IBM Open XL C.
2023-10-12 12:37:40 -05:00
Martin Kroeker
9f49aef91b Merge pull request #4255 from RajalakshmiSR/AIX-P10
POWER10: Fix compilation issues with Open XL C
2023-10-12 18:59:17 +02:00
Martin Kroeker
fe75c88a2c AzureCI: move OSX-Clang jobs to macOS-12 to resolve setup/build timeouts 2023-10-12 18:20:09 +02:00
Chip-Kerchner
d46eba06a7 Pack structure only on AIX. 2023-10-12 09:41:33 -05:00
Martin Kroeker
90231bfc4e Merge pull request #4258 from martin-frbg/issue4257
Fix build on Fujitsu A64FX
2023-10-12 16:38:28 +02:00
Martin Kroeker
e7d05402e0 Fix up S/D GEMM copy function definitions after #4009 2023-10-12 14:24:53 +02:00
Chip-Kerchner
e98e3c4783 Fix float32_bits union so that it always the sizeof float. 2023-10-11 18:05:55 -05:00
Chip-Kerchner
97a61d0577 Fix bfloat16_bits union so that it always the sizeof unsigned short. 2023-10-11 17:36:43 -05:00
Rajalakshmi Srinivasaraghavan
71d733e5f7 POWER: Avoid m4 conversions for C files
This patch removes intermediate m4 conversions used in sbgemm
compilation as it is not needed for .c files.
Tested on AIX with gcc and IBM Open XL C.
2023-10-11 17:18:42 -05:00
Rajalakshmi Srinivasaraghavan
82fc29a57a POWER10: Fallback to POWER8 functions
As cgemm and zgemm kernels are not optimized for big endian falling
back to POWER8 versions.  Tested on AIX using gcc and Open XL C.
2023-10-11 17:04:42 -05:00
Martin Kroeker
bf3183d31d Merge pull request #4253 from martin-frbg/issue4239-2
Require "classic ld" with XCODE 15.x on Mac
2023-10-10 18:44:08 +02:00
Martin Kroeker
103d6f4e42 Require "classic ld" with XCODE 15.x on Mac 2023-10-10 16:15:52 +02:00
Martin Kroeker
4a0f86397b Merge pull request #4235 from angsch/develop
Fix division by zero in [z]rotg
2023-10-09 08:43:42 +02:00
Martin Kroeker
617294b9e4 Merge pull request #4251 from martin-frbg/4142-2
Correct function prototypes in f2c-converted sources (lapack-netlib and ctest)
2023-10-08 18:11:12 +02:00
Martin Kroeker
c5e7339c9e correct prototypes for INTERFACE64 builds 2023-10-08 16:13:37 +02:00
Martin Kroeker
d8126c76e7 fix prototype 2023-10-08 13:38:39 +02:00
Martin Kroeker
769a58e9d1 fix prototypes of stest and itest1 for INTERFACE64 2023-10-08 12:51:41 +02:00
Martin Kroeker
c30b530878 fix prototypes of ctest and itest for INTERFACE64 2023-10-08 11:59:19 +02:00
Martin Kroeker
2b865da730 fix prototypes of stest and ctest for INTERFACE64 2023-10-08 11:55:10 +02:00
Martin Kroeker
65bfe1a06d fix prototype of itest1 for INTERFACE64 2023-10-08 11:36:06 +02:00
Martin Kroeker
1806cfecbc fix function prototypes in f2c-converted files 2023-10-07 22:38:30 +02:00
Martin Kroeker
281f1e4432 fix function prototypes in f2c-converted files 2023-10-07 22:36:29 +02:00
Martin Kroeker
4041b7fb42 fix function prototypes in f2c-converted files 2023-10-07 22:33:08 +02:00
Martin Kroeker
b626544ca3 complete function prototypes and remove unused functions 2023-10-07 22:31:03 +02:00
Chip-Kerchner
880af052dd Fix dynamic dispatch P9 for clang. 2023-10-06 13:41:49 -05:00
Chip-Kerchner
3655632611 Another small change. 2023-10-06 13:11:40 -05:00
Chip-Kerchner
36e08f6994 One more small change. 2023-10-06 13:08:41 -05:00
Chip-Kerchner
298bf1f240 Reduce differences. 2023-10-06 12:50:28 -05:00
Chip-Kerchner
71c6689af4 Fix dynamic dispatch to work for clang. 2023-10-06 12:20:40 -05:00
Chip-Kerchner
c60f9d9c08 Add missing CPU_POWER5. 2023-10-06 09:49:17 -05:00
Martin Kroeker
974cd11834 Merge pull request #4249 from RajalakshmiSR/clang_fix
powerpc: Fix build errors with Open XL C
2023-10-05 13:25:13 +02:00
Rajalakshmi Srinivasaraghavan
db0805906b powerpc: Fix build errors with Open XL C
This patch fixes errors when using Open XL C compiler on AIX.
Tested with gcc/xlf and ibm-clang/xlf compiler combinations.
2023-10-04 14:04:03 -05:00
Chip Kerchner
3cc72a3797 Only include cpu_id and cpu_supports in AIX and fix parameter types. 2023-10-04 09:54:37 -05:00
Chip Kerchner
7f7483ff16 Merge branch 'XLC-AIX' of github.ibm.com:PowerAppLibs/openblas into XLC-AIX 2023-10-03 12:28:26 -05:00
Chip-Kerchner
09212f84bf Fix default case for cpu_is. 2023-10-03 12:23:21 -05:00
Chip Kerchner
1c4ec47ed0 Merge branch 'XLC-AIX' of github.ibm.com:PowerAppLibs/openblas into XLC-AIX 2023-10-03 10:30:46 -05:00
Chip-Kerchner
2d0b233425 Fix missing parens. 2023-10-03 10:26:14 -05:00
Chip-Kerchner
a8c90eb3ed Added cpu_is 2023-10-03 10:24:04 -05:00
Chip Kerchner
4dc435bd10 Merge branch 'XLC-AIX' of github.ibm.com:PowerAppLibs/openblas into XLC-AIX 2023-10-02 13:30:22 -05:00
Chip-Kerchner
b677d0d5fd Adding missing endif 2023-10-02 13:09:12 -05:00
Chip-Kerchner
e5dc376912 Remove duplicate defines. 2023-10-02 12:48:47 -05:00
Chip-Kerchner
10210748de Revert PGI changes. 2023-10-02 12:44:07 -05:00
Chip-Kerchner
a922a07e61 Cleanup white spaces. 2023-10-02 12:24:30 -05:00
Chip-Kerchner
12130ee961 Remove tab. 2023-10-02 12:19:22 -05:00
Chip-Kerchner
eb738d9929 Minor changes. 2023-10-02 12:14:46 -05:00
Chip-Kerchner
48da98b2a7 Merge remote-tracking branch 'origin/develop' into XLC-AIX 2023-10-02 12:01:33 -05:00
Chip-Kerchner
3b1150fcee Fix CPU identification to work on AIX. 2023-10-02 12:00:48 -05:00
Chip Kerchner
af3e172353 Merge branch 'develop' into XLC-AIX 2023-10-02 07:09:04 -05:00