Martin Kroeker
|
e9f480111e
|
fix sbgemm bfloat16 conversion errors introduced in PR 4488
|
2024-02-07 19:57:18 +01:00 |
Martin Kroeker
|
fb99fc2e6e
|
fix type conversion warnings
|
2024-02-07 13:42:08 +01:00 |
Chip-Kerchner
|
d46eba06a7
|
Pack structure only on AIX.
|
2023-10-12 09:41:33 -05:00 |
Chip-Kerchner
|
e98e3c4783
|
Fix float32_bits union so that it always the sizeof float.
|
2023-10-11 18:05:55 -05:00 |
Chip-Kerchner
|
97a61d0577
|
Fix bfloat16_bits union so that it always the sizeof unsigned short.
|
2023-10-11 17:36:43 -05:00 |
Martin Kroeker
|
3d338b57de
|
remove spurious loops
|
2022-08-08 17:09:45 +02:00 |
Martin Kroeker
|
d9dc015cfc
|
Use blasint for INTERFACE64 compatibility
|
2022-08-08 14:52:10 +02:00 |
Rajalakshmi Srinivasaraghavan
|
1d97405c02
|
POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
|
2022-07-28 07:43:53 -05:00 |
Martin Kroeker
|
924fd806d0
|
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
|
2020-10-11 23:43:36 +02:00 |
Martin Kroeker
|
4db09c6cec
|
Rename compare_sgemm_shgemm.c to compare_sgemm_sbgemm.c
|
2020-10-11 23:42:45 +02:00 |