Chip Kerchner
|
39fd29f1de
|
Minor improvement and turn off BF16 GEMV forwarding by default.
|
2024-09-08 18:28:31 -05:00 |
Chip Kerchner
|
76227e2948
|
Initial commit for vectorized BF16 GEMV. Added GEMM_GEMV_FORWARD_BF16 to enable using BF16 GEMV for one-dimensional matrices. Updated unit test to support inc_x != 1 or inc_y != 1 for GEMV.
|
2024-09-06 14:03:31 -05:00 |
Chip Kerchner
|
89702e1f4a
|
Fix zero element GEMV test.
|
2024-08-16 11:37:39 -05:00 |
Chip Kerchner
|
77f85c7c00
|
GEMV tests don't like zero elements.
|
2024-08-16 11:15:32 -05:00 |
Chip Kerchner
|
868aa857bc
|
Change malloc zero to return one byte and update the SBGEMM test to again use sizes of zero.
|
2024-08-16 10:28:10 -05:00 |
Chip Kerchner
|
b1802f4dc8
|
Fix unit test to start at 1 instead of 0 - since malloc zero bytes fails on some systems.
|
2024-08-16 09:51:37 -05:00 |
Chip Kerchner
|
c23897f585
|
Add GEMV testing to SBGEMx vs SGEMx testing.
|
2024-08-14 15:55:23 -05:00 |
Chip Kerchner
|
31226740d6
|
Cleanup of SBGEMM unit test.
|
2024-08-14 08:10:25 -05:00 |
Chip Kerchner
|
b1737698db
|
Fix DEFAULTS in SBGEMM for POWER10. Also comparisons for SBGEMM unit test can be exactly due to epsilon differences.
|
2024-08-13 07:01:21 -05:00 |
Chip Kerchner
|
f708944fea
|
Add all 4 variations of the SBGEMM to compare_sgemm_sbgemm
|
2024-07-10 13:07:48 -05:00 |
Martin Kroeker
|
e9f480111e
|
fix sbgemm bfloat16 conversion errors introduced in PR 4488
|
2024-02-07 19:57:18 +01:00 |
Martin Kroeker
|
fb99fc2e6e
|
fix type conversion warnings
|
2024-02-07 13:42:08 +01:00 |
Chip-Kerchner
|
d46eba06a7
|
Pack structure only on AIX.
|
2023-10-12 09:41:33 -05:00 |
Chip-Kerchner
|
e98e3c4783
|
Fix float32_bits union so that it is always the sizeof float.
|
2023-10-11 18:05:55 -05:00 |
Chip-Kerchner
|
97a61d0577
|
Fix bfloat16_bits union so that it is always the sizeof unsigned short.
|
2023-10-11 17:36:43 -05:00 |
Martin Kroeker
|
3d338b57de
|
remove spurious loops
|
2022-08-08 17:09:45 +02:00 |
Martin Kroeker
|
d9dc015cfc
|
Use blasint for INTERFACE64 compatibility
|
2022-08-08 14:52:10 +02:00 |
Rajalakshmi Srinivasaraghavan
|
1d97405c02
|
POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
|
2022-07-28 07:43:53 -05:00 |
Martin Kroeker
|
924fd806d0
|
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
|
2020-10-11 23:43:36 +02:00 |
Martin Kroeker
|
4db09c6cec
|
Rename compare_sgemm_shgemm.c to compare_sgemm_sbgemm.c
|
2020-10-11 23:42:45 +02:00 |