Files
OpenBLAS/kernel
Rajalakshmi Srinivasaraghavan 0826d68f93 POWER10: Change the packing format for bfloat16
As the new MMA instructions need the inputs in 4x2 order for bfloat16,
changing the format in copy/packing code.  This avoids permute instructions
in the gemm kernel inner loop.
2020-10-13 16:05:10 -05:00
..
2019-03-30 22:18:03 +01:00
2020-04-19 15:44:30 +02:00
2019-03-30 22:25:06 +01:00
2020-08-13 06:18:00 +08:00
2020-10-13 11:00:22 -05:00
2020-10-13 11:00:22 -05:00