Handling as 4x8 with vector pairs gives better performance than existing code in POWER10.
7.9 KiB
7.9 KiB
Handling as 4x8 with vector pairs gives better performance than existing code in POWER10.