Files
vectorscan/src
Yoan Picchi 7054378c93 Speed up truffle with 256b TBL instructions
256b wide SVE vectors allow some simplification of truffle.
Up to 40% speedup on graviton3. Going from 12500 MB/s to 17000 MB/s
onhe microbenchmark.
SVE2 also offer this capability for 128b vector with a speedup around
25% compared to normal SVE

Add unit tests and benchmark for this wide variant

Signed-off-by: Yoan Picchi <yoan.picchi@arm.com>
2024-05-22 16:13:53 +00:00
..
2024-05-02 14:30:18 +03:00
2015-10-20 09:13:35 +11:00
2024-04-29 13:28:16 +03:00
2015-10-20 09:13:35 +11:00
2021-10-12 11:51:34 +03:00
2024-05-20 17:09:30 +03:00
2019-01-21 09:59:37 +08:00
2021-10-12 11:51:34 +03:00
2024-05-16 12:03:42 +03:00
2023-10-03 09:57:10 +03:00
2015-10-20 09:13:35 +11:00
2023-09-05 13:58:17 +03:00
2017-08-21 11:19:11 +10:00