Danila Kutenin 49eb18ee4f Optimize vectorscan for aarch64 by using shrn instruction
This optimization is based on the thread
https://twitter.com/Danlark1/status/1539344279268691970 and uses
shift right and narrow by 4 instruction https://developer.arm.com/documentation/ddi0596/2020-12/SIMD-FP-Instructions/SHRN--SHRN2--Shift-Right-Narrow--immediate--

To achieve that, I needed to redesign a little movemask into comparemask
and have an additional step towards mask iteration. Our benchmarks
showed 10-15% improvement on average for long matches.
2022-06-26 22:55:45 +00:00
..
2015-10-20 09:13:35 +11:00
2015-10-20 09:13:35 +11:00
2017-01-17 11:38:09 +11:00
2021-01-25 14:13:13 +02:00
2021-02-15 13:54:19 +02:00
2021-02-15 13:54:19 +02:00
2015-10-20 09:13:35 +11:00
2015-10-20 09:13:35 +11:00
2015-10-20 09:13:35 +11:00
2022-02-18 17:14:51 +00:00
2021-10-12 11:51:35 +03:00
2015-10-20 09:13:35 +11:00
2015-10-20 09:13:35 +11:00