vectorscan/src/nfa
Danila Kutenin eb7b0bb50c Optimize vectorscan for aarch64 by using shrn instruction
This optimization is based on the thread
https://twitter.com/Danlark1/status/1539344279268691970 and uses the
shift-right-and-narrow-by-4 instruction (SHRN): https://developer.arm.com/documentation/ddi0596/2020-12/SIMD-FP-Instructions/SHRN--SHRN2--Shift-Right-Narrow--immediate--

To make this work, I had to rework movemask into comparemask and add an
extra step to mask iteration. Our benchmarks showed a 10-15% improvement
on average for long matches.
2022-06-26 22:55:45 +00:00
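
The idea in the commit message can be illustrated with a minimal, portable sketch. This is a scalar model written for this page, not the code from the commit. The function names `compare_eq`, `comparemask`, and `match_positions` are hypothetical. It assumes the usual form of the trick: a 16-byte compare result (each byte 0xFF or 0x00) is viewed as eight 16-bit lanes, each lane is shifted right by 4 and narrowed to 8 bits, and the resulting 64-bit value carries 4 bits per input byte. On aarch64 the narrowing step would likely map to a single NEON intrinsic such as `vshrn_n_u16`; here it is emulated with plain integer arithmetic so the sketch runs anywhere.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

// Scalar model of a 16-byte equality compare: each output byte is 0xFF where
// the input byte equals c and 0x00 elsewhere, like a SIMD byte compare.
std::array<uint8_t, 16> compare_eq(const std::array<uint8_t, 16> &data, uint8_t c) {
    std::array<uint8_t, 16> out{};
    for (std::size_t i = 0; i < 16; ++i) {
        out[i] = (data[i] == c) ? 0xFF : 0x00;
    }
    return out;
}

// Scalar model of "shift right and narrow by 4" (hypothetical helper name).
// The 16 bytes are read as 8 little-endian 16-bit lanes; each lane is shifted
// right by 4 and truncated to 8 bits. Input byte i ends up as nibble i of the
// 64-bit result, so a matching byte contributes 0xF at bits [4*i, 4*i + 4).
uint64_t comparemask(const std::array<uint8_t, 16> &cmp) {
    uint64_t mask = 0;
    for (std::size_t lane = 0; lane < 8; ++lane) {
        uint16_t wide = static_cast<uint16_t>(
            cmp[2 * lane] | (static_cast<uint16_t>(cmp[2 * lane + 1]) << 8));
        uint8_t narrow = static_cast<uint8_t>(wide >> 4);
        mask |= static_cast<uint64_t>(narrow) << (8 * lane);
    }
    return mask;
}

// The "additional step towards mask iteration": keep one bit per nibble so
// that clearing the lowest set bit advances to the next matching byte, then
// divide the bit index by 4 to recover the byte position.
std::vector<int> match_positions(uint64_t mask) {
    std::vector<int> positions;
    mask &= 0x1111111111111111ULL;
    while (mask != 0) {
        int bit = 0;
        while (((mask >> bit) & 1ULL) == 0) {
            ++bit;  // portable count-trailing-zeros
        }
        positions.push_back(bit / 4);
        mask &= mask - 1;  // clear the lowest set bit
    }
    return positions;
}
```

For example, comparing the 16 bytes of `abcabcabcabcabca` against `'a'` gives the mask `0xF00F00F00F00F00F`, and `match_positions` returns the byte offsets 0, 3, 6, 9, 12, 15. The design trade-off is that the mask is four times wider than an x86-style movemask, which is why iteration needs the extra masking and divide-by-4 step.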