Konstantinos Margaritis
|
a38324a5a3
|
add arm rshift128/rshift128
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
603bc14cdd
|
fix failing corner case, add pshufb_maskz()
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
e35b88f2c8
|
use STL make_unique, remove wrapper header, breaks C++17 compilation
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
6f44a1aa26
|
remove low4bits from the arguments, fix cases that mostly affect loading large (64) vectors and falling out of bounds
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
b67cd7dfd0
|
use rshift128() instead of vector-wide right shift
|
2021-10-12 11:51:34 +03:00 |
|
George Wort
|
4bc28272da
|
Fix CROSS_COMPILE_AARCH64 for SVE issues.
Change-Id: I7b9ba3ccb754d96eee22ca01714c783dae1e4956
|
2021-10-12 11:51:34 +03:00 |
|
George Wort
|
9fb79ac3ec
|
Add SVE2 support for vermicelli
Change-Id: Ia025de53521fbaefe5fb1e4425aaf75c7d80a14e
|
2021-10-12 11:51:34 +03:00 |
|
apostolos
|
6f88ecac44
|
Supervector test fixes
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
d04b899c29
|
fix truffle SIMD for S>16 as well
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
d453a612dc
|
fix last failing Shufti/Truffle tests
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
0ed10082b1
|
fix rtruffle, was failing Lbr and a few ReverseTruffle tests
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
845e533b66
|
move firstMatch, lastMatch to own header in util
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
41ff0962c4
|
minor fixes
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
2753dbb3b0
|
rename supervector class header, use dup_*() functions names instead of set1_*(), minor fixes
|
2021-10-12 11:51:34 +03:00 |
|
apostolos
|
1ce5e17ce9
|
Truffle simd vectorized
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
23b075cbd4
|
refactor shufti algorithm to use SuperVector class, WIP
|
2021-10-12 11:51:34 +03:00 |
|
Konstantinos Margaritis
|
556206f138
|
replace push_back by emplace_back where possible
|
2021-10-12 11:51:33 +03:00 |
|
Konstantinos Margaritis
|
d3ff893871
|
prefetch works best when addresses are 64-byte aligned
|
2021-10-12 11:50:32 +03:00 |
|
Konstantinos Margaritis
|
27bd09454f
|
use correct function names for AVX512, fix build failure
|
2021-02-15 13:54:19 +02:00 |
|
Wang Xiang W
|
5f930b267c
|
Limex: exception handling with AVX512
|
2021-01-25 14:13:13 +02:00 |
|
Wang Xiang W
|
9ea1e4be3d
|
limex: add fast NFA check
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
5ad3d64b4b
|
Discard HAVE_AVX512VBMI checks at Sheng/McSheng compile time.
|
2021-01-25 14:13:13 +02:00 |
|
Zhu,Wenjun
|
d96f1ab505
|
MCSHENG64: extend to 64-state based on mcsheng
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
a388a0f193
|
Fix sheng64 dump compile issue in clang.
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
c41d33c53f
|
Fix sheng64 compile issue in clang and in DEBUG_OUTPUT mode on SKX.
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
ed4b0f713a
|
SHENG64: 64-state 1-byte shuffle based DFA.
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
6a42b37fca
|
SHENG32: Compile priority sheng > mcsheng > sheng32.
|
2021-01-25 14:13:13 +02:00 |
|
Chang, Harry
|
cc747013c4
|
SHENG32: 32-state 1-byte shuffle based DFA.
|
2021-01-25 14:13:13 +02:00 |
|
Hong, Yang A
|
d71515be04
|
DFA: use sherman economically
|
2021-01-25 14:13:13 +02:00 |
|
Konstantinos Margaritis
|
b62247a36e
|
borrow cache prefetching tricks from the Marvell port, seem to improve performance by 5-28%
|
2021-01-25 12:13:35 +02:00 |
|
Konstantinos Margaritis
|
5333467249
|
fix names, use own intrinsic instead of explicit _mm* ones
|
2020-09-23 11:51:21 +03:00 |
|
Hong, Yang A
|
88a18dcf98
|
add AVX512 support for vermicelli model
|
2020-05-25 13:47:53 +00:00 |
|
Pavel Shlyak
|
3ca3602755
|
A tiny cleanup
|
2019-12-02 16:40:38 +00:00 |
|
Hong, Yang A
|
b5a8644b1f
|
mcclellan: fix dump issue in wide-state case.
|
2019-01-21 09:59:29 +08:00 |
|
Hong, Yang A
|
805a550a0a
|
mcclellan: wide state fixes for sanitisers and accept state construction
|
2019-01-21 09:58:18 +08:00 |
|
Hong, Yang A
|
c06d5e1c14
|
DFA state compression: 16-bit wide and sherman co-exist
|
2019-01-21 09:56:37 +08:00 |
|
Wang, Xiang W
|
8a0e4f8249
|
Use std::distance explicitly to avoid ambiguity with boost
|
2019-01-11 16:05:55 +08:00 |
|
Justin Viiret
|
16076ed4a3
|
mcsheng: debug format string fixes
|
2018-06-27 13:39:30 +08:00 |
|
Justin Viiret
|
25adf3f512
|
sheng: fix reportCurrent eod flag
eod here should be 0, not 1. The reportCurrent NFA API function for
Sheng is unused at the moment, so this wasn't causing any problems
earlier.
|
2018-06-27 13:39:24 +08:00 |
|
Justin Viiret
|
e65479dae5
|
mcclellancompile: MAX_SHERMAN_LIST_LEN can be 9
|
2018-06-27 13:39:10 +08:00 |
|
Justin Viiret
|
ce7cfbde82
|
misc: docs, typo fixes, small cleanups
|
2018-06-27 13:39:05 +08:00 |
|
Hong, Yang A
|
ae918116ab
|
find_better_daddy: position change
|
2017-09-18 13:31:09 +10:00 |
|
Justin Viiret
|
ea2e85ac87
|
ng_squash: switch to using unordered_map
Also some cleaning up, small performance improvements.
|
2017-09-18 13:29:34 +10:00 |
|
Matthew Barr
|
29e1aae3fb
|
Use an unsigned byte, not char
|
2017-09-18 13:29:33 +10:00 |
|
Justin Viiret
|
b694fed727
|
mcclellancompile: simplify calc_min_dist_from_bob
|
2017-09-18 13:29:33 +10:00 |
|
Justin Viiret
|
58c3de0d33
|
mcclellancompile: don't copy dstate unnecessarily
|
2017-09-18 13:29:33 +10:00 |
|
Justin Viiret
|
3b392d6b70
|
accel_dfa_build_strat: make extend() faster
|
2017-09-18 13:29:33 +10:00 |
|
Justin Viiret
|
d5b3f2b508
|
gatherReports: fewer map lookups
|
2017-09-18 13:26:05 +10:00 |
|
Justin Viiret
|
ba6f638c40
|
accel_dfa_build_strat: use flat_set
|
2017-09-18 13:26:05 +10:00 |
|
Justin Viiret
|
14333f5b0b
|
rdfa: make getImplAlphaSize() inline
|
2017-09-18 13:25:52 +10:00 |
|