637 Commits

Author SHA1 Message Date
Justin Viiret
8435f918d1 limex: use cyclics mask for repeat compress 2016-08-10 14:52:56 +10:00
Justin Viiret
ae5e347778 limex: only compress active repeats 2016-08-10 14:52:56 +10:00
Justin Viiret
aa54352a3a limex: switch off stale repeats during compress 2016-08-10 14:52:56 +10:00
Justin Viiret
19e79be87d limex: compress repeats before state 2016-08-10 14:52:56 +10:00
Justin Viiret
7cc5346c11 assign groups: turn pair<pair> into a tuple 2016-08-10 14:49:24 +10:00
Justin Viiret
bfaa0acaea rose: preserve lit properties when building masks
This fixes a bug with commit 6a6b0e5, which did not preserve the
requires_explode and requires_benefits properties when a new literal was
generated to add an HWLM and/cmp mask.

Also extends the requires_explode handling to allow masked literals.
2016-08-10 14:48:29 +10:00
Boris Nagaev
b73bd9b6e1 new test: deserializers fail with garbage input 2016-08-10 14:08:59 +10:00
Boris Nagaev
66c2a53d33 hs_serialized_database_info: check whole input
Most lines of hs_serialized_database_info are not needed,
as the information is provided by db_decode_header.
Moreover, db_decode_header changes its first argument,
so it points to end of header after db_decode_header.
2016-08-10 13:27:44 +10:00
Boris Nagaev
373a624bad simplegrep: open file in binary mode ("rb")
Otherwise it hangs on binary files (platform MinGW).
2016-08-10 13:20:26 +10:00
Justin Viiret
e9cfbae68f workaround for freebsd/clang/libc++ build issues
Rather than relying on set's constructor from {}, explicitly construct
the set.
2016-07-08 11:07:51 +10:00
Matthew Barr
e3d416a6ea Apply some consistency to the names we give shifts 2016-07-08 11:07:50 +10:00
Matthew Barr
c76ff285e7 remove unnecessary function proto 2016-07-08 11:07:50 +10:00
Matthew Barr
9c915cc936 remove only use of cmpmsk8 and unused cmpmsk16 2016-07-08 11:07:50 +10:00
Matthew Barr
0722b5db5b Remove GCC-style compound statements
These do not appear to give us benefits over inlining on recent compilers.
2016-07-08 11:07:50 +10:00
Matthew Barr
1b3e795fc9 teddy: we only need the upper lane
Just use an extract, no need to shuffle first.
2016-07-08 11:07:50 +10:00
Matthew Barr
adf820bbba simd: simplify the set-all-ones util funcs
Modern compilers (gcc, icc) get this right, with the benefit of
removing our last use of inline asm in this file.
2016-07-08 11:07:50 +10:00
Matthew Barr
4d6934fc77 Move limex specific shuffle utils and ssse3 funcs 2016-07-08 11:07:50 +10:00
Justin Viiret
9f98f4c7b2 nfa: standardise callback start, end naming 2016-07-08 11:02:05 +10:00
Justin Viiret
cf9e40ae1c nfa: unify NfaCallback and SomNfaCallback
Use just one callback type, with both start and end offsets.
2016-07-08 11:01:56 +10:00
Xiang Wang
9087d59be5 tamarama: add container engine for exclusive nfas
Add the new Tamarama engine that acts as a container for infix/suffix
engines that can be proven to run exclusively of one another.

This reduces stream state for pattern sets with many exclusive engines.
2016-07-08 11:01:34 +10:00
Justin Viiret
013dbd3b3c rose: re-inline literal handling program exec 2016-07-08 11:01:34 +10:00
Justin Viiret
76d96809f8 rose: move roseRunProgram into its own unit
The roseRunProgram function had gotten very large for the number of
sites it was being inlined into, with negative effects on performance in
large cases. This change moves it into its own translation unit.
2016-07-08 11:01:34 +10:00
Alex Coyte
f166bc5658 allow some prefixes that may squash the literal match to run eagerly 2016-07-08 11:01:34 +10:00
Matthew Barr
f9ded59361 Disable strict aliasing
Strict aliasing allows to compiler to make some optimisations, but they aren't
without risk. The benefits do not appear to be worth the risk.
2016-07-08 10:59:40 +10:00
Alex Coyte
8699e35c09 prevent merging the e and f tables if the ftable is squashable 2016-07-08 10:59:40 +10:00
Alex Coyte
575e8c06dc only show floating groups to the floating table 2016-07-08 10:59:40 +10:00
Justin Viiret
1c2b0a271d role aliasing: tidy up naming 2016-07-08 10:59:40 +10:00
Justin Viiret
1d18852dc9 role aliasing: unify map update code 2016-07-08 10:59:40 +10:00
Justin Viiret
6239805561 rose: don't build empty sparse iter subprograms 2016-07-08 10:59:40 +10:00
Justin Viiret
679042779a role aliasing: use a reverse ghost map 2016-07-08 10:59:40 +10:00
Justin Viiret
1e5fcd5e80 tighten ghost vertex checks 2016-07-08 10:59:40 +10:00
Justin Viiret
0749f7c06d rose: allow ghosts to be aliased 2016-07-08 10:59:40 +10:00
Justin Viiret
e915ca21c5 limex: tidy up scoring code 2016-07-08 10:57:29 +10:00
Justin Viiret
8648397257 limex: invert scoring to count up from zero 2016-07-08 10:57:29 +10:00
Kirill Rybalchenko
9d2403e8bb limex: implement variable shift NFA engines
Replaces the old LimEx NFA engines, which were specialised for model
size and number of shifts, with a new set of engines that can handle a
variable number of shifts.
2016-07-08 10:57:29 +10:00
Justin Viiret
cdaf705a87 rose: pick up more prefix->lookaround conversions 2016-07-08 10:57:29 +10:00
Justin Viiret
d3c56b532b rose build: dedupe hasLastByteHistorySucc func 2016-07-08 10:57:00 +10:00
Justin Viiret
cf8e8c90ff eod: move stream eod code to stream.c 2016-07-08 10:55:36 +10:00
Justin Viiret
d5c1280b9f eod: tidy up, rename to roseStreamEodExec 2016-07-08 10:55:36 +10:00
Justin Viiret
66e0b77aa4 block: unify roseBlockExec and roseBlockExec_i 2016-07-08 10:55:36 +10:00
Justin Viiret
513ac11dbc block: move matcher invocations into functions 2016-07-08 10:55:36 +10:00
Justin Viiret
a9eba12cce rose: inline block-mode eod check 2016-07-08 10:55:36 +10:00
Justin Viiret
159c09b70e roseEnginesEod: trust the queue structure 2016-07-08 10:55:36 +10:00
Justin Viiret
d9bd6d5dee roseSuffixesEod: trust the queue structure 2016-07-08 10:55:36 +10:00
Justin Viiret
3e0232f0d6 eod: retire getELiteralMatcher 2016-07-08 10:55:36 +10:00
Justin Viiret
426bfc9cfb rose_build_bytecode: clean up 2016-07-08 10:55:36 +10:00
Justin Viiret
ae157034e9 eod: tidy runtime 2016-07-08 10:55:36 +10:00
Justin Viiret
78e4332a8b move eod iter program into general eod program 2016-07-08 10:55:36 +10:00
Justin Viiret
39461cc806 eod: move hwlm execution into MATCHER_EOD instr 2016-07-08 10:55:36 +10:00
Justin Viiret
b8f771e824 rose_build_bytecode: tidy up addPredBlocks 2016-07-08 10:55:36 +10:00