1691 Commits

Author SHA1 Message Date
Alex Bondarev
90ac746303 SVE enabled on user input. updated README
tune and arch flags will be applied from autodetect only if they have been created by the process, otherwise the old logical flow remains wrt the flags
2022-09-18 12:04:05 +03:00
Alex Bondarev
4ab0730dbe additional mcpu flag cleanup 2022-09-16 00:03:08 +03:00
Alex Bondarev
d0a017da99 removed cpu reference flags and fixed tune flag 2022-09-15 18:38:01 +03:00
Alex Bondarev
69e6176e09 updated README to reflect CMake changes 2022-09-13 18:29:06 +03:00
Alex Bondarev
ee0c8f763f fix to correctly place the autodetected flags and to activate SVE options 2022-09-13 18:21:10 +03:00
Konstantinos Margaritis
f6250ae3e5 bump version vectorscan/5.4.8 2022-09-13 12:57:08 +00:00
Konstantinos Margaritis
361feb64e3
Merge pull request #124 from VectorCamp/develop
Merge develop to master
2022-09-13 15:52:20 +03:00
Konstantinos Margaritis
d0ae940261
Merge pull request #123 from VectorCamp/feature/neon-shift-optimizations
[NEON] simplify/optimize shift/align primitives
2022-09-13 09:13:05 +03:00
Konstantinos Margaritis
67b414f2f9 [NEON] simplify/optimize shift/align primitives 2022-09-12 13:09:51 +00:00
Konstantinos Margaritis
db2a6d65f1
Merge pull request #121 from liquidaty/mingw64-develop
fix to enable successful build with mingw64
2022-09-09 13:42:49 +03:00
liquidaty
f4840adf3d fix to enable successful build with mingw64 2022-09-08 09:59:37 -07:00
Konstantinos Margaritis
0c97e5f2c2
Merge pull request #119 from VectorCamp/feature/vsx-optimizations
VSX optimizations
2022-09-08 13:41:13 +03:00
Konstantinos Margaritis
e3c237a7e0 use correct intrinsic for lshiftbyte_m128 2022-09-07 16:00:10 +03:00
Konstantinos Margaritis
756ef409b4 provide non-immediate versions of lshiftbyte/rshiftbyte on x86 2022-09-07 15:07:20 +03:00
Konstantinos Margaritis
1ae0d15181 readd simd_onebit_masks for x86, needs more work 2022-09-07 13:42:25 +03:00
Konstantinos Margaritis
0af2ba8616 [NEON] optimize mask1bit128, get rid of simd_onebit_masks 2022-09-07 10:20:01 +00:00
Konstantinos Margaritis
02ae2a3cad remove simd_onebit_masks from arm/x86 headers, as they moved to common 2022-09-07 12:41:32 +03:00
Konstantinos Margaritis
305a041c73 [VSX] optimize alignr method 2022-09-07 12:35:28 +03:00
Konstantinos Margaritis
a837cf3bee [VSX] optimize shift operators 2022-09-07 12:16:14 +03:00
Konstantinos Margaritis
be20c2c519 [VSX] optimize shifting methods, replace template Unroller 2022-09-07 12:14:15 +03:00
Konstantinos Margaritis
dc6b8ae92d optimize comparemask implementation, clean up code, use union types instead of casts 2022-09-07 02:02:11 +03:00
Konstantinos Margaritis
7295b9c718 [VSX] add algorithm for alignr w/o use of immediates 2022-09-07 00:01:54 +03:00
Konstantinos Margaritis
94fe406f0c [VSX] correct lshiftbyte_m128/rshiftbyte_m128, variable_byte_shift 2022-09-06 23:59:51 +03:00
Konstantinos Margaritis
17467ff21b [VSX] huge optimization of movemask128 2022-09-06 20:08:44 +03:00
Konstantinos Margaritis
0e7874f122 [VSX] optimize and correct lshift_m128/rshift_m128 2022-09-06 18:48:19 +03:00
Konstantinos Margaritis
026f761671 [VSX] optimized mask1bit128(), moved simd_onebit_masks to common 2022-09-06 18:10:55 +03:00
Konstantinos Margaritis
43c053a069 add popcount32x4, popcount64x4 helper functions 2022-09-06 16:55:56 +03:00
Konstantinos Margaritis
c043730675
Merge pull request #118 from VectorCamp/bugfix/hyperscan-backport-202208
Bugfix/hyperscan backport 202208
2022-09-03 09:32:43 +03:00
Konstantinos Margaritis
74ab41897c Add missing <memory> header 2022-08-30 20:40:23 +03:00
Liu Zixian
c597f69c59 fix build with glibc-2.34
SIGTSKSZ is no long a constant after glibc 2.34
https://sourceware.org/pipermail/libc-alpha/2021-August/129718.html
2022-08-29 15:37:59 +03:00
Hong, Yang A
70b2a28386 literal API: add empty string check.
fixes github issue #302, #304
2022-08-29 15:08:54 +03:00
Hong, Yang A
4f27a70dd7 chimera: fix SKIP flag issue
fix github issue #360
2022-08-29 15:03:34 +03:00
Chang, Harry
31afacc7be Corpus editor: fix random char value of UTF-8. 2022-08-29 15:03:30 +03:00
Chang, Harry
a9ca0e4de3 Corpus generator: fix random char value of UTF-8.
fixes github issue #184
2022-08-29 15:03:26 +03:00
Hong, Yang A
4d4940dfbe bugfix: fix overflow risk of strlen function 2022-08-29 15:03:22 +03:00
hongyang7
2731a3384b Fix segfaults on allocation failure (#4)
Throw std::bad_alloc instead of returning nullptr from
ue2::AlignedAllocator. Allocators for STL containers are expected never
to return with an invalid pointer, and instead must throw on failure.
Violating this expectation can lead to invalid pointer dereferences.

Co-authored-by: johanngan <johanngan.us@gmail.com>

fixes github issue #317 (PR #320)
2022-08-29 15:03:18 +03:00
Chang, Harry
c1659b8544 Logical Combination: bypass combination flag in hs_expression_info.
Fixes github issue #291
2022-08-29 15:03:14 +03:00
Hong, Yang A
decabdfede update year for bugfix #302-#305 2022-08-29 15:03:11 +03:00
Hong, Yang A
a119693a66 mcclellan: improve wide-state checking in Sherman optimization
fixes github issue #305
2022-08-29 15:03:06 +03:00
Hong, Yang A
cafd5248b1 literal API: add instruction support
fixes github issue #303
2022-08-29 15:02:59 +03:00
Konstantinos Margaritis
6259783d79
Merge pull request #116 from pareenaverma/develop
Fixed the PCRE download location
2022-07-20 23:08:11 +03:00
Konstantinos Margaritis
19947f70d2
Merge pull request #113 from danlark1/develop
Optimize vectorscan for aarch64 by using shrn instruction
2022-07-20 16:41:33 +03:00
Ubuntu
b5e1384995 Fixed the PCRE download location 2022-07-20 13:26:52 +00:00
Danila Kutenin
db52ce6f08 Fix avx512 movemask call 2022-07-20 09:03:50 +01:00
Danila Kutenin
7e7f604f7d Fix ppc64el debug 2022-06-26 23:05:17 +00:00
Danila Kutenin
849846700a Minor fix 2022-06-26 23:02:02 +00:00
Danila Kutenin
8a49e20bcd Fix formatting of a couple files 2022-06-26 22:59:58 +00:00
Danila Kutenin
49eb18ee4f Optimize vectorscan for aarch64 by using shrn instruction
This optimization is based on the thread
https://twitter.com/Danlark1/status/1539344279268691970 and uses
shift right and narrow by 4 instruction https://developer.arm.com/documentation/ddi0596/2020-12/SIMD-FP-Instructions/SHRN--SHRN2--Shift-Right-Narrow--immediate--

To achieve that, I needed to redesign a little movemask into comparemask
and have an additional step towards mask iteration. Our benchmarks
showed 10-15% improvement on average for long matches.
2022-06-26 22:55:45 +00:00
Konstantinos Margaritis
73695e419c
Merge pull request #108 from jth/cmake-python
CMake: Use non-deprecated method for finding python
2022-05-20 09:05:11 +03:00
Jan Henning
85a77e3eff Bump scripts to python3 2022-05-19 16:25:08 +02:00