Alex Bondarev
49a8775fbd
clang SVE build fix
2022-09-18 19:42:45 +03:00
Alex Bondarev
d144762b4a
SVE enabled on user input. updated README
...
tune and arch flags will be applied from autodetect only if they have been created by the process, otherwise the old logical flow remains wrt the flags
2022-09-18 12:04:05 +03:00
Alex Bondarev
17858d4da3
additional mcpu flag cleanup
2022-09-16 00:03:08 +03:00
Alex Bondarev
e9530145c3
removed cpu reference flags and fixed tune flag
2022-09-15 18:38:01 +03:00
Alex Bondarev
1368a72d55
updated README to reflect CMake changes
2022-09-13 18:29:06 +03:00
Alex Bondarev
779cbbc1c1
fix to correctly place the autodetected flags and to activate SVE options
2022-09-13 18:21:10 +03:00
Konstantinos Margaritis
0e9e57dd58
bump version
2022-09-13 12:57:08 +00:00
Konstantinos Margaritis
d7f6a163d7
Merge pull request #124 from VectorCamp/develop
...
Merge develop to master
2022-09-13 15:52:20 +03:00
Konstantinos Margaritis
5b67015dc0
Merge pull request #123 from VectorCamp/feature/neon-shift-optimizations
...
[NEON] simplify/optimize shift/align primitives
2022-09-13 09:13:05 +03:00
Konstantinos Margaritis
4b41c5fe25
[NEON] simplify/optimize shift/align primitives
2022-09-12 13:09:51 +00:00
Konstantinos Margaritis
778951e046
Merge pull request #121 from liquidaty/mingw64-develop
...
fix to enable successful build with mingw64
2022-09-09 13:42:49 +03:00
liquidaty
b29cb26625
fix to enable successful build with mingw64
2022-09-08 09:59:37 -07:00
Konstantinos Margaritis
dd57e71f3c
Merge pull request #119 from VectorCamp/feature/vsx-optimizations
...
VSX optimizations
2022-09-08 13:41:13 +03:00
Konstantinos Margaritis
a0e53c7d85
use correct intrinsic for lshiftbyte_m128
2022-09-07 16:00:10 +03:00
Konstantinos Margaritis
37b2cae189
provide non-immediate versions of lshiftbyte/rshiftbyte on x86
2022-09-07 15:07:20 +03:00
Konstantinos Margaritis
ce90e58af1
readd simd_onebit_masks for x86, needs more work
2022-09-07 13:42:25 +03:00
Konstantinos Margaritis
0052df5f5b
[NEON] optimize mask1bit128, get rid of simd_onebit_masks
2022-09-07 10:20:01 +00:00
Konstantinos Margaritis
76a31d1bc0
remove simd_onebit_masks from arm/x86 headers, as they moved to common
2022-09-07 12:41:32 +03:00
Konstantinos Margaritis
f71de24f37
[VSX] optimize alignr method
2022-09-07 12:35:28 +03:00
Konstantinos Margaritis
4e3b3c3948
[VSX] optimize shift operators
2022-09-07 12:16:14 +03:00
Konstantinos Margaritis
ba1b805f60
[VSX] optimize shifting methods, replace template Unroller
2022-09-07 12:14:15 +03:00
Konstantinos Margaritis
d87b0e2a7e
optimize comparemask implementation, clean up code, use union types instead of casts
2022-09-07 02:02:11 +03:00
Konstantinos Margaritis
c097f169ad
[VSX] add algorithm for alignr w/o use of immediates
2022-09-07 00:01:54 +03:00
Konstantinos Margaritis
bdc3947746
[VSX] correct lshiftbyte_m128/rshiftbyte_m128, variable_byte_shift
2022-09-06 23:59:51 +03:00
Konstantinos Margaritis
59ace0ebf8
[VSX] huge optimization of movemask128
2022-09-06 20:08:44 +03:00
Konstantinos Margaritis
ef9116b52e
[VSX] optimize and correct lshift_m128/rshift_m128
2022-09-06 18:48:19 +03:00
Konstantinos Margaritis
6dce55c3fe
[VSX] optimized mask1bit128(), moved simd_onebit_masks to common
2022-09-06 18:10:55 +03:00
Konstantinos Margaritis
27dbdcf122
add popcount32x4, popcount64x4 helper functions
2022-09-06 16:55:56 +03:00
Konstantinos Margaritis
361e9be4a0
Merge pull request #118 from VectorCamp/bugfix/hyperscan-backport-202208
...
Bugfix/hyperscan backport 202208
2022-09-03 09:32:43 +03:00
Konstantinos Margaritis
c0436e7cad
Add missing <memory> header
2022-08-30 20:40:23 +03:00
Liu Zixian
ca7bd22d26
fix build with glibc-2.34
...
SIGTSKSZ is no long a constant after glibc 2.34
https://sourceware.org/pipermail/libc-alpha/2021-August/129718.html
2022-08-29 15:37:59 +03:00
Hong, Yang A
c81a6d14b0
literal API: add empty string check.
...
fixes github issue #302 , #304
2022-08-29 15:08:54 +03:00
Hong, Yang A
ef3ca02ce4
chimera: fix SKIP flag issue
...
fix github issue #360
2022-08-29 15:03:34 +03:00
Chang, Harry
cc948674dd
Corpus editor: fix random char value of UTF-8.
2022-08-29 15:03:30 +03:00
Chang, Harry
d57dd3367c
Corpus generator: fix random char value of UTF-8.
...
fixes github issue #184
2022-08-29 15:03:26 +03:00
Hong, Yang A
1ae75b67a1
bugfix: fix overflow risk of strlen function
2022-08-29 15:03:22 +03:00
hongyang7
cb94b9d7b7
Fix segfaults on allocation failure ( #4 )
...
Throw std::bad_alloc instead of returning nullptr from
ue2::AlignedAllocator. Allocators for STL containers are expected never
to return with an invalid pointer, and instead must throw on failure.
Violating this expectation can lead to invalid pointer dereferences.
Co-authored-by: johanngan <johanngan.us@gmail.com >
fixes github issue #317 (PR #320 )
2022-08-29 15:03:18 +03:00
Chang, Harry
5307bbd1dc
Logical Combination: bypass combination flag in hs_expression_info.
...
Fixes github issue #291
2022-08-29 15:03:14 +03:00
Hong, Yang A
2e60bdde8a
update year for bugfix #302-#305
2022-08-29 15:03:11 +03:00
Hong, Yang A
e05b154abb
mcclellan: improve wide-state checking in Sherman optimization
...
fixes github issue #305
2022-08-29 15:03:06 +03:00
Hong, Yang A
5ae40498a5
literal API: add instruction support
...
fixes github issue #303
2022-08-29 15:02:59 +03:00
Konstantinos Margaritis
bef496e60f
Merge pull request #116 from pareenaverma/develop
...
Fixed the PCRE download location
2022-07-20 23:08:11 +03:00
Konstantinos Margaritis
6a2700f1d4
Merge pull request #113 from danlark1/develop
...
Optimize vectorscan for aarch64 by using shrn instruction
2022-07-20 16:41:33 +03:00
Ubuntu
42f5165f7e
Fixed the PCRE download location
2022-07-20 13:26:52 +00:00
Danila Kutenin
1e09891b2b
Fix avx512 movemask call
2022-07-20 09:03:50 +01:00
Danila Kutenin
2dd7b9a4f9
Fix ppc64el debug
2022-06-26 23:05:17 +00:00
Danila Kutenin
45fe139224
Minor fix
2022-06-26 23:02:02 +00:00
Danila Kutenin
4b83ea1c78
Fix formatting of a couple files
2022-06-26 22:59:58 +00:00
Danila Kutenin
eb7b0bb50c
Optimize vectorscan for aarch64 by using shrn instruction
...
This optimization is based on the thread
https://twitter.com/Danlark1/status/1539344279268691970 and uses
shift right and narrow by 4 instruction https://developer.arm.com/documentation/ddi0596/2020-12/SIMD-FP-Instructions/SHRN--SHRN2--Shift-Right-Narrow--immediate--
To achieve that, I needed to redesign a little movemask into comparemask
and have an additional step towards mask iteration. Our benchmarks
showed 10-15% improvement on average for long matches.
2022-06-26 22:55:45 +00:00
Konstantinos Margaritis
99eb384f62
Merge pull request #108 from jth/cmake-python
...
CMake: Use non-deprecated method for finding python
2022-05-20 09:05:11 +03:00