Commit Graph

1798 Commits

Author SHA1 Message Date
Danila Kutenin
1e09891b2b Fix avx512 movemask call 2022-07-20 09:03:50 +01:00
Danila Kutenin
2dd7b9a4f9 Fix ppc64el debug 2022-06-26 23:05:17 +00:00
Danila Kutenin
45fe139224 Minor fix 2022-06-26 23:02:02 +00:00
Danila Kutenin
4b83ea1c78 Fix formatting of a couple files 2022-06-26 22:59:58 +00:00
Danila Kutenin
eb7b0bb50c Optimize vectorscan for aarch64 by using shrn instruction
This optimization is based on the thread
https://twitter.com/Danlark1/status/1539344279268691970 and uses
shift right and narrow by 4 instruction https://developer.arm.com/documentation/ddi0596/2020-12/SIMD-FP-Instructions/SHRN--SHRN2--Shift-Right-Narrow--immediate--

To achieve that, I needed to redesign a little movemask into comparemask
and have an additional step towards mask iteration. Our benchmarks
showed 10-15% improvement on average for long matches.
2022-06-26 22:55:45 +00:00
Konstantinos Margaritis
99eb384f62 Merge pull request #108 from jth/cmake-python
CMake: Use non-deprecated method for finding python
2022-05-20 09:05:11 +03:00
Jan Henning
218a1cf76a Bump scripts to python3 2022-05-19 16:25:08 +02:00
Jan Henning
3c4f5983dc Use non-deprecated method of finding python 2022-05-19 10:20:17 +02:00
Konstantinos Margaritis
ebf4ab9e5a Update CMakeLists.txt 2022-05-05 12:14:53 +03:00
Konstantinos Margaritis
6def0d73c3 Merge pull request #103 from VectorCamp/develop
Develop
2022-05-05 10:34:56 +03:00
Konstantinos Margaritis
cc65177804 Update Jenkinsfile 2022-05-04 21:57:38 +03:00
Konstantinos Margaritis
f25c4bb61c Update Jenkinsfile 2022-05-04 16:41:10 +03:00
Konstantinos Margaritis
f355d77314 Update Jenkinsfile 2022-05-04 16:30:22 +03:00
Konstantinos Margaritis
4bb34c4140 Merge pull request #105 from VectorCamp/bugfix/jenkins
fix large pipeline error
2022-05-04 16:27:22 +03:00
Konstantinos Margaritis
1dca71c410 fix large pipeline error 2022-05-04 16:26:02 +03:00
Konstantinos Margaritis
e6cb2439d4 Delete JenkinsFile 2022-05-04 16:14:19 +03:00
Konstantinos Margaritis
fbc83d134c Merge pull request #104 from VectorCamp/bugfix/jenkinsfile
add Jenkinsfile back to master branch
2022-05-04 16:04:00 +03:00
Konstantinos Margaritis
330935a394 add Jenkinsfile back to master branch 2022-05-04 16:01:53 +03:00
Konstantinos Margaritis
8d53b19788 add Jenkinsfile back to master branch 2022-04-19 11:36:25 +03:00
Konstantinos Margaritis
d4044039db Merge pull request #102 from danlark1/patch-2
Optimized and correct version of movemask128 for ARM
2022-04-18 20:56:26 +03:00
Daniel Kutenin
2360314f9d Optimized and correct version of movemask128 for ARM
Closes #99

https://gcc.godbolt.org/z/cTjKqzcvn

Previous version was not correct because movemask thought of having bytes 0xFF. We can fully match the semantics + do it faster with USRA instructions.

Re-submission to a develop branch
2022-04-18 13:37:53 +01:00
Konstantinos Margaritis
7f2b6c5aa1 Merge pull request #94 from a16bitsysop/fat_runtime
change FAT_RUNTIME to a normal option so it can be set to off
2022-04-18 11:08:29 +03:00
Konstantinos Margaritis
d2e1478d47 Merge pull request #93 from danlark1/master
Fix all ASAN issues in vectorscan
2022-04-18 11:07:18 +03:00
Duncan Bellamy
649121ee04 move to original position 2022-02-22 19:21:18 +00:00
Duncan Bellamy
20cb5cd2b2 change FAT_RUNTIME to a normal option so it can be set to off
fixes #89
2022-02-20 13:16:58 +00:00
Danila Kutenin
faca38e058 Fix a couple of tests 2022-02-18 19:31:03 +00:00
Danila Kutenin
f46b787332 Add sanitize options 2022-02-18 18:35:26 +00:00
Danila Kutenin
a526f6bb6b Fix all ASAN issues in vectorscan 2022-02-18 17:14:51 +00:00
Konstantinos Margaritis
91e1212084 Merge pull request #90 from BigRedEye/vectorscan-master
Fix word boundary assertions under C++20
2022-02-08 09:20:34 +02:00
BigRedEye
89bc7af867 fix: Mark operator bool explicit 2022-02-08 00:22:23 +03:00
Konstantinos Margaritis
7712703529 Merge pull request #86 from VectorCamp/develop
New release 5.4.6
2022-01-21 12:25:40 +02:00
Konstantinos Margaritis
4bed72525d Merge pull request #87 from VectorCamp/feature/move-debian-package-to-separate-branch
keep debian folder in a separate branch
2022-01-21 12:24:03 +02:00
Konstantinos Margaritis
01de6e7d7b keep debian folder in a separate branch 2022-01-21 12:07:25 +02:00
Konstantinos Margaritis
ad209e1855 Merge pull request #85 from VectorCamp/feature/add-debian-package
Feature/add debian package
2022-01-21 10:12:04 +02:00
Konstantinos Margaritis
8bc999d467 change source format to native, as we include debian folder 2022-01-20 21:03:02 +02:00
Konstantinos Margaritis
96f3092262 fix description, remove sse4.2-support from b-depends 2022-01-20 21:02:46 +02:00
Konstantinos Margaritis
f398a672d0 add ITP bug report 2022-01-20 21:02:30 +02:00
Konstantinos Margaritis
29f31b1c42 add sse4.2-support package to enforce such dependency 2022-01-19 15:08:52 +02:00
Konstantinos Margaritis
0f3558fa71 remove preinst script, not needed as we bumped our deps 2022-01-19 15:08:04 +02:00
Konstantinos Margaritis
ff005c5347 add our copyrights, minor fixes 2022-01-19 14:31:59 +02:00
Konstantinos Margaritis
774279220c defer setting arch/tune flags for FAT_RUNTIME 2022-01-18 20:34:45 +02:00
Konstantinos Margaritis
1c7f01ab17 enable FAT_RUNTIME 2022-01-18 20:32:22 +02:00
Konstantinos Margaritis
d1ced630b8 fix DEB_CMAKE_FLAGS depending on DEB_HOST_ARCH 2021-12-22 13:25:29 +02:00
Konstantinos Margaritis
404785f30f Initial attempt at debian packaging, modified hyperscan packaging 2021-12-22 13:13:12 +02:00
Konstantinos Margaritis
dd7de46109 Merge pull request #82 from VectorCamp/feature/add-macos-support
Minor changes to enable compilation on Mac M1
2021-12-12 01:13:14 +02:00
Konstantinos Margaritis
bacb015db7 Minor changes to enable compilation on Mac M1 2021-12-11 15:43:55 +02:00
Konstantinos Margaritis
61ca9717d6 Merge pull request #81 from VectorCamp/feature/add-clang-support
Feature/add clang support
2021-12-07 22:16:38 +02:00
Konstantinos Margaritis
242a460115 minor fixes 2021-12-07 08:49:59 +00:00
Konstantinos Margaritis
b6ddf2b41c fix clang-release-arm compilation 2021-12-07 08:43:52 +00:00
Konstantinos Margaritis
f4ccc40c58 fix wrong castings for NEON 2021-12-06 21:35:51 +00:00