Konstantinos Margaritis
cc1a8dd47e
fix SVE2 build after the changes
2021-11-25 18:48:24 +02:00
apostolos
2136580d50
resolving conficts after merging
2021-11-13 18:58:22 +02:00
Konstantinos Margaritis
694e2faf7f
remove vermicelli.h and replace it with vermicelli.hpp
2021-11-02 22:30:53 +02:00
Konstantinos Margaritis
c5e70b11b7
remove vermicelli_sse.h
2021-11-01 16:40:01 +00:00
Konstantinos Margaritis
814a4ef0c0
add to CMake
2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
5e59b36634
add arm vector types in union, avoid -flax-conversions, fix castings
2021-11-01 16:52:17 +02:00
Vectorcamp
28f8f30866
compile fixes for vsc port
2021-10-14 13:53:55 +03:00
Vectorcamp
3746542a0d
ppc64el arcitecture added in CMakelists file
2021-10-14 13:53:55 +03:00
Vectorcamp
5c70d92925
test commit from VM and CMakelists add power support
2021-10-14 13:53:55 +03:00
Konstantinos Margaritis
14be68587b
add initial ppc64el support
...
(cherry picked from commit 63e26a4b28 )
(cherry picked from commit c214ba253327114c16d0724f75c998ab00d44919)
2021-10-14 13:53:55 +03:00
Konstantinos Margaritis
9e07d7971d
bump version
2021-10-12 11:51:35 +03:00
Konstantinos Margaritis
f2e45ccc06
remove simd_utils.c
2021-10-12 11:51:35 +03:00
Konstantinos Margaritis
6ec68bbedd
do not include the Supervector impl.cpp files in fat runtime
2021-10-12 11:51:35 +03:00
Duncan Bellamy
314116cbb5
remove adding CMAKE_CXX_IMPLICIT_LINK_LIBRARIES to PRIVATE_LIBS
...
as on alpine linux this add gcc_s which is a shared library
on alpine:
Libs.private: -lstdc++ -lm -lssp_nonshared -lgcc_s -lgcc -lc -lgcc_s -lgcc
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
e2fc2c3dfe
remove confusing OPTIMISE flag
2021-10-12 11:51:34 +03:00
apostolos
bb9bcb3760
micro-benchmarks for shufti, trufle and noodle added
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
cf4b95fff2
remove Windows/ICC support
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
acacafe1af
add missing compile flags
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
44496d7508
add accidentally removed lines
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
cd5c251f67
* add -fno-new-ttp-matching to fix build-failures on newer gcc compilers with C++17
...
* add explicit -mssse3, -mavx2 in compiler flags in respective build profiles
2021-10-12 11:51:34 +03:00
George Wort
ceb230c7db
Replace USE_ARM_SVE with HAVE_SVE.
...
Change-Id: I469efaac197cba93201f2ca6eca78ca61be3054d
2021-10-12 11:51:34 +03:00
George Wort
b54710d208
Implement new Vermicelli16 acceleration functions using SVE2.
...
The scheme utilises the MATCH and NMATCH instructions to
scan for 16 characters at the same rate as vermicelli
scans for one.
Change-Id: Ie2cef904c56651e6108593c668e9b65bc001a886
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
a9413d1397
change C/C++ standard used to C17/C++17
2021-10-12 11:51:34 +03:00
George Wort
7e5138b78f
Fix CROSS_COMPILE_AARCH64 for SVE issues.
...
Change-Id: I7b9ba3ccb754d96eee22ca01714c783dae1e4956
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
2ed6ca72b5
disable OPTIMISE by default
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
472b119e35
compilation fixes for debug mode
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
e434f985c0
handle GNUCC_ARCH on non-x86 properly
2021-10-12 11:51:34 +03:00
apostolos
bab390d442
Truffle simd vectorized
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
fb71288740
add missing ARM SuperVector methods, some tests still fail, WIP
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
f689179a82
refactor shufti algorithm to use SuperVector class, WIP
2021-10-12 11:51:34 +03:00
George Wort
d59f11dc01
Add SVE, SVE2, and SVE2_BITPERM as targets
...
Change-Id: I5231e2eb0a31708a16c853dc83ea48db32e0b0a5
2021-10-12 11:51:34 +03:00
George Wort
503483a8ee
Enable cross compilation to aarch64
...
Change-Id: Iafc8ac60926f5286990ce63a4ff4f8b6a7c46bef
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
c96cfd73c4
rename project, change to noodle_engine.cpp
2021-10-12 11:51:34 +03:00
Konstantinos Margaritis
1cdb7312cb
use -O3 for C++ code as well, makes a difference
2021-10-12 11:51:33 +03:00
Konstantinos Margaritis
b641eee167
change project name in CMakeLists
2021-04-12 21:06:28 +03:00
Konstantinos Margaritis
ff526ebfb8
bump version
2021-02-15 20:19:09 +02:00
Konstantinos Margaritis
71c59a95e9
add BUILD_AVX2 definition, enable non-AVX2 building selectively
2021-02-15 13:54:19 +02:00
Wang Xiang W
bd29733de2
Bump version number for release
2021-01-25 14:13:13 +02:00
Hong, Yang A
5450ef34f2
examples: add cmake enabling option BUILD_EXAMPLES.
2021-01-25 14:13:13 +02:00
Chang, Harry
86b57e409f
Add cpu feature / target info "AVX512VBMI".
2021-01-25 14:13:13 +02:00
Chang, Harry
6b6c37fe9b
Fix cmake error on ICX under release mode.
2021-01-25 14:13:13 +02:00
Konstantinos Margaritis
59af972d72
fix IA32 build, as we need minimum SSSE3 support for compilation to succeed
2020-12-30 19:57:44 +02:00
Konstantinos Margaritis
845c321506
needed for ARM vector type conversions
2020-10-30 10:50:24 +02:00
Konstantinos Margaritis
74919026c3
fix compilation paths for cpuid_flags for x86
2020-10-16 12:29:45 +03:00
Konstantinos Margaritis
64535610f5
add arm simple cpuid_flags
2020-10-15 16:26:04 +03:00
Konstantinos Margaritis
a0912a2ce5
move cpuid_flags.h header to common
2020-10-08 20:49:33 +03:00
Konstantinos Margaritis
cde2f4c81b
add necessary modifications to CMake system to enable building on ARM, add arm_neon.h intrinsic header to intrinsics.h
2020-10-06 12:44:23 +03:00
Konstantinos Margaritis
09993e5190
fix include paths for masked_move
2020-09-18 12:55:57 +03:00
Konstantinos Margaritis
cc81d7f578
move cpuid stuff to util/arch/x86
2020-09-17 20:35:39 +03:00
Hong, Yang A
c758cdfb07
Bump version number for release
2020-05-25 14:22:49 +00:00