Commit Graph

1593 Commits

Author SHA1 Message Date
Konstantinos Margaritis
c68e46489b fix build failures with clang on x86, make sure compilation works on other Power as well 2021-12-03 16:24:58 +02:00
Konstantinos Margaritis
1609e7a56e clang is more strict 2021-12-02 23:09:53 +02:00
Konstantinos Margaritis
d2cbac5f1f don't fail if mtune does not return a valid configuration 2021-12-02 23:09:34 +02:00
Konstantinos Margaritis
a728878e50 Power does not use -march 2021-12-02 18:01:26 +02:00
Konstantinos Margaritis
d86e6bed69 fix build with clang, in particular VSX uses long long instead of int64_t, gcc allows this, clang does not 2021-12-02 18:01:00 +02:00
Konstantinos Margaritis
65bd33ffff use same definition of the union for all types 2021-12-02 18:00:02 +02:00
Konstantinos Margaritis
4249f5a2a9 set -msse4.2 only on Intel 2021-12-01 21:45:31 +00:00
Konstantinos Margaritis
abeb50843c rework CMakeLists.txt to ensure it works with clang 2021-12-01 23:23:37 +02:00
Konstantinos Margaritis
8520cbc524 fix miscompilations with clang++, as it is more strict 2021-12-01 23:22:15 +02:00
Konstantinos Margaritis
896d28845c bump base requirements to SSE4.2 2021-12-01 23:20:02 +02:00
Konstantinos Margaritis
5251c328b3 fix miscompilation with clang 2021-12-01 23:18:57 +02:00
Konstantinos Margaritis
8ac83c11e3 Merge pull request #80 from VectorCamp/bugfix/fix-SVE2-build
fix SVE2 build after the changes
2021-11-25 22:19:12 +02:00
Konstantinos Margaritis
cc1a8dd47e fix SVE2 build after the changes 2021-11-25 18:48:24 +02:00
Konstantinos Margaritis
2ce799c229 Merge pull request #79 from Apostolos00tapsas/feature/complete-power9-VSX-support
Feature/complete power9 vsx support
2021-11-25 18:40:17 +02:00
Konstantinos Margaritis
7862af63ec fix unit-internal release builds using __builtin_constant_p() as well 2021-11-25 15:09:01 +02:00
Konstantinos Margaritis
77f9b7edf9 nit 2021-11-25 06:21:07 +00:00
Konstantinos Margaritis
959fea25f7 use __builtin_constant_p() instead for arm as well 2021-11-25 06:20:53 +00:00
Apostolos Tapsas
d73bf231ee Removed duplicates 2021-11-24 15:09:53 +00:00
Apostolos Tapsas
e655d76a01 *fix palignr implementation for VSX Release mode
*add unit test for palignr
*enable unit test building for Release mode
2021-11-24 15:03:49 +00:00
Apostolos Tapsas
8a6c3f81e4 Removed accidentally included header file 2021-11-24 12:11:21 +00:00
Apostolos Tapsas
bc2dcc317d found and solved very hard to track bug of intrinsic function palignr, that manifested only in Release builds and not Debug builds in a particular number of tests 2021-11-24 11:18:18 +00:00
Apostolos Tapsas
df03302062 WIP:tracking last bugs in failing tests for release build 2021-11-16 15:24:22 +00:00
Apostolos Tapsas
aac39f3208 vermicelli and match implementations for ppc64el added 2021-11-13 19:36:46 +00:00
apostolos
2136580d50 resolving conflicts after merging 2021-11-13 18:58:22 +02:00
Konstantinos Margaritis
092e217e85 Merge pull request #78 from VectorCamp/feature/refactor-vermicelli
Feature/refactor vermicelli
2021-11-12 23:06:46 +02:00
apostolos
6440d18b48 SuperVector opandnot test enriched 2021-11-10 15:12:25 +02:00
apostolos
537d81a27e test for load m128 from u64a function added 2021-11-10 09:01:28 +02:00
Konstantinos Margaritis
a69fe428e3 add len parameter to arm matchers as well 2021-11-08 19:45:36 +00:00
Konstantinos Margaritis
3fd710706a split vermicelli block implementations per arch 2021-11-08 19:45:21 +00:00
Apostolos Tapsas
5611771a1a simd_utils functions fixed 2021-11-08 14:22:58 +00:00
Apostolos Tapsas
5b18538373 SuperVector constructors as well as andnot implementation fixed 2021-11-05 13:34:48 +00:00
Konstantinos Margaritis
6317e24a82 add len parameter and mask, fixes corner cases on AVX512 2021-11-05 14:30:22 +02:00
Konstantinos Margaritis
694e2faf7f remove vermicelli.h and replace it with vermicelli.hpp 2021-11-02 22:30:53 +02:00
Konstantinos Margaritis
80286f38cb refactor vermicelliDoubleMaskedExec() 2021-11-02 22:30:21 +02:00
Konstantinos Margaritis
144f68ae03 add vermicelli.hpp to includes 2021-11-01 16:40:17 +00:00
Konstantinos Margaritis
c5e70b11b7 remove vermicelli_sse.h 2021-11-01 16:40:01 +00:00
Konstantinos Margaritis
570a3c7203 fix arm matchers 2021-11-01 16:31:38 +00:00
Konstantinos Margaritis
4db360c7b6 complete refactoring and unification of Vermicelli functions 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
94b467dc12 remove unneeded header 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
0d886f7800 add new include file 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
54245bc5ac renamed matcher functions, added new ones for Vermicelli 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
1c1a06aaae nits 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
9e69273807 move casemask helper functions to separate header 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
814a4ef0c0 add to CMake 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
d6fe28afc8 added refactored vermicelli_simd.cpp implementation 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
b4d53349cd add Vermicelli/RVermicelli to microbenchmark utility 2021-11-01 16:53:21 +02:00
Konstantinos Margaritis
5e59b36634 add arm vector types in union, avoid -flax-conversions, fix castings 2021-11-01 16:52:17 +02:00
Konstantinos Margaritis
466650ceac complete refactoring and unification of Vermicelli functions 2021-11-01 16:51:18 +02:00
Konstantinos Margaritis
5dd0cd001e remove unneeded header 2021-11-01 16:50:38 +02:00
apostolos
3a4d8afb48 prints comments and formatting fixes 2021-11-01 10:09:15 +02:00