1740 Commits

Author SHA1 Message Date
Konstantinos Margaritis
451d539f1d Power does not use -march 2021-12-02 18:01:26 +02:00
Konstantinos Margaritis
5aae719ecd fix build with clang, in particular VSX uses long long instead of int64_t, gcc allows this, clang does not 2021-12-02 18:01:00 +02:00
Konstantinos Margaritis
4aa32275f1 use same definition of the union for all types 2021-12-02 18:00:02 +02:00
Konstantinos Margaritis
5d23e6dab6 set -msse4.2 only on Intel 2021-12-01 21:45:31 +00:00
Konstantinos Margaritis
1f4143de81 rework CMakeLists.txt to ensure it works with clang 2021-12-01 23:23:37 +02:00
Konstantinos Margaritis
0221dc1771 fix misompilations with clang++, as it is more strict 2021-12-01 23:22:15 +02:00
Konstantinos Margaritis
7d600c4fcb bump base requirements to SSE4.2 2021-12-01 23:20:02 +02:00
Konstantinos Margaritis
404a0ab0f4 fix miscompilation with clang 2021-12-01 23:18:57 +02:00
Konstantinos Margaritis
6f20276b2f
Merge pull request #80 from VectorCamp/bugfix/fix-SVE2-build
fix SVE2 build after the changes
2021-11-25 22:19:12 +02:00
Konstantinos Margaritis
81fba99f3a fix SVE2 build after the changes 2021-11-25 18:48:24 +02:00
Konstantinos Margaritis
8dec4e8d85
Merge pull request #79 from Apostolos00tapsas/feature/complete-power9-VSX-support
Feature/complete power9 vsx support
2021-11-25 18:40:17 +02:00
Konstantinos Margaritis
7ceca78db4 fix unit-internal release builds using __builtin_constant_p() as well 2021-11-25 15:09:01 +02:00
Konstantinos Margaritis
00384c9e37 nit 2021-11-25 06:21:07 +00:00
Konstantinos Margaritis
cd95b1a38c use __builtin_constant_p() instead for arm as well 2021-11-25 06:20:53 +00:00
Apostolos Tapsas
725a8d8f1a Removed duplicates 2021-11-24 15:09:53 +00:00
Apostolos Tapsas
35e5369c70 *fix palignr implementation for VSX Release mode
*add unit test for palignr
*enable unit test building for Release mode
2021-11-24 15:03:49 +00:00
Apostolos Tapsas
bfc8da1102 Removed accidentaly included header file 2021-11-24 12:11:21 +00:00
Apostolos Tapsas
e13bfec734 found and solved very hard to track bug of intrinsic function palignr, that manifested only in Release builds and not Debug builds in a particular number of tests 2021-11-24 11:18:18 +00:00
Apostolos Tapsas
0287724413 WIP:tracking last bugs in failing tests for release build 2021-11-16 15:24:22 +00:00
Apostolos Tapsas
54158a1746 vermicelli and match implementations for ppc64el added 2021-11-13 19:36:46 +00:00
apostolos
e09d8674b4 resolving conficts after merging 2021-11-13 18:58:22 +02:00
Konstantinos Margaritis
7e7c50bdd7
Merge pull request #78 from VectorCamp/feature/refactor-vermicelli
Feature/refactor vermicelli
2021-11-12 23:06:46 +02:00
apostolos
4114b8a480 SuperVector opandnot test enriched 2021-11-10 15:12:25 +02:00
apostolos
942deb7d80 test for load m128 from u64a function added 2021-11-10 09:01:28 +02:00
Konstantinos Margaritis
41b98d7d8f add len parameter to arm matchers as well 2021-11-08 19:45:36 +00:00
Konstantinos Margaritis
dcf6b59e8d split vermicelli block implementations per arch 2021-11-08 19:45:21 +00:00
Apostolos Tapsas
82bea29f4e simd_utils functions fixed 2021-11-08 14:22:58 +00:00
Apostolos Tapsas
ba90cdeb5a SuperVector constructors as well as andnot implementation fixed 2021-11-05 13:34:48 +00:00
Konstantinos Margaritis
24fa54081b add len parameter and mask, fixes corner cases on AVX512 2021-11-05 14:30:22 +02:00
Konstantinos Margaritis
210295a702 remove vermicelli.h and replace it with vermicelli.hpp 2021-11-02 22:30:53 +02:00
Konstantinos Margaritis
869d2bd53b refactor vermicelliDoubleMaskedExec() 2021-11-02 22:30:21 +02:00
Konstantinos Margaritis
16f3cca98b add vermicelli.hpp to includes 2021-11-01 16:40:17 +00:00
Konstantinos Margaritis
59505f98ba remove vermicelli_sse.h 2021-11-01 16:40:01 +00:00
Konstantinos Margaritis
d55c74b6c4 fix arm matchers 2021-11-01 16:31:38 +00:00
Konstantinos Margaritis
f6fd845400 complete refactoring and unification of Vermicelli functions 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
d47641c2fc remove unneeded header 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
bc1a1127cf add new include file 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
5eabceddcf renamed matcher functions, added new ones for Vermicelli 2021-11-01 16:28:50 +00:00
Konstantinos Margaritis
16e5e2ae64 nits 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
713aaef799 move casemask helper functions to separate header 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
4a569affbc add to CMake 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
2fa947af9c added refactored vermicelli_simd.cpp implementation 2021-11-01 16:05:43 +00:00
Konstantinos Margaritis
9abfdcaa84 add Vermicelli/RVermicelli to microbenchmark utility 2021-11-01 16:53:21 +02:00
Konstantinos Margaritis
7b65b298c1 add arm vector types in union, avoid -flax-conversions, fix castings 2021-11-01 16:52:17 +02:00
Konstantinos Margaritis
44dc75a3ea complete refactoring and unification of Vermicelli functions 2021-11-01 16:51:18 +02:00
Konstantinos Margaritis
f4a490ac00 remove unneeded header 2021-11-01 16:50:38 +02:00
apostolos
d9d39d48c5 prints commants and formating fixes 2021-11-01 10:09:15 +02:00
Konstantinos Margaritis
dd45bf0d35 add new include file 2021-10-27 12:32:54 +03:00
Konstantinos Margaritis
8ae6e613cb renamed matcher functions, added new ones for Vermicelli 2021-10-27 12:32:03 +03:00
Konstantinos Margaritis
70414574ee nits 2021-10-27 12:31:04 +03:00