George Wort
|
0f39535621
|
Move SVE functions into their own files.
Change-Id: I995ba4b7d2b558ee403693ee45d747d414d3b177
|
2021-07-26 19:15:58 +03:00 |
|
George Wort
|
a94219aaed
|
Use SVE for single shufti.
Change-Id: Ic76940c5bb9b81a1c45d39e9ca396a158c50a7dc
|
2021-07-26 19:15:58 +03:00 |
|
George Wort
|
185c45263b
|
Add SVE2 support for dvermicelli
Change-Id: I056ef15e162ab6fb1f78964321ce893f4096367e
|
2021-07-26 19:15:58 +03:00 |
|
George Wort
|
7d7d31ec0d
|
Add SVE2 support for vermicelli
Change-Id: Ia025de53521fbaefe5fb1e4425aaf75c7d80a14e
|
2021-07-26 00:10:54 +03:00 |
|
Konstantinos Margaritis
|
be66cdb51d
|
fixes in shifting primitives
|
2021-02-08 19:38:20 +02:00 |
|
Konstantinos Margaritis
|
e2f253d8ab
|
remove loads from movemask128, variable_byte_shift, add palignr_imm(), minor fixes
|
2021-01-25 12:13:35 +02:00 |
|
Konstantinos Margaritis
|
5b85589274
|
add some useful intrinsics
|
2021-01-25 12:13:35 +02:00 |
|
Konstantinos Margaritis
|
773dc6fa69
|
optimize *shiftbyte_m128() functions to use palign instead of variable_byte_shift_m128()
|
2020-12-07 23:12:26 +02:00 |
|
Konstantinos Margaritis
|
38477b08bc
|
fix movq and load_m128_from_u64a and resp. test for NEON
|
2020-12-03 19:27:38 +02:00 |
|
Konstantinos Margaritis
|
1c26f044a7
|
when building in debug mode, vgetq_lane_*() and vextq_*() need immediate operands, and we have to use switch()'ed versions
|
2020-11-24 17:56:40 +02:00 |
|
Konstantinos Margaritis
|
7b8cf97546
|
add extra instructions (currently arm-only), fix order of elements in set4x32/set2x64
|
2020-11-05 19:18:53 +02:00 |
|
Konstantinos Margaritis
|
548242981d
|
fix ARM implementations
|
2020-10-30 10:38:41 +02:00 |
|
Konstantinos Margaritis
|
c5a7f4b846
|
add ARM simd_utils vectorized functions for 128-bit vectors
|
2020-10-15 16:26:49 +03:00 |
|
Konstantinos Margaritis
|
31ac6718dd
|
add ARM version of simd_utils.h
|
2020-10-13 09:19:56 +03:00 |
|