Matthew Barr
|
f6b688fc06
|
rename pshufb to pshufb_m128
|
2017-05-30 13:59:23 +10:00 |
|
Matthew Barr
|
a295c96198
|
rename vpshufb to pshufb_m256
|
2017-05-30 13:59:23 +10:00 |
|
Matthew Barr
|
eabe408e2b
|
avx512: shufti
|
2017-05-30 13:59:23 +10:00 |
|
Matthew Barr
|
423569ec82
|
De-multiaccel
|
2017-05-30 13:59:00 +10:00 |
|
Matthew Barr
|
8201183138
|
Check compiler architecture flags in one place
|
2017-04-26 15:18:26 +10:00 |
|
Matthew Barr
|
5a842caaf1
|
shufti: slightly faster short shufti operation
It is better to shift the high lane values in an XMM
before then using insert to combine the high and low lanes.
|
2016-12-02 11:33:51 +11:00 |
|
Matthew Barr
|
99e14df117
|
Fix combine2x128
|
2016-12-02 11:33:48 +11:00 |
|
Matthew Barr
|
c3b5efefb6
|
Add short avx2 shufti form
|
2016-10-28 14:46:31 +11:00 |
|
Matthew Barr
|
e3d416a6ea
|
Apply some consistency to the names we give shifts
|
2016-07-08 11:07:50 +10:00 |
|
Matthew Barr
|
4d6934fc77
|
Move limex specific shuffle utils and ssse3 funcs
|
2016-07-08 11:07:50 +10:00 |
|
Anatoly Burakov
|
47b17ade27
|
Multibyte shufti runtime
|
2016-03-01 11:21:39 +11:00 |
|
Matthew Barr
|
904e436f11
|
Initial commit of Hyperscan
|
2015-10-20 09:13:35 +11:00 |
|