Given that _mm_add_epi32() and _mm_sub_epi32() add and subtract four 32-bit integers respectively, you might think that _mm_mul_epi32() multiplies four 32-bit integers. Oh, if only things were that easy in SSE land.
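To make the surprise concrete, here is a scalar sketch in plain C of what the two SSE4.1 multiply intrinsics compute. The `model_*` function names are made up for illustration and are not part of any header. `_mm_mul_epi32()` multiplies only the even-indexed 32-bit lanes and returns two full 64-bit products, while `_mm_mullo_epi32()` is the one that does four 32-bit multiplies and keeps the low 32 bits of each.

```c
#include <stdint.h>

/* Hypothetical scalar model of _mm_mul_epi32 (PMULDQ): only lanes 0 and 2
   are used, and each product is a full signed 64-bit value. */
void model_mul_epi32(const int32_t a[4], const int32_t b[4], int64_t out[2]) {
    out[0] = (int64_t)a[0] * (int64_t)b[0];
    out[1] = (int64_t)a[2] * (int64_t)b[2];
}

/* Hypothetical scalar model of _mm_mullo_epi32 (PMULLD): four products,
   each truncated to its low 32 bits. The multiply is done in unsigned
   arithmetic so that wraparound is well defined. */
void model_mullo_epi32(const int32_t a[4], const int32_t b[4], int32_t out[4]) {
    for (int i = 0; i < 4; i++) {
        out[i] = (int32_t)((uint32_t)a[i] * (uint32_t)b[i]);
    }
}
```

With `a = {3, 100, -5, 200}` and `b = {7, 100, 6, 200}`, the first model ignores lanes 1 and 3 entirely and yields two 64-bit results, and the second yields four 32-bit results.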
My favorite is that AVX2's ALIGNR is very different from SSE's ALIGNR. https://github.com/rust-lang/regex/blob/60d087a23025e045ae754a345b04003c31d83d93/src/vector/avx2.rs#L164-L166
Replying to @burntsushi5
There are several AVX2 instructions like that (the shifts and shuffles work that way too). Apparently it’s because on Haswell they basically implemented AVX2 on top of the 128-bit SSE units.
7:44 AM - 8 Feb 2019
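A scalar sketch of the ALIGNR difference, again in plain C with made-up `model_*` names: the SSE version (`_mm_alignr_epi8`) concatenates two 16-byte vectors and shifts the 32-byte result right by a byte count, while the AVX2 version (`_mm256_alignr_epi8`) does that same 16-byte operation independently on each 128-bit lane rather than shifting across the full 256 bits.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model of _mm_alignr_epi8(a, b, shift):
   form the 32-byte value a:b (b in the low half), shift it right by
   `shift` bytes, and keep the low 16 bytes. Bytes shifted in are zero. */
void model_alignr128(const uint8_t a[16], const uint8_t b[16],
                     unsigned shift, uint8_t out[16]) {
    uint8_t concat[32];
    memcpy(concat, b, 16);      /* b is the low half  */
    memcpy(concat + 16, a, 16); /* a is the high half */
    for (unsigned i = 0; i < 16; i++) {
        unsigned src = i + shift;
        out[i] = (src < 32) ? concat[src] : 0;
    }
}

/* Hypothetical scalar model of _mm256_alignr_epi8(a, b, shift):
   the 128-bit operation applied separately to the low and high lanes.
   No byte ever crosses from one lane into the other. */
void model_alignr256(const uint8_t a[32], const uint8_t b[32],
                     unsigned shift, uint8_t out[32]) {
    model_alignr128(a, b, shift, out);
    model_alignr128(a + 16, b + 16, shift, out + 16);
}
```

If `b` holds bytes 0 to 31 and `a` holds bytes 32 to 63, a shift of 4 gives 4..15 followed by 32..35 in the low lane and 20..31 followed by 48..51 in the high lane, not the contiguous run 4..35 that a true 256-bit shift would produce.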