Given that _mm_add_epi32() and _mm_sub_epi32() add and subtract four 32-bit integers respectively, you might think that _mm_mul_epi32() multiplies four 32-bit integers. Oh, if only things were that easy in SSE land.
-
-
There are several AVX2 instructions like that (the shifts and shuffles work that way too). Apparently it’s because on Haswell they basically implemented AVX2 on top of the 128-bit SSE units.
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.