It's not a good case for Intel's SIMD traditionally, because it wasn't designed very well :/ The LRB instructions...
... has scatter/gather and masked writes which makes this sort of thing easier, but not sure if they made AVX512.
-
-
Maybe ask
@tom_forsyth if AVX512 has good stuff for this? I'm not sure. -
But yeah, SSE2-wise, it's pretty janky, and you tend to have to do mask-and-shift stuff.
- Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.