Sometimes I wish I had never learned that division takes more cycles than multiplication. Now I’m cursed to forever microoptimize
-
-
I don’t only mean recip + mul, but also actual divide instruction. 1/cycle/core throughput or better will gradually become common (but with longer latency than mul).
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.