We really need -O2.5 that's like "yeah, go ahead and vectorize, but let's not go *completely* nuts, ok?" Or we could just kill -O3, that would be OK with me.
new optimization levels: -O0 -O1 -Os -O2 -OGentoo
-Olordhecomin
Replying to @stephentyrone @jeeger and
My original question is really about optimizing for maximum number of opcodes, or -Oops
Replying to @moyix @stephentyrone and
brb building a superoptimizer that saves opcodes by doing incredibly slow and horrible things that thrash the pipeline
Replying to @stephentyrone @FioraAeterna and
Has someone filed a bug on getting LLVM to emit XLAT where possible yet
Replying to @pcwalton @stephentyrone and
my favorite x86 instruction, to be quite honest. right up there with xadd
I wanted to "suggest" using AAD with arbitrary base to save code size but Intel is apparently hell bent on ruining all my fun pic.twitter.com/bJfVCLVOiT
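For anyone who hasn't met the trick: AAD's immediate byte is normally 10, but the encoding accepts any value, so one two-byte instruction computes AL = AH * base + AL (mod 256) and clears AH. A rough model of that behavior, where the function name `aad` is made up for illustration:

```python
def aad(ah, al, base=10):
    """Model of x86 AAD with an arbitrary immediate.

    Returns the new (AH, AL) pair: AL = (AL + AH * base) mod 256, AH = 0.
    Flags are not modeled.
    """
    new_al = (al + ah * base) & 0xFF
    return 0, new_al


# Two unpacked decimal digits 4 and 2 become the binary value 42.
print(aad(4, 2))
# With base 16 the same instruction packs two hex nibbles into one byte.
print(hex(aad(0xA, 0xB, base=16)[1]))
```

This only shows what the instruction computes. It says nothing about how fast it runs on any given CPU.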
Fun fact: There is a piece of popular software out there that uses DAA. …it’s ZSNES, to implement the SNES CPU’s BCD mode.
I wonder if this is the right thing? - it looks like DAA might not do exactly the same thing for all inputs as 6502 BCD mode, but I only gave it a quick glance. Might write some test code for this, if this thought is still bothering me tomorrow...
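The test code wondered about here could be sketched roughly as below. This is not ZSNES's code, and both models are assumptions: `x86_adc_daa` follows the DAA pseudocode in Intel's manual, applied after an 8-bit add-with-carry, and `mos6502_adc_decimal` follows a commonly cited description of NMOS 6502 decimal-mode ADC. Only the accumulator and carry are compared (N, V and Z are ignored), the function names are made up for illustration, and whether the SNES's 65816 matches the NMOS model on invalid BCD inputs is not verified here.

```python
def x86_adc_daa(a, b, carry_in):
    """Binary ADC of two bytes followed by DAA. Returns (AL, CF)."""
    total = a + b + carry_in
    old_al = total & 0xFF
    old_cf = total > 0xFF
    # AF is the carry out of the low nibble of the binary add.
    af = (a & 0x0F) + (b & 0x0F) + carry_in > 0x0F
    al = old_al
    if (al & 0x0F) > 9 or af:
        al = (al + 0x06) & 0xFF
    # The high-digit fixup tests the value from *before* the low fixup.
    cf = old_al > 0x99 or old_cf
    if cf:
        al = (al + 0x60) & 0xFF
    return al, int(cf)


def mos6502_adc_decimal(a, b, carry_in):
    """Assumed NMOS 6502 decimal-mode ADC. Returns (A, C)."""
    lo = (a & 0x0F) + (b & 0x0F) + carry_in
    if lo >= 0x0A:
        lo = ((lo + 0x06) & 0x0F) + 0x10
    total = (a & 0xF0) + (b & 0xF0) + lo
    if total >= 0xA0:
        total += 0x60
    return total & 0xFF, int(total >= 0x100)


def is_bcd(x):
    """True if both nibbles of the byte are decimal digits."""
    return (x & 0x0F) <= 9 and (x >> 4) <= 9


def mismatches(valid_only):
    """List every (a, b, carry_in) where the two models disagree."""
    out = []
    for a in range(256):
        for b in range(256):
            if valid_only and not (is_bcd(a) and is_bcd(b)):
                continue
            for c in (0, 1):
                if x86_adc_daa(a, b, c) != mos6502_adc_decimal(a, b, c):
                    out.append((a, b, c))
    return out


print(len(mismatches(valid_only=True)))   # disagreements on valid BCD operands
print(len(mismatches(valid_only=False)))  # disagreements over all byte pairs
```

Under these two models the results agree for every pair of valid BCD operands and diverge only when a nibble is outside 0-9. For example, 0x0F + 0x0F gives 0x24 on the x86 side and 0x14 on the 6502 side. Whether that matters depends on whether any SNES software feeds invalid BCD into decimal mode.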