Tweets

  1. Pinned Tweet
    7 Mar.

    I published my research on the Apple M1 CPU microarchitectures (Firestorm and Icestorm), with instruction tables describing throughput, latency, and uops for most instructions, and (an insane number of) detailed experiments and measurements.

  2. 29 Aug.

    Oh, hey, found the screenshot:

  3. Retweeted
    14 Aug.

    Our August 2021 Progress Report is out! It's been a long time coming and we've made a lot of progress in the past few months, so be sure to check it out!

  4. Retweeted
    4 Aug.

    Happy to share a little experiment we are trying. I am not sure if it will stick but we will see. Here is how I personally think about the matter 1/?

  5. Retweeted
    31 Jul.

    This is the companion paper to uica, which I just finished reading. Beyond the throughput predictor, it has many new (previously either unknown or unconfirmed) details about many Intel microarchitecture generations. Highly recommended!

  6. Retweeted
    30 Jul.

    Today, I released uiCA, the " Code Analyzer". uiCA is based on data from , combined with a new detailed pipeline model. An online version (that also supports other tools) is available at (1/3)

  7. Retweeted
    20 Jul.

    After a month of reverse-engineering, I present documentation on the Arm® Mali™-G78 instruction set. Get it while it's hot 🔥

  8. Retweeted
    28 Jun.

    My friend did some fancy cryptanalysis of a hash function and wrote up a detailed explanation, with interactive diagrams and everything:

  9. 18 Jul.

    Oops, there's a typo in the image - the second group of 'v's should be 'p's. I'm also enjoying this old trick of multiplying by 0x0101010101010101 for popcount, which makes sense when you want the prefix sums of the popcounts of each byte.

  10. 16 Jul.

    And another variant, which would be okay for doing a single vector if clang's implementation of the PMULL2 intrinsic reliably generated a PMULL2:

  11. 16 Jul.

    I've been looking a bit at move-mask replacements on the M1 - I think this variation is new? And I ended up looking at other ARM64 optimisation ideas for simdjson:

    [Image: Arm SIMD intrinsics code implementing move-mask for four vectors, using four multiplies by a magic constant, two paired adds, and a table lookup, described in a little more detail at the link.]

  12. Retweeted
    12 Jul.

    Great article by on memory disambiguation for speculative load/store reordering. I keep forgetting how much stuff is on the uarch-bench wiki.

  13. Retweeted
    10 Jul.

    New blog post: "Entropy coding in Oodle Data: the big picture" Relatively short one, first of what will be a series.

  14. Retweeted
    8 Jul.

    We're writing a new mach-o linker. Today's Chrome Canary is the first Chrome/Mac release linked with it :)

  15. Retweeted
    7 Jul.

    One weird thing you can do with branch prediction/control speculation is to implement value speculation. Here's a demo/benchmark.

  16. Retweeted
    17 Sep. 2014

    Honestly, I'm not sure what to conclude from this.

  17. 8 Jun.

    I guess today we find out how many things depend on Fastly

  18. Retweeted
    4 Jun.

    Latency, throughput, and port usage data for Rocket Lake is now available at

  19. Retweeted
    1 Jun.

    Has someone done "I'm an x86 programmer" / "oh yeah, name 20 instructions" / "MOV" yet?

  20. Retweeted
    26 May

    For those who liked the M1RACLES name, it really didn't take long to come up with it! Though I think might have been idly thinking of ideas for a few minutes prior too :)

  21. Retweeted
    25 May

    You don't ship a new operating system every day, but today is that day.

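The 18 Jul. tweet mentions multiplying by 0x0101010101010101 for popcount and notes that the product holds the prefix sums of the per-byte popcounts. A minimal Python sketch of that general trick follows. It is not the code from the tweet's image, and the function names are hypothetical; Python integers are unbounded, so the 64-bit wraparound is emulated with a mask.

```python
MASK64 = 0xFFFFFFFFFFFFFFFF  # emulate a 64-bit register


def byte_popcounts(x: int) -> int:
    """Return a word whose byte i holds the popcount of byte i of x."""
    x &= MASK64
    # Classic SWAR reduction: 1-bit sums -> 2-bit sums -> 4-bit sums -> per-byte sums.
    x = x - ((x >> 1) & 0x5555555555555555)
    x = (x & 0x3333333333333333) + ((x >> 2) & 0x3333333333333333)
    x = (x + (x >> 4)) & 0x0F0F0F0F0F0F0F0F
    return x


def prefix_popcounts(x: int) -> int:
    """Return a word whose byte i holds the number of set bits in bytes 0..i of x.

    Multiplying by 0x0101010101010101 adds every lower byte into each byte.
    No carries cross byte boundaries because the largest sum is 64.
    """
    return (byte_popcounts(x) * 0x0101010101010101) & MASK64


def popcount64(x: int) -> int:
    """Total popcount: the top byte of the prefix sums."""
    return prefix_popcounts(x) >> 56
```

If only the total is needed, the top byte is enough; the lower bytes come for free and are useful when the running count per byte matters.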
