On transformers: Transformers succeed because they combine several ideas at once: attention, the ability to run quickly on a GPU, and non-recurrence (so the network is less deep and therefore easier to optimize).
On self-play: “Self-play has a property that it can surprise us in truly novel ways.” (e.g., creative solutions to problems that weren’t anticipated, and that are actually useful!)