Conversation

Yep, this is that chart. Surprised how many people don’t notice the scale and assume it’s linear. Also, sometimes hits you differently even if you do process the scale ;)
Image
2
7
Show replies
Replying to and
Could that say more that current deep rl approaches are inefficient since other breakthroughs like squeeze and excitation, ulm fit, transformers, and enas don’t need that much computational power?