You can’t just give OpenAI a mulligan on vision, if you want to claim superhuman performance. Not saying there is zero progess here but on the question of inducing a game from experience, and beating humans on a level playing field, it’s a step back compared to DQN on Atari.https://twitter.com/Smerity/status/1031248288958177280 …
Surely I didn’t single them out. I wrote a whole arXiv on AlphaGo, in January...
-
-
Fair enough, bad wording on my part. I just meant that sample efficiency is clearly an open problem to be addressed, yes, but the accomplishment here should still be recognized. I also wrote a whole article on this after all and tend to agreehttps://thegradient.pub/why-rl-is-flawed/ …
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.