Some comments on AlphaGo Zero:https://www.reddit.com/r/reinforcementlearning/comments/778vbk/mastering_the_game_of_go_without_human_knowledge/ …
-
-
No, he doesn't say that in the article, the article hardly even mentions stability. That one is in the AmA (excerpted in my /r/RL comments).
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.