Some comments on AlphaGo Zero:https://www.reddit.com/r/reinforcementlearning/comments/778vbk/mastering_the_game_of_go_without_human_knowledge/ …
-
Show this thread
-
So I'm just spitballing here but... is the stability of their tree-search self-play due to Hinton's dark knowledge preserving info thru time
2 replies 0 retweets 5 likesShow this thread -
Replying to @gwern
My uninformed guess fwiw: Go is trivial, so almost any brute-force algorithm would work if you throw enough GPUs at it
4 replies 0 retweets 7 likes -
Replying to @Meaningness @gwern
You really ARE emulating Dreyfus these days, are you? Care to put your money where your mouth is and brute force Go for a bit to show us?
3 replies 0 retweets 3 likes
If you are willing to pay for the GPU time, I would be tempted!
7:22 PM - 19 Oct 2017
0 replies
0 retweets
1 like
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.