Some comments on AlphaGo Zero: https://www.reddit.com/r/reinforcementlearning/comments/778vbk/mastering_the_game_of_go_without_human_knowledge/ …
-
-
Strictly speaking, there are lots of algos which are universal/consistent and which you could use w/near-unlimited resources, e.g. tabular Q-learning.
-
OpenAI's (Salimans et al.) simple evo strategies RL was like 10x more sample-inefficient, so possibly you could evolve an AG with 10k GPUs as well.
-
-
You know, Dreyfus died a few months back. 'Always two there are... a master and an apprentice.'
-
I am interviewing applicants for the recently opened position. Interested?
-
-
If you are willing to pay for the GPU time, I would be tempted!