Central planning is unbeatable as a resource allocation mechanism for small systems. As you attempt to scale it up, it becomes terribly ineffective, to such an extent that decentralized control algorithms wastly outperform it, despite being inherently wasteful
-
-
As for sampling -- that can only work if your objective space has a structure that can be easily embedded in a continuous space. Basically why deep RL doesn't work
-
Saying "deep RL doesn't work" is an excessively broad claim. The fact that the value function of Go can be embedded in this way makes me question the significant of this difficulty.
End of conversation
New conversation -
-
-
So that would imply this isn't a backprop problem, but rather a problem with optimization as a field? Could you give an example of an "open ended problem"
-
With GANs, you know what the end product is going to be: a model that has learned the latent distribution of your data and that you can sample from. You can train multiple GANs and get approximately the same results. So you know B and you know how to get from A to B
- Show replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.