1/ How accurately compute Policy Gradients and uncertainty in #ReinforcementLearning? We propose Deep Bayesian Quadrature Policy Optimization https://arxiv.org/pdf/2006.15637
@kazizzad @yisongyue #AI #DeepLearning
-
-
.
@ravitej_17 is primary author of this workShow this threadThanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.