#ReinforcementLearning of partially observable Markov decision process #POMDP using #tensors. Regret bounds http://goo.gl/4n996o
3:04 PM - 7 Mar 2016
0 replies
2 retweets
19 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.