Co-occurring keywords
Papers
Projected Natural Actor-Critic
NIPS 2013
Bayesian Mixture Modelling and Inference based Thompson Sampling in Monte-Carlo Tree Search
NIPS 2013
Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result
NIPS 2013
Online learning in episodic Markovian decision processes by relative entropy policy search
NIPS 2013
Value Pursuit Iteration
NIPS 2012