Papers
Online learning in episodic Markovian decision processes by relative entropy policy search
NIPS 2013
Guided Policy Search
ICML 2013
Value Pursuit Iteration
NIPS 2012
MDPs with Non-Deterministic Policies
NIPS 2008