Papers
Value Pursuit Iteration
NIPS 2012
Timely Object Recognition
NIPS 2012
Optimistic planning for Markov decision processes
AISTATS 2012
On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes
NIPS 2012
Imitation Learning by Coaching
NIPS 2012
Regularized Off-Policy TD-Learning
NIPS 2012