Co-occurring keywords
Papers
Timely Object Recognition
NIPS 2012
Optimistic planning for Markov decision processes
AISTATS 2012
On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes
NIPS 2012
Imitation Learning by Coaching
NIPS 2012
Regularized Off-Policy TD-Learning
NIPS 2012
The Fixed Points of Off-Policy TD
NIPS 2011
Transfer from Multiple MDPs
NIPS 2011