Papers
The Fixed Points of Off-Policy TD
NIPS 2011
Transfer from Multiple MDPs
NIPS 2011
Speedy Q-Learning
NIPS 2011
Generalized TD Learning
JMLR 2011
Policy Gradient Coagent Networks
NIPS 2011
Double Q-learning
NIPS 2010
Model-Free Monte Carlo-like Policy Evaluation
AISTATS 2010