Papers
True Online Temporal-Difference Learning
JMLR 2016
Learning Simple Algorithms from Examples
ICML 2016
Deep Exploration via Bootstrapped DQN
NIPS 2016
Value Iteration Networks
NIPS 2016