Papers
Clipped Action Policy Gradient
ICML 2018
Policy Optimization with Demonstrations
ICML 2018
Dual Policy Iteration
NIPS 2018
Balanced Policy Evaluation and Learning
NIPS 2018
Learning Abstract Options
NIPS 2018
Configurable Markov Decision Processes
ICML 2018
Is the Bellman residual a bad proxy?
NIPS 2017