Co-occurring keywords
reinforcement learning
(4122)
temporal difference learning
(149)
value function
(294)
offline reinforcement learning
(492)
causal inference
(1619)
function approximation
(319)
off-policy learning
(227)
markov decision process
(788)
temporal-difference learning
(42)
linear function approximation
(101)
Papers
Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
NIPS 2019
Planning with Expectation Models
IJCAI 2019
Balanced Policy Evaluation and Learning
NIPS 2018
Efficient Reinforcement Learning with Hierarchies of Machines by Leveraging Internal Transitions
IJCAI 2017
Differentially Private Policy Evaluation
ICML 2016