Co-occurring keywords
reinforcement learning
(4122)
temporal difference learning
(149)
value function
(294)
offline reinforcement learning
(492)
causal inference
(1619)
function approximation
(319)
off-policy learning
(227)
markov decision process
(788)
temporal-difference learning
(42)
linear function approximation
(101)
Papers
Policy Evaluation Using the Ω-Return
NIPS 2015
Generalized TD Learning
JMLR 2011
Model-Free Monte Carlo-like Policy Evaluation
AISTATS 2010