Co-occurring keywords
Papers
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
NIPS 2019
Consistent On-Line Off-Policy Evaluation
ICML 2017
Bounded Off-Policy Evaluation with Missing Data for Course Recommendation and Curriculum Design
ICML 2016
Toward Minimax Off-policy Value Estimation
AISTATS 2015
Model-Free Monte Carlo-like Policy Evaluation
AISTATS 2010