Co-occurring keywords
Papers
Cross-Validated Off-Policy Evaluation
AAAI 2025
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
NIPS 2024
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation
NIPS 2024
Hyperparameter Optimization Can Even Be Harmful in Off-Policy Learning and How to Deal with It
IJCAI 2024
Policy Evaluation for Reinforcement Learning from Human Feedback: A Sample Complexity Analysis
AISTATS 2024
Multiple-policy High-confidence Policy Evaluation
AISTATS 2023