Papers
Regularized Policies are Reward Robust
AISTATS 2021
Online Sparse Reinforcement Learning
AISTATS 2021
Adaptive Approximate Policy Iteration
AISTATS 2021
Optimizing Percentile Criterion using Robust MDPs
AISTATS 2021
Q-learning with Logarithmic Regret
AISTATS 2021
Minimax Model Learning
AISTATS 2021
Non-Stationary Off-Policy Optimization
AISTATS 2021