Papers
Regularized Policies are Reward Robust
AISTATS 2021
Adaptive Approximate Policy Iteration
AISTATS 2021
Provable Hierarchical Imitation Learning via EM
AISTATS 2021
Optimizing Percentile Criterion using Robust MDPs
AISTATS 2021
Provably Safe PAC-MDP Exploration Using Analogies
AISTATS 2021
Q-learning with Logarithmic Regret
AISTATS 2021
Minimax Model Learning
AISTATS 2021
Logistic Q-Learning
AISTATS 2021
Expected Eligibility Traces
AAAI 2021