Co-occurring keywords
Papers
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
AISTATS 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
AISTATS 2022
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
NIPS 2022
Variational Model-based Policy Optimization
IJCAI 2021
Independence-aware Advantage Estimation
IJCAI 2021