Co-occurring keywords
Papers
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers
CORL 2019
Maximum Entropy Monte-Carlo Planning
NIPS 2019
Truly Proximal Policy Optimization
UAI 2019