Papers
High-Throughput Synchronous Deep RL
NeurIPS 2020
A Self-Tuning Actor-Critic Algorithm
NeurIPS 2020
Positive-Unlabeled Reward Learning
CoRL 2020