Papers
Model-free Policy Learning with Reward Gradients
AISTATS 2022
Explicable Policy Search
NIPS 2022
Episodic Policy Gradient Training
AAAI 2022