Papers
Optimization Methods for Interpretable Differentiable Decision Trees Applied to Reinforcement Learning
AISTATS 2020
Worst Cases Policy Gradients
CoRL 2019
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains
IJCAI 2019
Natural Option Critic
AAAI 2019
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies
IJCAI 2019
Trust Region Evolution Strategies
AAAI 2019