Co-occurring keywords
Papers
Learning from Interventions: Human-robot interaction as both explicit and implicit feedback
RSS 2020
Optimization Methods for Interpretable Differentiable Decision Trees Applied to Reinforcement Learning
AISTATS 2020
Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games
NIPS 2020
Contextual Online False Discovery Rate Control
AISTATS 2020
A PTAS for the Bayesian Thresholding Bandit Problem
AISTATS 2020
On Regret with Multiple Best Arms
NIPS 2020
Bandit Linear Control
NIPS 2020
From Finite to Countable-Armed Bandits
NIPS 2020