Co-occurring keywords
Papers
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
JMLR 2024
Preference-based Pure Exploration
NIPS 2024
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
JMLR 2024
Sample-Efficient Personalization: Modeling User Parameters as Low Rank Plus Sparse Components
AISTATS 2024
Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits
AISTATS 2024
Dissimilarity Bandits
AISTATS 2024
Complexity of Single Loop Algorithms for Nonlinear Programming with Stochastic Objective and Constraints
AISTATS 2024