Papers
Achieving $\tilde{O}(1/\epsilon)$ Sample Complexity for Constrained Markov Decision Process
NIPS 2024
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs
NIPS 2024
Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning
NIPS 2024
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
NIPS 2024
Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage
NIPS 2024
Preference-based Pure Exploration
NIPS 2024
Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
NIPS 2024