Co-occurring keywords
Papers
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
NIPS 2024
Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration Bonus
NIPS 2024
Diversity Is Not All You Need: Training A Robust Cooperative Agent Needs Specialist Partners
NIPS 2024
Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation
NIPS 2024
Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch
NIPS 2024
Variational Delayed Policy Optimization
NIPS 2024
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
NIPS 2024
Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration
NIPS 2024
Policy Mirror Descent with Lookahead
NIPS 2024