Reinforcement Learning

2932 directly classified papers

Papers

Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints ICML 2023

MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations ICML 2023

Learning in POMDPs is Sample-Efficient with Hindsight Observability ICML 2023

Bootstrapped Representations in Reinforcement Learning ICML 2023

Target-based Surrogates for Stochastic Optimization ICML 2023

On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs ICML 2023

Reward-Mixing MDPs with Few Latent Contexts are Learnable ICML 2023

Deep Laplacian-based Options for Temporally-Extended Exploration ICML 2023

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice ICML 2023

An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning ICML 2023

Model-based Offline Reinforcement Learning with Count-based Conservatism ICML 2023

LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework ICML 2023

Curious Replay for Model-based Adaptation ICML 2023

Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning ICML 2023

Beyond Reward: Offline Preference-guided Policy Optimization ICML 2023

Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments ICML 2023

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm AAAI 2023

Hybrid Search for Efficient Planning with Completeness Guarantees NIPS 2023

Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery NIPS 2023

Optimizing Prompts for Text-to-Image Generation NIPS 2023

Online Prototype Alignment for Few-shot Policy Transfer ICML 2023

An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning ICML 2023

The Benefits of Model-Based Generalization in Reinforcement Learning ICML 2023

Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics ICML 2023

The Wisdom of Hindsight Makes Language Models Better Instruction Followers ICML 2023