Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
ICML 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
ICML 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
ICML 2023
Bootstrapped Representations in Reinforcement Learning
ICML 2023
Target-based Surrogates for Stochastic Optimization
ICML 2023
On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs
ICML 2023
Reward-Mixing MDPs with Few Latent Contexts are Learnable
ICML 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
ICML 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
ICML 2023
An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning
ICML 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
ICML 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
ICML 2023
Curious Replay for Model-based Adaptation
ICML 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
ICML 2023
Beyond Reward: Offline Preference-guided Policy Optimization
ICML 2023
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
ICML 2023
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
AAAI 2023
Hybrid Search for Efficient Planning with Completeness Guarantees
NIPS 2023
Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery
NIPS 2023
Optimizing Prompts for Text-to-Image Generation
NIPS 2023
Online Prototype Alignment for Few-shot Policy Transfer
ICML 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
ICML 2023
The Benefits of Model-Based Generalization in Reinforcement Learning
ICML 2023
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics
ICML 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
ICML 2023