Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
ICML 2023
Counterfactual Learning with General Data-Generating Policies
AAAI 2023
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
AAAI 2023
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes
ICML 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
ICML 2023
Online Reinforcement Learning with Uncertain Episode Lengths
AAAI 2023
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning
AAAI 2023
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning
AAAI 2023
Goal-Conditioned Q-learning as Knowledge Distillation
AAAI 2023
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control
AAAI 2023
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons
ICML 2023
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
EMNLP 2023
Q-functionals for Value-Based Continuous Control
AAAI 2023
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
AAAI 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
AAAI 2023
The Benefits of Model-Based Generalization in Reinforcement Learning
ICML 2023
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics
ICML 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
ICML 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
ICML 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
ICML 2023
Online Prototype Alignment for Few-shot Policy Transfer
ICML 2023
On the Convergence of SARSA with Linear Function Approximation
ICML 2023
Temporal Abstraction in Reinforcement Learning with the Successor Representation
JMLR 2023
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
JMLR 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
<
1
…
40
41
42
…
118
>