Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
FLEX: an Adaptive Exploration Algorithm for Nonlinear Systems
ICML 2023
Human-Timescale Adaptation in an Open-Ended Task Space
ICML 2023
Efficient Online Reinforcement Learning with Offline Data
ICML 2023
A Tale of Sampling and Estimation in Discounted Reinforcement Learning
AISTATS 2023
Entropic Risk Optimization in Discounted MDPs
AISTATS 2023
Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models via Reinforcement Learning
AISTATS 2023
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
AISTATS 2023
One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning
AISTATS 2023
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables
AISTATS 2023
Mode-constrained Model-based Reinforcement Learning via Gaussian Processes
AISTATS 2023
A Finite Sample Complexity Bound for Distributionally Robust Q-learning
AISTATS 2023
Exploration in Reward Machines with Low Regret
AISTATS 2023
Social learning spontaneously emerges by searching optimal heuristics with deep reinforcement learning
ICML 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
ICML 2023
Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning
ICML 2023
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
JMLR 2023
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
JMLR 2023
Jointly Learning Band Selection and Filter Array Design for Hyperspectral Imaging
WACV 2023
Physically Plausible Animation of Human Upper Body From a Single Image
WACV 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
ICML 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
ICML 2023
Masked Trajectory Models for Prediction, Representation, and Control
ICML 2023
PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient
ICML 2023
Reachability-Aware Laplacian Representation in Reinforcement Learning
ICML 2023
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
ICML 2023