Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Continual Task Allocation in Meta-Policy Network via Sparse Prompting
ICML 2023
The Benefits of Model-Based Generalization in Reinforcement Learning
ICML 2023
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
EMNLP 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
ICML 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
ICML 2023
Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
L4DC 2023
LipsNet: A Smooth and Robust Neural Network with Adaptive Lipschitz Constant for High Accuracy Optimal Control
ICML 2023
Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning
L4DC 2023
The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning
ICML 2023
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
ICML 2023
Quantile Credit Assignment
ICML 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
L4DC 2023
Multi-Agent Reinforcement Learning with Reward Delays
L4DC 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction
ICML 2023
Eventual Discounting Temporal Logic Counterfactual Experience Replay
ICML 2023
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning
ICML 2023
Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning
ICML 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
ICML 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
ICML 2023
Reward-Mixing MDPs with Few Latent Contexts are Learnable
ICML 2023
What can online reinforcement learning with function approximation benefit from general coverage conditions?
ICML 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
EMNLP 2023
Retrosynthetic Planning with Dual Value Networks
ICML 2023
Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards
CORL 2023
<
1
…
39
40
41
…
155
>