Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Multi-Agent Reinforcement Learning with Reward Delays
L4DC 2023
User Simulator Assisted Open-ended Conversational Recommendation System
ACL 2023
Enhancing Educational Dialogues: A Reinforcement Learning Approach for Generating AI Teacher Responses
ACL 2023
Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning
ACL 2023
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
ICML 2023
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics
ICML 2023
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
L4DC 2023
Krylov–Bellman boosting: Super-linear policy evaluation in general state spaces
AISTATS 2023
Adaptive Ordered Information Extraction with Deep Reinforcement Learning
ACL 2023
Generating Dialog Responses with Specified Grammatical Items for Second Language Learning
ACL 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
ICML 2023
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
ICML 2023
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
ICML 2023
Continual Task Allocation in Meta-Policy Network via Sparse Prompting
ICML 2023
Abstract then Play: A Skill-centric Reinforcement Learning Framework for Text-based Games
ACL 2023
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions
L4DC 2023
HACMan: Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation
CORL 2023
Eventual Discounting Temporal Logic Counterfactual Experience Replay
ICML 2023
The Benefits of Model-Based Generalization in Reinforcement Learning
ICML 2023
GeoDRL: A Self-Learning Framework for Geometry Problem Solving using Reinforcement Learning in Deductive Reasoning
ACL 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Quantile Credit Assignment
ICML 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction
ICML 2023
Retrosynthetic Planning with Dual Value Networks
ICML 2023
The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning
ICML 2023
<
1
…
38
39
40
…
155
>