Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
TRAVEL: Tag-Aware Conversational FAQ Retrieval via Reinforcement Learning
EMNLP 2023
Improving Dialogue Discourse Parsing via Reply-to Structures of Addressee Recognition
EMNLP 2023
Diffused Task-Agnostic Milestone Planner
NIPS 2023
Belief Projection-Based Reinforcement Learning for Environments with Delayed Feedback
NIPS 2023
Taylor TD-learning
NIPS 2023
Adversarial Model for Offline Reinforcement Learning
NIPS 2023
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks
NIPS 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
NIPS 2023
Language Model Alignment with Elastic Reset
NIPS 2023
Hybrid Policy Optimization from Imperfect Demonstrations
NIPS 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
NIPS 2023
Learning non-Markovian Decision-Making from State-only Sequences
NIPS 2023
RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization
NIPS 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
NIPS 2023
Finite-Time Analysis of Single-Timescale Actor-Critic
NIPS 2023
Generating Behaviorally Diverse Policies with Latent Diffusion Models
NIPS 2023
Boosting Verification of Deep Reinforcement Learning via Piece-Wise Linear Decision Neural Networks
NIPS 2023
A Long $N$-step Surrogate Stage Reward for Deep Reinforcement Learning
NIPS 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
NIPS 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration
NIPS 2023
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
NIPS 2023
Goal-conditioned Offline Planning from Curious Exploration
NIPS 2023
Optimal Exploration for Model-Based RL in Nonlinear Systems
NIPS 2023
ELDEN: Exploration via Local Dependencies
NIPS 2023
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark
NIPS 2023
<
1
…
43
44
45
…
155
>