Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
NIPS 2020
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
NIPS 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
NIPS 2020
Generalized Mean Estimation in Monte-Carlo Tree Search
IJCAI 2020
Constrained Policy Improvement for Efficient Reinforcement Learning
IJCAI 2020
Dynamic Control of Stochastic Evolution: A Deep Reinforcement Learning Approach to Adaptively Targeting Emergent Drug Resistance
JMLR 2020
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
AAAI 2020
Learning the Linear Quadratic Regulator from Nonlinear Observations
NIPS 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
NIPS 2020
Dynamic Regret of Policy Optimization in Non-Stationary Environments
NIPS 2020
Deep Reinforcement Learning for Organ Localization in CT
MIDL 2020
Multitask radiological modality invariant landmark localization using deep reinforcement learning
MIDL 2020
Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments
AAAI 2020
Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control
AAAI 2020
A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound
AAAI 2020
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
AAAI 2020
Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
CVPR 2020
Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization
AAAI 2020
Deep Reinforcement Learning with Robust and Smooth Policy
ICML 2020
Dialog State Tracking with Reinforced Data Augmentation
AAAI 2020
Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction
NIPS 2020
Implicit Distributional Reinforcement Learning
NIPS 2020
RMM: A Recursive Mental Model for Dialogue Navigation
EMNLP 2020
Balancing Quality and Human Involvement: An Effective Approach to Interactive Neural Machine Translation
AAAI 2020
Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension
ACL 2020
<
1
…
115
116
117
…
155
>