Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Evaluating Rewards for Question Generation Models
NAACL 2019
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
NIPS 2019
Provably Efficient Q-Learning with Low Switching Cost
NIPS 2019
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
NIPS 2019
Deep Reactive Policies for Planning in Stochastic Nonlinear Domains
AAAI 2019
Optimizing Discount and Reputation Trade-Offs in E-Commerce Systems: Characterization and Online Learning
AAAI 2019
Deep Reinforcement Learning via Past-Success Directed Exploration
AAAI 2019
ELF OpenGo: an analysis and open reimplementation of AlphaZero
ICML 2019
Action Robust Reinforcement Learning and Applications in Continuous Control
ICML 2019
Making Deep Q-learning methods robust to time discretization
ICML 2019
Dynamic Weights in Multi-Objective Deep Reinforcement Learning
ICML 2019
Aspect Sentiment Classification Towards Question-Answering with Reinforced Bidirectional Attention Network
ACL 2019
Rewarding Smatch: Transition-Based AMR Parsing with Reinforcement Learning
ACL 2019
A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer
ACL 2019
What Should I Ask? Using Conversationally Informative Rewards for Goal-oriented Visual Dialog.
ACL 2019
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
EMNLP 2019
Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue
EMNLP 2019
VIREL: A Variational Inference Framework for Reinforcement Learning
NIPS 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
NIPS 2019
When to use parametric models in reinforcement learning?
NIPS 2019
Correlation Priors for Reinforcement Learning
NIPS 2019
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
NIPS 2019
Budgeted Reinforcement Learning in Continuous State Space
NIPS 2019
Large Scale Markov Decision Processes with Changing Rewards
NIPS 2019
Variance Reduced Policy Evaluation with Smooth Function Approximation
NIPS 2019
<
1
…
118
119
120
…
155
>