Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Feedback-Based Tree Search for Reinforcement Learning
ICML 2018
Regret Minimization for Partially Observable Deep Reinforcement Learning
ICML 2018
Continual Reinforcement Learning with Complex Synapses
ICML 2018
Hierarchical Imitation and Reinforcement Learning
ICML 2018
Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling
ICML 2018
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
JMLR 2018
RLlib: Abstractions for Distributed Reinforcement Learning
ICML 2018
Learning Environmental Calibration Actions for Policy Self-Evolution
IJCAI 2018
Smoothed Action Value Functions for Learning Gaussian Policies
ICML 2018
Environment Upgrade Reinforcement Learning for Non-Differentiable Multi-Stage Pipelines
CVPR 2018
Time Limits in Reinforcement Learning
ICML 2018
Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
AISTATS 2018
Linear Stochastic Approximation: How Far Does Constant Step-Size and Iterate Averaging Go?
AISTATS 2018
Actor-Critic Fictitious Play in Simultaneous Move Multistage Games
AISTATS 2018
An Analysis of Categorical Distributional Reinforcement Learning
AISTATS 2018
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
ICML 2018
Self-Imitation Learning
ICML 2018
BanditSum: Extractive Summarization as a Contextual Bandit
EMNLP 2018
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning
IJCAI 2018
Using a Deep Learning Dialogue Research Toolkit in a Multilingual Multidomain Practical Application
IJCAI 2018
Towards Sample Efficient Reinforcement Learning
IJCAI 2018
Improving Reinforcement Learning with Human Input
IJCAI 2018
On Q-learning Convergence for Non-Markov Decision Processes
IJCAI 2018
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation
COLT 2018
Policy Optimization with Second-Order Advantage Information
IJCAI 2018
<
1
…
135
136
137
…
155
>