Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Exploring Question-Specific Rewards for Generating Deep Questions
COLING 2020
Reinforced Multi-task Approach for Multi-hop Question Generation
COLING 2020
PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards
CORL 2020
Active Model Estimation in Markov Decision Processes
UAI 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
UAI 2020
Knowledge-enriched, Type-constrained and Grammar-guided Question Generation over Knowledge Bases
COLING 2020
Answer-driven Deep Question Generation based on Reinforcement Learning
COLING 2020
Interactive Question Clarification in Dialogue via Reinforcement Learning
COLING 2020
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise
UAI 2020
Learning from Interventions: Human-robot interaction as both explicit and implicit feedback
RSS 2020
Spatial Action Maps for Mobile Manipulation
RSS 2020
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions
RSS 2020
A Reduction from Reinforcement Learning to No-Regret Online Learning
AISTATS 2020
Adaptive Trade-Offs in Off-Policy Learning
AISTATS 2020
Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
AISTATS 2020
Finite-Time Error Bounds for Biased Stochastic Approximation with Applications to Q-Learning
AISTATS 2020
Bayesian Reinforcement Learning via Deep, Sparse Sampling
AISTATS 2020
Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation
AISTATS 2020
Stochastically Dominant Distributional Reinforcement Learning
ICML 2020
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
NeurIPS 2020
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos
RSS 2020
Controlling Contact-Rich Manipulation Under Partial Observability
RSS 2020
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning
AAAI 2020
MRI Reconstruction with Interpretable Pixel-Wise Operations Using Reinforcement Learning
AAAI 2020
Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning
AAAI 2020