Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Distantly Supervised NER with Partial Annotation Learning and Reinforcement Learning
COLING 2018
Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language
COLING 2018
Finite Sample Analysis of LSTD with Random Projections and Eligibility Traces
IJCAI 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
ICML 2018
Prediction Improves Simultaneous Neural Machine Translation
EMNLP 2018
Addressing Function Approximation Error in Actor-Critic Methods
ICML 2018
Synthesizing Programs for Images using Reinforced Adversarial Learning
ICML 2018
Spotlight: Optimizing Device Placement for Training Deep Neural Networks
ICML 2018
Latent Space Policies for Hierarchical Reinforcement Learning
ICML 2018
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
ICML 2018
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos
ICML 2018
Learning a Policy for Opportunistic Active Learning
EMNLP 2018
The Uncertainty Bellman Equation and Exploration
ICML 2018
The Hierarchical Adaptive Forgetting Variational Filter
ICML 2018
Learning the Reward Function for a Misspecified Model
ICML 2018
Structured Control Nets for Deep Reinforcement Learning
ICML 2018
Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management
EMNLP 2018
A Reinforcement Learning-driven Translation Model for Search-Oriented Conversational Systems
EMNLP 2018
Accelerating Natural Gradient with Higher-Order Invariance
ICML 2018
An Inference-Based Policy Gradient Method for Learning Options
ICML 2018
Convergent Tree Backup and Retrace with Function Approximation
ICML 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
ICML 2018
Learning by Playing Solving Sparse Reward Tasks from Scratch
ICML 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
ICML 2018
SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark
CORL 2018
<
1
…
134
135
136
…
155
>