Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Speeding up Reinforcement Learning-based Information Extraction Training using Asynchronous Methods
EMNLP 2017
Agent-Aware Dropout DQN for Safe and Efficient On-line Dialogue Policy Learning
EMNLP 2017
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
EMNLP 2017
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
EMNLP 2017
An Empirical Analysis of Multiple-Turn Reasoning Strategies in Reading Comprehension Tasks
IJCNLP 2017
Reinforced Video Captioning with Entailment Rewards
EMNLP 2017
Learning how to Active Learn: A Deep Reinforcement Learning Approach
EMNLP 2017
Sentence Simplification with Deep Reinforcement Learning
EMNLP 2017
Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning
IJCNLP 2017
Reinforcement mechanism design
IJCAI 2017
Modular Multitask Reinforcement Learning with Policy Sketches
ICML 2017
Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning
ICML 2017
Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution
ICML 2017
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
ICML 2017
Reinforcement Learning with Deep Energy-Based Policies
ICML 2017
Contextual Decision Processes with low Bellman rank are PAC-Learnable
ICML 2017
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
ICML 2017
Count-Based Exploration with Neural Density Models
ICML 2017
Curiosity-driven Exploration by Self-supervised Prediction
ICML 2017
Robust Adversarial Reinforcement Learning
ICML 2017
Accelerating Stochastic Composition Optimization
JMLR 2017
Hierarchical Reinforcement Learning with Parameters
CORL 2017
Mutual Alignment Transfer Learning
CORL 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
CORL 2017
Optimizing Long-term Predictions for Model-based Policy Search
CORL 2017
<
1
…
145
146
147
…
155
>