Research Explorer
Reinforcement Learning › Methods › Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
On Thompson Sampling and Asymptotic Optimality
IJCAI 2017
Reinforcement Learning with a Corrupted Reward Channel
IJCAI 2017
Learning Conversational Systems that Interleave Task and Non-Task Content
IJCAI 2017
Interactive Narrative Personalization with Deep Reinforcement Learning
IJCAI 2017
A Monte Carlo Tree Search approach to Active Malware Analysis
IJCAI 2017
Leveraging Human Knowledge in Tabular Reinforcement Learning: A Study of Human Subjects
IJCAI 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
IJCAI 2017
Tensor Based Knowledge Transfer Across Skill Categories for Robot Control
IJCAI 2017
Dynamic-Depth Context Tree Weighting
NIPS 2017
Weighted Double Q-learning
IJCAI 2017
Multi-Task Deep Reinforcement Learning for Continuous Action Control
IJCAI 2017
Improving Reinforcement Learning with Confidence-Based Demonstrations
IJCAI 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
IJCAI 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
IJCAI 2017
Learning Sparse Representations in Reinforcement Learning with Sparse Coding
IJCAI 2017
Efficient Reinforcement Learning with Hierarchies of Machines by Leveraging Internal Transitions
IJCAI 2017
Reinforcement Learning under Model Mismatch
NIPS 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
NIPS 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
ICML 2017
Learning in POMDPs with Monte Carlo Tree Search
ICML 2017
FeUdal Networks for Hierarchical Reinforcement Learning
ICML 2017
Predictive-State Decoders: Encoding the Future into Recurrent Networks
NIPS 2017
Learning Unknown Markov Decision Processes: A Thompson Sampling Approach
NIPS 2017
Collaborative Deep Reinforcement Learning for Joint Object Search
CVPR 2017
Distral: Robust multitask reinforcement learning
NIPS 2017