Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes
NIPS 2017
Multi-View Decision Processes: The Helper-AI Problem
NIPS 2017
Decoding with Value Networks for Neural Machine Translation
NIPS 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
NIPS 2017
Shallow Updates for Deep Reinforcement Learning
NIPS 2017
Value-Aware Loss Function for Model-based Reinforcement Learning
AISTATS 2017
Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
CVPR 2017
Attention-Aware Face Hallucination via Deep Reinforcement Learning
CVPR 2017
Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection
CVPR 2017
Budget-Aware Deep Semantic Video Segmentation
CVPR 2017
Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning
CVPR 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos
CVPR 2017
Learning to Learn From Noisy Web Videos
CVPR 2017
DSAC - Differentiable RANSAC for Camera Localization
CVPR 2017
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning
CVPR 2017
A Reinforcement Learning Approach to the View Planning Problem
CVPR 2017
CAD2RL: Real Single-Image Flight Without a Single Real Image
RSS 2017
Automatic Text Summarization Using Reinforcement Learning with Embedding Features
IJCNLP 2017
Tackling Error Propagation through Reinforcement Learning: A Case of Greedy Dependency Parsing
EACL 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
RSS 2017
Extending Model-based Policy Gradients for Robots in Heteroscedastic Environments
CORL 2017
Neural Episodic Control
ICML 2017
Curriculum Learning in Reinforcement Learning
IJCAI 2017
Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation
NSDI 2017
Learning what to read: Focused machine reading
EMNLP 2017
<
1
…
144
145
146
…
155
>