conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3,861 papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Auxiliary Tasks for Efficient Learning of Point-Goal Navigation
WACV 2021
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation
WACV 2021
Weakly Supervised Deep Reinforcement Learning for Video Summarization With Semantically Meaningful Reward
WACV 2021
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation
WACV 2021
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
NIPS 2020
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
NIPS 2020
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
NIPS 2020
Learning Guidance Rewards with Trajectory-space Smoothing
NIPS 2020
A Unifying View of Optimism in Episodic Reinforcement Learning
NIPS 2020
Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning
NIPS 2020
Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction
NIPS 2020
Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation
NIPS 2020
Weakly-Supervised Reinforcement Learning for Controllable Behavior
NIPS 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
NIPS 2020
A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances
NIPS 2020
Deep Reinforcement and InfoMax Learning
NIPS 2020
Adaptive Discretization for Model-Based Reinforcement Learning
NIPS 2020
AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control
NIPS 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
NIPS 2020
Munchausen Reinforcement Learning
NIPS 2020
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards
NIPS 2020
Reward-rational (implicit) choice: A unifying formalism for reward learning
NIPS 2020
Neural Dynamic Policies for End-to-End Sensorimotor Learning
NIPS 2020
Sparse Graphical Memory for Robust Planning
NIPS 2020
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?
NIPS 2020
<
1
…
99
100
101
…
155
>