conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3,861 papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Instance-based Generalization in Reinforcement Learning
NIPS 2020
Task-agnostic Exploration in Reinforcement Learning
NIPS 2020
Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration
NIPS 2020
Softmax Deep Double Deterministic Policy Gradients
NIPS 2020
Online Decision Based Visual Tracking via Reinforcement Learning
NIPS 2020
Predictive Information Accelerates Learning in RL
NIPS 2020
Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation
NIPS 2020
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning
NIPS 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
NIPS 2020
Reward Propagation Using Graph Convolutional Networks
NIPS 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
NIPS 2020
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
NIPS 2020
POMDPs in Continuous Time and Discrete Spaces
NIPS 2020
Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method
NIPS 2020
A Local Temporal Difference Code for Distributional Reinforcement Learning
NIPS 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
NIPS 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
NIPS 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
NIPS 2020
DISK: Learning local features with policy gradient
NIPS 2020
Learning the Linear Quadratic Regulator from Nonlinear Observations
NIPS 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
NIPS 2020
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
NIPS 2020
Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition
NIPS 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
NIPS 2020
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs
NIPS 2020
<
1
…
101
102
103
…
155
>