Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
A data-driven approach for learning to control computers
ICML 2022
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation
ICML 2022
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
ICML 2022
Large Batch Experience Replay
ICML 2022
Goal Misgeneralization in Deep Reinforcement Learning
ICML 2022
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
ICML 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
ICML 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
ICML 2022
Delayed Reinforcement Learning by Imitation
ICML 2022
Distributionally Robust $Q$-Learning
ICML 2022
How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation
ICML 2022
Optimizing Tensor Network Contraction Using Reinforcement Learning
ICML 2022
Transformers are Meta-Reinforcement Learners
ICML 2022
Learning Stochastic Shortest Path with Linear Function Approximation
ICML 2022
A Simple Reward-free Approach to Constrained Reinforcement Learning
ICML 2022
EqR: Equivariant Representations for Data-Efficient Reinforcement Learning
ICML 2022
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
ICML 2022
The Primacy Bias in Deep Reinforcement Learning
ICML 2022
History Compression via Language Models in Reinforcement Learning
ICML 2022
Evolving Curricula with Regret-Based Environment Design
ICML 2022
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
ICML 2022
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost
ICML 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
ICML 2022
Direct Behavior Specification via Constrained Reinforcement Learning
ICML 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
ICML 2022
<
1
…
60
61
62
…
155
>