Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Divergence-Regularized Multi-Agent Actor-Critic
ICML 2022
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning
ICML 2022
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
ICML 2022
Denoised MDPs: Learning World Models Better Than the World Itself
ICML 2022
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search
ICML 2022
Policy Gradient Method For Robust Reinforcement Learning
ICML 2022
Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
ICML 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
ICML 2022
Reachability Constrained Reinforcement Learning
ICML 2022
Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
ICML 2022
Actor-Critic based Improper Reinforcement Learning
ICML 2022
Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning
ICML 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach
ICML 2022
Dynamic Regret of Online Markov Decision Processes
ICML 2022
Efficient Learning for AlphaZero via Path Consistency
ICML 2022
Online Decision Transformer
ICML 2022
Sequential Voting With Relational Box Fields for Active Object Detection
CVPR 2022
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path
ICML 2022
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP
ICML 2022
Cooperative Online Learning in Stochastic and Adversarial MDPs
ICML 2022
Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning
L4DC 2022
Experience Replay with Likelihood-free Importance Weights
L4DC 2022
Safe Reinforcement Learning with Chance-constrained Model Predictive Control
L4DC 2022
Reinforcement Learning with Almost Sure Constraints
L4DC 2022
Block Contextual MDPs for Continual Learning
L4DC 2022