Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Relative Policy-Transition Optimization for Fast Policy Transfer
AAAI 2024
Parameterized Projected Bellman Operator
AAAI 2024
Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement Learning
NIPS 2024
MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components
AAAI 2024
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning
AAAI 2024
Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning
NIPS 2024
Real-world fluid directed rigid body control via deep reinforcement learning
L4DC 2024
Towards Achieving Sub-linear Regret and Hard Constraint Violation in Model-free RL
AISTATS 2024
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
IJCAI 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
NIPS 2024
Critic-Guided Decision Transformer for Offline Reinforcement Learning
AAAI 2024
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System
AAAI 2024
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
AAAI 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
NIPS 2024
Central Limit Theorem for Two-Timescale Stochastic Approximation with Markovian Noise: Theory and Applications
AISTATS 2024
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning
AAAI 2024
Learning and deploying robust locomotion policies with minimal dynamics randomization
L4DC 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
NIPS 2024
Reward Certification for Policy Smoothed Reinforcement Learning
AAAI 2024
Robust exploration with adversary via Langevin Monte Carlo
L4DC 2024
Learning Diverse Risk Preferences in Population-Based Self-Play
AAAI 2024
Amortized Active Causal Induction with Deep Reinforcement Learning
NIPS 2024
Sample Complexity Characterization for Linear Contextual MDPs
AISTATS 2024
MANDREL: Modular Reinforcement Learning Pipelines for Material Discovery
AAAI 2024