Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning
NIPS 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
NIPS 2024
Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement Learning
NIPS 2024
MetaCARD: Meta-Reinforcement Learning with Task Uncertainty Feedback via Decoupled Context-Aware Reward and Dynamics Components
AAAI 2024
Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection
NIPS 2024
Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning
NIPS 2024
Learning World Models for Unconstrained Goal Navigation
NIPS 2024
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
NIPS 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
NIPS 2024
Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation
NIPS 2024
Recent Advancements in Inverse Reinforcement Learning
AAAI 2024
P2BPO: Permeable Penalty Barrier-Based Policy Optimization for Safe RL
AAAI 2024
ReFT: Reasoning with Reinforced Fine-Tuning
ACL 2024
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving
CVPR 2024
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
CVPR 2024
When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
NIPS 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
NIPS 2024
RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning
NIPS 2024
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients
AISTATS 2024
Provable Partially Observable Reinforcement Learning with Privileged Information
NIPS 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
NIPS 2024
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
ACL 2024
Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation
ACL 2024
Reinforcement Learning with Lookahead Information
NIPS 2024
Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration
NIPS 2024