Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
NIPS 2024
Learning World Models for Unconstrained Goal Navigation
NIPS 2024
RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning
NIPS 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
NIPS 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
NIPS 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
NIPS 2024
Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection
NIPS 2024
Cloud-LoRa: Enabling Cloud Radio Access LoRa Networks Using Reinforcement Learning Based Bandwidth-Adaptive Compression
NSDI 2024
Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning
NIPS 2024
OPPerTune: Post-Deployment Configuration Tuning of Services Made Easy
NSDI 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
NIPS 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NIPS 2024
RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation
ACL 2024
Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs
NIPS 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
ACL 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
NIPS 2024
Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation
NIPS 2024
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
NIPS 2024
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimistic No-Regret Algorithm
NIPS 2024
Statistical Efficiency of Distributional Temporal Difference Learning
NIPS 2024
A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning
NIPS 2024
Speculative Monte-Carlo Tree Search
NIPS 2024
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
NIPS 2024
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory
NIPS 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
NIPS 2024