Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL
NIPS 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
NIPS 2024
Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement Learning
NIPS 2024
Distributional Reinforcement Learning with Regularized Wasserstein Loss
NIPS 2024
Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning
NIPS 2024
VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
RSS 2024
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning
NIPS 2024
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
NIPS 2024
Statistical Efficiency of Distributional Temporal Difference Learning
NIPS 2024
Robust Reinforcement Learning with General Utility
NIPS 2024
Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning
NIPS 2024
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
NIPS 2024
Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms
NIPS 2024
Diffusion for World Modeling: Visual Details Matter in Atari
NIPS 2024
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control
NIPS 2024
Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers
NIPS 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
NIPS 2024
SPO: Sequential Monte Carlo Policy Optimisation
NIPS 2024
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
NIPS 2024
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
NIPS 2024
Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding
SEMEVAL 2024
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
AAAI 2024
Learning to Control Camera Exposure via Reinforcement Learning
CVPR 2024
A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning
NIPS 2024
ReCoRe: Regularized Contrastive Representation Learning of World Model
CVPR 2024
<
1
…
26
27
28
…
155
>