Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Vertical Symbolic Regression via Deep Policy Gradient
IJCAI 2024
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
AISTATS 2024
QGym: Scalable Simulation and Benchmarking of Queuing Network Controllers
NIPS 2024
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
COLT 2024
FactorSim: Generative Simulation via Factorized Representation
NIPS 2024
Rich Human Feedback for Text-to-Image Generation
CVPR 2024
EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning
EACL 2024
A Deep Reinforcement Learning Approach to Balance Viewport Prediction and Video Transmission in 360° Video Streaming
IJCAI 2024
Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
ACL 2024
Safety filters for black-box dynamical systems by learning discriminating hyperplanes
L4DC 2024
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
NIPS 2024
CACTO-SL: Using Sobolev learning to improve continuous actor-critic with trajectory optimization
L4DC 2024
CoVO-MPC: Theoretical analysis of sampling-based MPC and optimal covariance design
L4DC 2024
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration
NIPS 2024
State-wise safe reinforcement learning with pixel observations
L4DC 2024
Robust exploration with adversary via Langevin Monte Carlo
L4DC 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
NIPS 2024
Generalized constraint for probabilistic safe reinforcement learning
L4DC 2024
The surprising efficiency of temporal difference learning for rare event prediction
NIPS 2024
PDE control gym: A benchmark for data-driven boundary control of partial differential equations
L4DC 2024
Synthesizing Programmatic Policy for Generalization within Task Domain
IJCAI 2024
Reinforcement Learning from Diverse Human Preferences
IJCAI 2024
Pointwise-in-time diagnostics for reinforcement learning during training and runtime
L4DC 2024
Balancing Context Length and Mixing Times for Reinforcement Learning at Scale
NIPS 2024
Robust cooperative multi-agent reinforcement learning: A mean-field type game perspective
L4DC 2024
<
1
…
24
25
26
…
155
>