Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
AAAI 2025
GenPlan: Generative Sequence Models as Adaptive Planners
AAAI 2025
HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
RSS 2025
Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning
AAAI 2025
Reducing AUV Energy Consumption Through Dynamic Sensor Directions Switching via Deep Reinforcement Learning
AAAI 2025
Efficient Reinforcement Learning in Probabilistic Reward Machines
AAAI 2025
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models
EMNLP 2025
Safety with Agency: Human-Centered Safety Filter with Application to AI-Assisted Motorsports
RSS 2025
CTD4 – a Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
AAAI 2025
Towards Robust, Efficient, and Practical Decision-Making: From Reward-Maximizing Deep Reinforcement Learning to Reward-Matching GFlowNets
AAAI 2025
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation
RSS 2025
NL2Lean: Translating Natural Language into Lean 4 through Multi-Aspect Reinforcement Learning
EMNLP 2025
LLM-Powered User Simulator for Recommender System
AAAI 2025
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
ACL 2025
Exploration-Driven Generative Interactive Environments
CVPR 2025
Steering LLM Reasoning Through Bias-Only Adaptation
EMNLP 2025
Reward-Directed Score-Based Diffusion Models via q-Learning
JMLR 2025
Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models
EMNLP 2025
BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds
RSS 2025
Hierarchical and Modular Network on Non-prehensile Manipulation in General Environments
RSS 2025
Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling
IJCAI 2025
Preference-based Deep Reinforcement Learning for Historical Route Estimation
IJCAI 2025
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
ACL 2025
SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
AAAI 2024
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
AAAI 2024
<
1
…
16
17
18
…
155
>