Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement
ICCV 2025
Practicable Black-Box Evasion Attacks on Link Prediction in Dynamic Graphs—a Graph Sequential Embedding Method
AAAI 2025
Representation-driven Option Discovery in Reinforcement Learning
AAAI 2025
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches
RSS 2025
Structured Document Translation via Format Reinforcement Learning
IJCNLP 2025
What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning
AAAI 2025
ModelDiff: Symbolic Dynamic Programming for Model-Aware Policy Transfer in Deep Q-Learning
AAAI 2025
Learning Joint Behaviors with Large Variations
AAAI 2025
SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning
AAAI 2025
Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems
AAAI 2025
Safety with Agency: Human-Centered Safety Filter with Application to AI-Assisted Motorsports
RSS 2025
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration
EMNLP 2025
Optimal Viewpoint Selection for Autonomous Photography Using Reinforcement Learning
AAAI 2025
BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds
RSS 2025
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
ICCV 2025
LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning
AAAI 2025
Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes
AAAI 2025
HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
RSS 2025
Resolving Conflicting Constraints in Multi-Agent Reinforcement Learning with Layered Safety
RSS 2025
Enhancing multi-modal Relation Extraction with Reinforcement Learning Guided Graph Diffusion Framework
COLING 2025
InstGAN: Instant Actor-Critic-Driven GAN for De Novo Molecule Generation and Property Optimization
IJCAI 2025
AI-Powered Algorithm-Centric Quantum Processor Topology Design
AAAI 2025
GenAL: Generative Agent for Adaptive Learning
AAAI 2025
SARA: Salience-Aware Reinforced Adaptive Decoding for Large Language Models in Abstractive Summarization
ACL 2025
DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model
ICCV 2025
<
1
…
15
16
17
…
155
>