Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Adapting LLM Agents with Universal Communication Feedback
NAACL 2025
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
NAACL 2025
Optimizing RLHF Training for Large Language Models with Stage Fusion
NSDI 2025
Active Geospatial Search for Efficient Tenant Eviction Outreach
AAAI 2025
Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning
IJCAI 2025
Learning to Explain: Towards Human-Aligned Explainability in Deep Reinforcement Learning via Attention Guidance
IJCAI 2025
AI-Powered Algorithm-Centric Quantum Processor Topology Design
AAAI 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
IJCAI 2025
InstGAN: Instant Actor-Critic-Driven GAN for De Novo Molecule Generation and Property Optimization
IJCAI 2025
BILE: An Effective Behavior-based Latent Exploration Scheme for Deep Reinforcement Learning
IJCAI 2025
APIRL: Deep Reinforcement Learning for REST API Fuzzing
AAAI 2025
Robustness to Spurious Correlations via Dynamic Knowledge Transfer
IJCAI 2025
CADP: Towards Better Centralized Learning for Decentralized Execution in MARL
IJCAI 2025
Sketch Decompositions for Classical Planning via Deep Reinforcement Learning
IJCAI 2025
Partially Observable Reference Policy Programming
IJCAI 2025
EFormer: An Effective Edge-based Transformer for Vehicle Routing Problems
IJCAI 2025
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy
IJCAI 2025
Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach
IJCAI 2025
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation
AAAI 2025
SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch
AAAI 2025
DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation
AAAI 2025
LLM-Powered User Simulator for Recommender System
AAAI 2025
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
AAAI 2025
Deep Reinforcement Learning with Time-Scale Invariant Memory
AAAI 2025
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
CVPR 2025