Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Partially Observable Hierarchical Reinforcement Learning with AI Planning (Student Abstract)
AAAI 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
NIPS 2024
Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures
AISTATS 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback
ACL 2024
A Visual Active Search Framework for Geospatial Exploration
WACV 2024
Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery
AAAI 2024
Ethics in Action: Training Reinforcement Learning Agents for Moral Decision-making In Text-based Adventure Games
AISTATS 2024
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
ACL 2024
LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer
AAAI 2024
What Effects the Generalization in Visual Reinforcement Learning: Policy Consistency with Truncated Return Prediction
AAAI 2024
MIM-Reasoner: Learning with Theoretical Guarantees for Multiplex Influence Maximization
AISTATS 2024
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)
AAAI 2024
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing
AAAI 2024
Natural Language-based State Representation in Deep Reinforcement Learning
NAACL 2024
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
ACL 2024
Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding
NAACL 2024
Planning Like Human: A Dual-process Framework for Dialogue Planning
ACL 2024
Cluster-Based Sampling in Hindsight Experience Replay for Robotic Tasks (Student Abstract)
AAAI 2024
Increasing the Difficulty of Automatically Generated Questions via Reinforcement Learning with Synthetic Preference for Cost-Effective Cultural Heritage Dataset Generation
EMNLP 2024
BadRL: Sparse Targeted Backdoor Attack against Reinforcement Learning
AAAI 2024
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving
CVPR 2024
RLfOLD: Reinforcement Learning from Online Demonstrations in Urban Autonomous Driving
AAAI 2024
On learning history-based policies for controlling Markov decision processes
AISTATS 2024
Abstract and Explore: A Novel Behavioral Metric with Cyclic Dynamics in Reinforcement Learning
AAAI 2024
Amortized Active Causal Induction with Deep Reinforcement Learning
NIPS 2024