Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning
IJCAI 2024
Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds
JMLR 2024
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
AAAI 2024
Value-Distributional Model-Based Reinforcement Learning
JMLR 2024
Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract)
AAAI 2024
Amortized Active Causal Induction with Deep Reinforcement Learning
NIPS 2024
Increasing the Difficulty of Automatically Generated Questions via Reinforcement Learning with Synthetic Preference for Cost-Effective Cultural Heritage Dataset Generation
EMNLP 2024
Vertical Symbolic Regression via Deep Policy Gradient
IJCAI 2024
Monte Carlo Tree Search in the Presence of Transition Uncertainty
AAAI 2024
Evolutionary Reward Design and Optimization with Multimodal Large Language Models
ACL 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
EMNLP 2024
Reader: Model-based language-instructed reinforcement learning
EMNLP 2023
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions
L4DC 2023
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
EMNLP 2023
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
L4DC 2023
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
EMNLP 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
EMNLP 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
EMNLP 2023
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
L4DC 2023
Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
L4DC 2023
User Simulator Assisted Open-ended Conversational Recommendation System
ACL 2023
Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning
ACL 2023
Generating Dialog Responses with Specified Grammatical Items for Second Language Learning
ACL 2023
Enhancing Educational Dialogues: A Reinforcement Learning Approach for Generating AI Teacher Responses
ACL 2023
<
1
…
37
38
39
…
155
>