Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
AISTATS 2024
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms
NIPS 2024
EmpCRL: Controllable Empathetic Response Generation via In-Context Commonsense Reasoning and Reinforcement Learning
COLING 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
ACL 2024
Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes
AAAI 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
NIPS 2024
RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation
ACL 2024
EROS: Entity-Driven Controlled Policy Document Summarization
COLING 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
NIPS 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
NIPS 2024
Deep Reinforcement Learning with Hierarchical Action Exploration for Dialogue Generation
COLING 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
NIPS 2024
The Power of Resets in Online Reinforcement Learning
NIPS 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
NIPS 2024
Improving Language Model Reasoning with Self-motivated Learning
COLING 2024
Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Games
NIPS 2024
ReCoRe: Regularized Contrastive Representation Learning of World Model
CVPR 2024
The Value of Reward Lookahead in Reinforcement Learning
NIPS 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
NIPS 2024
Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention
AAAI 2024
Graph Diffusion Policy Optimization
NIPS 2024
WPO: Enhancing RLHF with Weighted Preference Optimization
EMNLP 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
NIPS 2024
Robust Policy Learning via Offline Skill Diffusion
AAAI 2024
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
AAAI 2024