Research Explorer
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration
NIPS 2024
An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning
NIPS 2024
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning
AAAI 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
NIPS 2024
FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality Representation
NIPS 2024
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
NIPS 2024
Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning
AAAI 2024
Learning to Assist Humans without Inferring Rewards
NIPS 2024
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
NIPS 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
NIPS 2024
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
NIPS 2024
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
NIPS 2024
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
NIPS 2024
Flipping-based Policy for Chance-Constrained Markov Decision Processes
NIPS 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
NIPS 2024
Text-Aware Diffusion for Policy Learning
NIPS 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
NIPS 2024
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
NIPS 2024
LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer
AAAI 2024
Learning World Models for Unconstrained Goal Navigation
NIPS 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
NIPS 2024
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
NIPS 2024
Goal-Conditioned On-Policy Reinforcement Learning
NIPS 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
EMNLP 2024
Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation
NIPS 2024