Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
ICML 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
ICML 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
ICML 2023
Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization
ICML 2023
Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
ICML 2023
Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds
ICML 2023
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling
ICML 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
EMNLP 2023
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
EMNLP 2023
Reader: Model-based language-instructed reinforcement learning
EMNLP 2023
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
EMNLP 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
EMNLP 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning
CVPR 2023
Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning
CVPR 2023
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders
CVPR 2023
Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning
CVPR 2023
Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
ICML 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
ICML 2023
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-per-Second
CVPR 2023
Reinforcement Learning-Based Black-Box Model Inversion Attacks
CVPR 2023
Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation
CVPR 2023
Variance Control for Distributional Reinforcement Learning
ICML 2023
Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions
ICML 2023
Convergence of Actor-Critic with Multi-Layer Neural Networks
NIPS 2023
<
1
…
55
56
57
…
155
>