Research Explorer
Reinforcement Learning › Methods › Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design
NIPS 2020
POMDPs in Continuous Time and Discrete Spaces
NIPS 2020
Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method
NIPS 2020
Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation
NIPS 2020
Weakly-Supervised Reinforcement Learning for Controllable Behavior
NIPS 2020
A Local Temporal Difference Code for Distributional Reinforcement Learning
NIPS 2020
Learning to Optimize Variational Quantum Circuits to Solve Combinatorial Problems
AAAI 2020
Discretizing Continuous Action Space for On-Policy Optimization
AAAI 2020
Contextual-Bandit Based Personalized Recommendation with Time-Varying User Interests
AAAI 2020
Accelerating Ranking in E-Commerce Search Engines through Contextual Factor Selection
AAAI 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
NIPS 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
NIPS 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
NIPS 2020
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
NIPS 2020
Predictive Information Accelerates Learning in RL
NIPS 2020
Bandit Linear Control
NIPS 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
NIPS 2020
Multi-step Greedy Reinforcement Learning Algorithms
ICML 2020
DISK: Learning local features with policy gradient
NIPS 2020
Reinforcement Mechanism Design: With Applications to Dynamic Pricing in Sponsored Search Auctions
AAAI 2020
A Learning Based Branch and Bound for Maximum Common Subgraph Related Problems
AAAI 2020
Metareasoning in Modular Software Systems: On-the-Fly Configuration Using Reinforcement Learning with Rich Contextual Representations
AAAI 2020
Transfer Value Iteration Networks
AAAI 2020
Generating Persona Consistent Dialogues by Exploiting Natural Language Inference
AAAI 2020
Point-Based Methods for Model Checking in Partially Observable Markov Decision Processes
AAAI 2020