Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Methods
Reinforcement Learning
›
Methods
›
Deep RL
3861 directly classified papers
Papers per year
2005: 1
2006: 9
2007: 14
2008: 15
2009: 9
2010: 21
2011: 27
2012: 32
2013: 21
2014: 17
2015: 10
2016: 33
2017: 102
2018: 222
2019: 399
2020: 450
2021: 533
2022: 478
2023: 532
2024: 513
2025: 326
2026: 97
Papers
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
AAAI 2019
QUOTA: The Quantile Option Architecture for Reinforcement Learning
AAAI 2019
Self-Supervised Mixture-of-Experts by Uncertainty Estimation
AAAI 2019
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
CORL 2019
Model-Based Planning with Energy-Based Models
CORL 2019
Leveraging exploration in off-policy algorithms via normalizing flows
CORL 2019
Seeded self-play for language learning
EMNLP 2019
Transfer in Deep Reinforcement Learning Using Knowledge Graphs
EMNLP 2019
Generalization in Generation: A closer look at Exposure Bias
EMNLP 2019
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
EMNLP 2019
Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses
EMNLP 2019
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
EMNLP 2019
Exploring Diverse Expressions for Paraphrase Generation
EMNLP 2019
Better Rewards Yield Better Summaries: Learning to Summarise Without References
EMNLP 2019
Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning
EMNLP 2019
Interactive Language Learning by Question Answering
EMNLP 2019
Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models
NAACL 2019
Reinforcement Learning based Curriculum Optimization for Neural Machine Translation
NAACL 2019
IPOMDP-Net: A Deep Neural Network for Partially Observable Multi-Agent Planning Using Interactive POMDPs
AAAI 2019
Learning Representations in Model-Free Hierarchical Reinforcement Learning
AAAI 2019
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
AAAI 2019
Reinforcement Learning under Threats
AAAI 2019
Diversity-Driven Extensible Hierarchical Reinforcement Learning
AAAI 2019
Composable Modular Reinforcement Learning
AAAI 2019
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
AAAI 2019
<
1
…
126
127
128
…
155
>