Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Addressing Exposure Bias With Document Minimum Risk Training: Cambridge at the WMT20 Biomedical Translation Task
EMNLP 2020
A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound
AAAI 2020
Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control
AAAI 2020
Reinforcement Learning When All Actions Are Not Always Available
AAAI 2020
Lifelong Learning with a Changing Action Set
AAAI 2020
Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning
AAAI 2020
A Reinforcement Learning Approach to Strategic Belief Revelation with Social Influence
AAAI 2020
Off-Policy Evaluation in Partially Observable Environments
AAAI 2020
Dynamic Reward-Based Dueling Deep Dyna-Q: Robust Policy Learning in Noisy Environments
AAAI 2020
Reinforced Curriculum Learning on Pre-Trained Neural Machine Translation Models
AAAI 2020
Sequence Generation with Optimal-Transport-Enhanced Reinforcement Learning
AAAI 2020
Effective Diversity in Population Based Reinforcement Learning
NIPS 2020
Collapsing Bandits and Their Application to Public Health Intervention
NIPS 2020
A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms
NIPS 2020
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs
NIPS 2020
Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward
ACL 2020
Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis
NIPS 2020
Online Planning with Lookahead Policies
NIPS 2020
Memory-Efficient Learning of Stable Linear Dynamical Systems for Prediction and Control
NIPS 2020
Predictive Information Accelerates Learning in RL
NIPS 2020
The Mean-Squared Error of Double Q-Learning
NIPS 2020
Off-Policy Evaluation via the Regularized Lagrangian
NIPS 2020
Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
CVPR 2020
Fast Template Matching and Update for Video Object Tracking and Segmentation
CVPR 2020
Achieving Fairness in the Stochastic Multi-Armed Bandit Problem
AAAI 2020
<
1
…
86
87
88
…
118
>