Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
AAAI 2020
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
AAAI 2020
Spatiotemporally Constrained Action Space Attacks on Deep Reinforcement Learning Agents
AAAI 2020
A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players
AISTATS 2020
Table2Analysis: Modeling and Recommendation of Common Analysis Patterns for Multi-Dimensional Data
AAAI 2020
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning
AAAI 2020
Double-Oracle Sampling Method for Stackelberg Equilibrium Approximation in General-Sum Extensive-Form Games
AAAI 2020
Hierarchical Text Classification with Reinforced Label Assignment
EMNLP 2019
Learning Dynamic Context Augmentation for Global Entity Linking
EMNLP 2019
A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification
ACL 2019
Sentence Mover’s Similarity: Automatic Evaluation for Multi-Sentence Texts
ACL 2019
Neural Keyphrase Generation via Reinforcement Learning with Adaptive Rewards
ACL 2019
Reinforced Training Data Selection for Domain Adaptation
ACL 2019
Historical Text Normalization with Delayed Rewards
ACL 2019
LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification
EMNLP 2019
Answer-guided and Semantic Coherent Question Generation in Open-domain Conversation
EMNLP 2019
Better Rewards Yield Better Summaries: Learning to Summarise Without References
EMNLP 2019
Deep Reinforcement Learning-based Text Anonymization against Private-Attribute Inference
EMNLP 2019
DBA: Dynamic Multi-Armed Bandit Algorithm
AAAI 2019
Querying NoSQL with Deep Learning to Answer Natural Language Questions
AAAI 2019
Personalized Robot Tutoring Using the Assistive Tutor POMDP (AT-POMDP)
AAAI 2019
Refining Abstraction Heuristics during Real-Time Planning
AAAI 2019
Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning
AAAI 2019
Trainable Undersampling for Class-Imbalance Learning
AAAI 2019
Policy Optimization with Model-Based Explorations
AAAI 2019
<
1
…
87
88
89
…
118
>