Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Sample Complexity of Policy-Based Methods under Off-Policy Sampling and Linear Function Approximation
AISTATS 2022
Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions
AISTATS 2022
Efficient Inference for Dynamic Flexible Interactions of Neural Populations
JMLR 2022
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences
JMLR 2022
On Instrumental Variable Regression for Deep Offline Policy Evaluation
JMLR 2022
Graph Partitioning and Sparse Matrix Ordering using Reinforcement Learning and Graph Neural Networks
JMLR 2022
d3rlpy: An Offline Deep Reinforcement Learning Library
JMLR 2022
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
JMLR 2022
Controllable Text Simplification with Deep Reinforcement Learning
IJCNLP 2022
Different Data, Different Modalities! Reinforced Data Splitting for Effective Multimodal Information Extraction from Social Media Posts
COLING 2022
Reinforcement Learning with Large Action Spaces for Neural Machine Translation
COLING 2022
Learning Natural Language Generation with Truncated Reinforcement Learning
NAACL 2022
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning
NAACL 2022
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts
NAACL 2022
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony
NAACL 2022
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization
NAACL 2022
Interactive Query-Assisted Summarization via Deep Reinforcement Learning
NAACL 2022
Data Augmentation with Dual Training for Offensive Span Detection
NAACL 2022
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding
NAACL 2022
SURF: Semantic-level Unsupervised Reward Function for Machine Translation
NAACL 2022
Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition
NAACL 2022
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents
NAACL 2022
Partner Personas Generation for Dialogue Response Generation
NAACL 2022
Aligning to Social Norms and Values in Interactive Narratives
NAACL 2022
NSGZero: Efficiently Learning Non-exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
AAAI 2022
<
1
…
50
51
52
…
118
>