Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
VECA: A New Benchmark and Toolkit for General Cognitive Development
AAAI 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
EMNLP 2022
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards
EMNLP 2022
Guiding Abstractive Dialogue Summarization with Content Planning
EMNLP 2022
Turning Fixed to Adaptive: Integrating Post-Evaluation into Simultaneous Machine Translation
EMNLP 2022
Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation
EMNLP 2022
Text Editing as Imitation Game
EMNLP 2022
Reinforced Question Rewriting for Conversational Question Answering
EMNLP 2022
ScienceWorld: Is your Agent Smarter than a 5th Grader?
EMNLP 2022
CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
EMNLP 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
EMNLP 2022
Composing Ci with Reinforced Non-autoregressive Text Generation
EMNLP 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
EMNLP 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
EMNLP 2022
Generating Natural Language Proofs with Verifier-Guided Search
EMNLP 2022
Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
COLING 2022
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
NAACL 2022
Active Example Selection for In-Context Learning
EMNLP 2022
Pre-Trained Language Models for Interactive Decision-Making
NIPS 2022
Towards Automating the Generation of Human-Robot Interaction Scenarios
AAAI 2022
Selecting Optimal Context Sentences for Event-Event Relation Extraction
AAAI 2022
Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
AAAI 2022
DiPS: Differentiable Policy for Sketching in Recommender Systems
AAAI 2022
Characterization of Incentive Compatibility of an Ex-ante Constrained Player
AAAI 2022
<
1
…
51
52
53
…
118
>