conftrace
Reinforcement Learning
2,932 papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Reinforcement Learning with Large Action Spaces for Neural Machine Translation
COLING 2022
Comparing BERT-based Reward Functions for Deep Reinforcement Learning in Machine Translation
COLING 2022
Reinforced Structured State-Evolution for Vision-Language Navigation
CVPR 2022
Global-Aware Registration of Less-Overlap RGB-D Scans
CVPR 2022
DECORE: Deep Compression With Reinforcement Learning
CVPR 2022
Is Mapping Necessary for Realistic PointGoal Navigation?
CVPR 2022
AME: Attention and Memory Enhancement in Hyper-Parameter Optimization
CVPR 2022
Sketching Without Worrying: Noise-Tolerant Sketch-Based Image Retrieval
CVPR 2022
Generating Natural Language Proofs with Verifier-Guided Search
EMNLP 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
EMNLP 2022
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
EMNLP 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
EMNLP 2022
Composing Ci with Reinforced Non-autoregressive Text Generation
EMNLP 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
EMNLP 2022
Active Example Selection for In-Context Learning
EMNLP 2022
CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
EMNLP 2022
ScienceWorld: Is your Agent Smarter than a 5th Grader?
EMNLP 2022
Reinforced Question Rewriting for Conversational Question Answering
EMNLP 2022
RL with KL penalties is better viewed as Bayesian inference
EMNLP 2022
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator
EMNLP 2022
Text Editing as Imitation Game
EMNLP 2022
Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation
EMNLP 2022
Turning Fixed to Adaptive: Integrating Post-Evaluation into Simultaneous Machine Translation
EMNLP 2022
Guiding Abstractive Dialogue Summarization with Content Planning
EMNLP 2022
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards
EMNLP 2022