Research Explorer
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Continually Improving Extractive QA via Human Feedback
EMNLP 2023
Enhancing Task-oriented Dialogue Systems with Generative Post-processing Networks
EMNLP 2023
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
EMNLP 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
EMNLP 2023
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning
EMNLP 2023
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback
EMNLP 2023
Reinforced Target-driven Conversational Promotion
EMNLP 2023
Be Selfish, But Wisely: Investigating the Impact of Agent Personality in Mixed-Motive Human-Agent Interactions
EMNLP 2023
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
EMNLP 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
EMNLP 2023
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
EMNLP 2023
Non-stationary Reinforcement Learning under General Function Approximation
ICML 2023
Replicable Reinforcement Learning
NeurIPS 2023
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
ICML 2023
Efficient Online Reinforcement Learning with Offline Data
ICML 2023
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
ICML 2023
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance
CoRL 2023
Aligning Language Models with Preferences through $f$-divergence Minimization
ICML 2023
Reinforcement Learning from Passive Data via Latent Intentions
ICML 2023
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
ICML 2023
The Impact of Exploration on Convergence and Performance of Multi-Agent Q-Learning Dynamics
ICML 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
ICML 2023
Reinforcement Learning in Low-rank MDPs with Density Features
ICML 2023
Thompson Sampling with Diffusion Generative Prior
ICML 2023