Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning
EMNLP 2023
Stochastic Contextual Bandits with Long Horizon Rewards
AAAI 2023
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback
EMNLP 2023
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
AAAI 2023
Planning and Learning with Adaptive Lookahead
AAAI 2023
On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
AAAI 2023
Online Prototype Alignment for Few-shot Policy Transfer
ICML 2023
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
EMNLP 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
EMNLP 2023
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning
AAAI 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
ICML 2023
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons
ICML 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
EMNLP 2023
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning
ICML 2023
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
ICML 2023
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness
ICML 2023
LDM2: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
EMNLP 2023
Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics
ICML 2023
Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward
EMNLP 2023
Weighted Policy Constraints for Offline Reinforcement Learning
AAAI 2023
Inverse Reinforcement Learning for Text Summarization
EMNLP 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
ICML 2023
Theory of Mind for Multi-Agent Collaboration via Large Language Models
EMNLP 2023
Continually Improving Extractive QA via Human Feedback
EMNLP 2023
Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
EMNLP 2023
<
1
…
38
39
40
…
118
>