Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
Multi-Task Off-Policy Learning from Bandit Feedback
ICML 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
ICML 2023
Distance Weighted Supervised Learning for Offline Interaction Data
ICML 2023
Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning
ICML 2023
Scaling Laws for Reward Model Overoptimization
ICML 2023
Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation
EMNLP 2023
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis
EMNLP 2023
Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation
CVPR 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
EMNLP 2023
Don’t Add, don’t Miss: Effective Content Preserving Generation from Pre-Selected Text Spans
EMNLP 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
EMNLP 2023
HuatuoGPT, Towards Taming Language Model to Be a Doctor
EMNLP 2023
STEER: Unified Style Transfer with Expert Reinforcement
EMNLP 2023
Temporal Extrapolation and Knowledge Transfer for Lifelong Temporal Knowledge Graph Reasoning
EMNLP 2023
Inverse Reinforcement Learning for Text Summarization
EMNLP 2023
Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward
EMNLP 2023
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
EMNLP 2023
LDM2: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
EMNLP 2023
Simultaneous Machine Translation with Tailored Reference
EMNLP 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
EMNLP 2023
INA: An Integrative Approach for Enhancing Negotiation Strategies with Reward-Based Dialogue Agent
EMNLP 2023
Intervention-Based Alignment of Code Search with Execution Feedback
EMNLP 2023
Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning
AISTATS 2023
Cooperative Multi-Agent Learning in a Complex World: Challenges and Solutions
AAAI 2023
Multiple-policy High-confidence Policy Evaluation
AISTATS 2023
<
1
…
46
47
48
…
118
>