Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Reinforcement Learning
2932 directly classified papers
Papers per year
2003: 1
2006: 11
2007: 18
2008: 23
2009: 14
2010: 22
2011: 24
2012: 34
2013: 26
2014: 24
2015: 14
2016: 23
2017: 79
2018: 182
2019: 255
2020: 284
2021: 333
2022: 319
2023: 315
2024: 457
2025: 419
2026: 55
Papers
A Fairness-Driven Method for Learning Human-Compatible Negotiation Strategies
EMNLP 2024
Online Learning of Decision Trees with Thompson Sampling
AISTATS 2024
Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models
EMNLP 2024
Achieving Stronger Generation via Simple Contrastive Tuning
EMNLP 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
NIPS 2024
A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search Over Policy Trees
AAAI 2024
Fast Rates for Maximum Entropy Exploration
ICML 2023
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
ICML 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
EMNLP 2023
Don’t Add, don’t Miss: Effective Content Preserving Generation from Pre-Selected Text Spans
EMNLP 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
ICML 2023
GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification
AAAI 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
EMNLP 2023
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm
AAAI 2023
HuatuoGPT, Towards Taming Language Model to Be a Doctor
EMNLP 2023
STEER: Unified Style Transfer with Expert Reinforcement
EMNLP 2023
Temporal Extrapolation and Knowledge Transfer for Lifelong Temporal Knowledge Graph Reasoning
EMNLP 2023
Hybrid Learning with New Value Function for the Maximum Common Induced Subgraph Problem
AAAI 2023
Inverse Reinforcement Learning for Text Summarization
EMNLP 2023
Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward
EMNLP 2023
Doolittle: Benchmarks and Corpora for Academic Writing Formalization
EMNLP 2023
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
EMNLP 2023
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
ICML 2023
LDM2: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
EMNLP 2023
Exploratory Inference Learning for Scribble Supervised Semantic Segmentation
AAAI 2023
<
1
…
37
38
39
…
118
>