Reinforcement Learning

2932 directly classified papers

Papers

A Fairness-Driven Method for Learning Human-Compatible Negotiation Strategies EMNLP 2024

Online Learning of Decision Trees with Thompson Sampling AISTATS 2024

Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models EMNLP 2024

Achieving Stronger Generation via Simple Contrastive Tuning EMNLP 2024

Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning NeurIPS 2024

A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search Over Policy Trees AAAI 2024

Fast Rates for Maximum Entropy Exploration ICML 2023

Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings ICML 2023

Tuna: Instruction Tuning using Feedback from Large Language Models EMNLP 2023

Don’t Add, don’t Miss: Effective Content Preserving Generation from Pre-Selected Text Spans EMNLP 2023

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms ICML 2023

GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification AAAI 2023

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data EMNLP 2023

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm AAAI 2023

HuatuoGPT, Towards Taming Language Model to Be a Doctor EMNLP 2023

STEER: Unified Style Transfer with Expert Reinforcement EMNLP 2023

Temporal Extrapolation and Knowledge Transfer for Lifelong Temporal Knowledge Graph Reasoning EMNLP 2023

Hybrid Learning with New Value Function for the Maximum Common Induced Subgraph Problem AAAI 2023

Inverse Reinforcement Learning for Text Summarization EMNLP 2023

Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward EMNLP 2023

Doolittle: Benchmarks and Corpora for Academic Writing Formalization EMNLP 2023

Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games EMNLP 2023

Multi-Environment Pretraining Enables Transfer to Action Limited Datasets ICML 2023

LDM2: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement EMNLP 2023

Exploratory Inference Learning for Scribble Supervised Semantic Segmentation AAAI 2023