← Learning Types

Machine Learning › Learning Types ›

Reinforcement Learning

2932 directly classified papers

Papers per year

Papers

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning EMNLP 2023

Stochastic Contextual Bandits with Long Horizon Rewards AAAI 2023

trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback EMNLP 2023

Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes AAAI 2023

Planning and Learning with Adaptive Lookahead AAAI 2023

On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation AAAI 2023

Online Prototype Alignment for Few-shot Policy Transfer ICML 2023

Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games EMNLP 2023

Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis EMNLP 2023

Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning AAAI 2023

Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs ICML 2023

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons ICML 2023

Crystal: Introspective Reasoners Reinforced with Self-Feedback EMNLP 2023

An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning ICML 2023

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes ICML 2023

On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness ICML 2023

LDM2: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement EMNLP 2023

Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics ICML 2023

Narrative Order Aware Story Generation via Bidirectional Pretraining Model with Optimal Transport Reward EMNLP 2023

Weighted Policy Constraints for Offline Reinforcement Learning AAAI 2023

Inverse Reinforcement Learning for Text Summarization EMNLP 2023

The Wisdom of Hindsight Makes Language Models Better Instruction Followers ICML 2023

Theory of Mind for Multi-Agent Collaboration via Large Language Models EMNLP 2023

Continually Improving Extractive QA via Human Feedback EMNLP 2023

Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning EMNLP 2023