← Learning Types

Machine Learning › Learning Types ›

Reinforcement Learning

2932 directly classified papers

Papers per year

Papers

Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ICML 2023

Counterfactual Learning with General Data-Generating Policies AAAI 2023

On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation AAAI 2023

Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes ICML 2023

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning ICML 2023

Online Reinforcement Learning with Uncertain Episode Lengths AAAI 2023

Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning AAAI 2023

Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning AAAI 2023

Goal-Conditioned Q-learning as Knowledge Distillation AAAI 2023

Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control AAAI 2023

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons ICML 2023

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning EMNLP 2023

Q-functionals for Value-Based Continuous Control AAAI 2023

Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction AAAI 2023

Adaptive Policy Learning for Offline-to-Online Reinforcement Learning AAAI 2023

The Benefits of Model-Based Generalization in Reinforcement Learning ICML 2023

Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics ICML 2023

An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning ICML 2023

The Wisdom of Hindsight Makes Language Models Better Instruction Followers ICML 2023

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes ICML 2023

Online Prototype Alignment for Few-shot Policy Transfer ICML 2023

On the Convergence of SARSA with Linear Function Approximation ICML 2023

Temporal Abstraction in Reinforcement Learning with the Successor Representation JMLR 2023

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks JMLR 2023

trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback EMNLP 2023