← Learning Types

Machine Learning › Learning Types ›

Reinforcement Learning

2932 directly classified papers

Papers per year

Papers

On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness ICML 2023

Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL ICML 2023

An Instrumental Variable Approach to Confounded Off-Policy Evaluation ICML 2023

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents ICML 2023

Universal Morphology Control via Contextual Modulation ICML 2023

Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification ICML 2023

Semiparametrically Efficient Off-Policy Evaluation in Linear Markov Decision Processes ICML 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer ICML 2023

Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards ICML 2023

Distributional Offline Policy Evaluation with Predictive Error Guarantees ICML 2023

The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond ICML 2023

Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap ICML 2023

SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models ICML 2023

Multi-Environment Pretraining Enables Transfer to Action Limited Datasets ICML 2023

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms ICML 2023

Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings ICML 2023

Fast Rates for Maximum Entropy Exploration ICML 2023

Reinforcement Learning with History Dependent Dynamic Contexts ICML 2023

VA-learning as a more efficient alternative to Q-learning ICML 2023

Towards a better understanding of representation dynamics under TD-learning ICML 2023

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm ICML 2023

Tile Networks: Learning Optimal Geometric Layout for Whole-page Recommendation AISTATS 2022

Sample Complexity of Robust Reinforcement Learning with a Generative Model AISTATS 2022

Finite Sample Analysis of Mean-Volatility Actor-Critic for Risk-Averse Reinforcement Learning AISTATS 2022

Towards an Understanding of Default Policies in Multitask Policy Optimization AISTATS 2022