Reinforcement Learning

2932 directly classified papers

Papers

Unsupervised Object Interaction Learning with Counterfactual Dynamics Models AAAI 2024

Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning AAAI 2024

Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention AAAI 2024

Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge AAAI 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments AAAI 2024

Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms NIPS 2024

Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity JMLR 2024

Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds JMLR 2024

A Reinforcement-Learning-Based Multiple-Column Selection Strategy for Column Generation AAAI 2024

Online Markov Decision Processes Configuration with Continuous Decision Space AAAI 2024

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations AAAI 2024

Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing AAAI 2024

DiffPhyCon: A Generative Approach to Control Complex Physical Systems NIPS 2024

When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback NIPS 2024

RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models NAACL 2024

A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation NIPS 2024

Deterministic Policies for Constrained Reinforcement Learning in Polynomial Time NIPS 2024

Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning JMLR 2024

Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems NIPS 2024

Controlling Character Motions Without Observable Driving Source WACV 2024

Imitating Language via Scalable Inverse Reinforcement Learning NIPS 2024

Decentralized Natural Policy Gradient with Variance Reduction for Collaborative Multi-Agent Reinforcement Learning JMLR 2024

Graph Diffusion Policy Optimization NIPS 2024

Causal Imitation for Markov Decision Processes: a Partial Identification Approach NIPS 2024

Speculative Monte-Carlo Tree Search NIPS 2024