← Learning Types

Machine Learning › Learning Types ›

Reinforcement Learning

2932 directly classified papers

Papers per year

Papers

QGFN: Controllable Greediness with Action Values NIPS 2024

Responsible Bandit Learning via Privacy-Protected Mean-Volatility Utility AAAI 2024

Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It’s Complicated AAAI 2024

Learning Optimal Advantage from Preferences and Mistaking It for Reward AAAI 2024

Variational Delayed Policy Optimization NIPS 2024

WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment NIPS 2024

Explaining Reinforcement Learning Agents through Counterfactual Action Outcomes AAAI 2024

Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL NIPS 2024

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity AISTATS 2024

Automated Design of Affine Maximizer Mechanisms in Dynamic Settings AAAI 2024

Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity NIPS 2024

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms NIPS 2024

Rethinking Discount Regularization: New Interpretations, Unintended Consequences, and Solutions for Regularization in Reinforcement Learning JMLR 2024

Active Reinforcement Learning for Robust Building Control AAAI 2024

Reward (Mis)design for Autonomous Driving (Abstract Reprint) AAAI 2024

Model-Free Representation Learning and Exploration in Low-Rank MDPs JMLR 2024

Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning AAAI 2024

Learning Encodings for Constructive Neural Combinatorial Optimization Needs to Regret AAAI 2024

Task Planning for Object Rearrangement in Multi-Room Environments AAAI 2024

Effect-Invariant Mechanisms for Policy Generalization JMLR 2024

Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming AAAI 2024

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation AAAI 2024

DiG-In-GNN: Discriminative Feature Guided GNN-Based Fraud Detector against Inconsistencies in Multi-Relation Fraud Graph AAAI 2024

Limited Query Graph Connectivity Test AAAI 2024

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution AAAI 2024