← Learning Types

Machine Learning › Learning Types ›

Multi-Armed Bandits

1044 directly classified papers

Papers per year

Papers

Deep Hierarchy in Bandits ICML 2022

Smoothed Adversarial Linear Contextual Bandits with Knapsacks ICML 2022

Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits ICML 2022

Safe Exploration for Efficient Policy Evaluation and Comparison ICML 2022

Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences ICML 2022

Off-Policy Evaluation for Large Action Spaces via Embeddings ICML 2022

Instance Dependent Regret Analysis of Kernelized Bandits ICML 2022

Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces ICML 2022

Socially Fair Mitigation of Misinformation on Social Networks via Constraint Stochastic Optimization AAAI 2022

Bandit Limited Discrepancy Search and Application to Machine Learning Pipeline Optimization AAAI 2022

Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-profits in Improving Maternal and Child Health AAAI 2022

Adversarial Attacks on Gaussian Process Bandits ICML 2022

A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds ICML 2022

Gaussian Process Bandits with Aggregated Feedback AAAI 2022

No Weighted-Regret Learning in Adversarial Bandits with Delays JMLR 2022

KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints JMLR 2022

Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits ICML 2022

Towards Off-Policy Learning for Ranking Policies with Logged Feedback AAAI 2022

Contextual Information-Directed Sampling ICML 2022

Distributionally-Aware Kernelized Bandit Problems for Risk Aversion ICML 2022

Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits AAAI 2022

A Simple Unified Framework for High Dimensional Bandit Problems ICML 2022

Multi-slots Online Matching with High Entropy ICML 2022

An Online Learning Approach to Sequential User-Centric Selection Problems AAAI 2022

Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms ICML 2022