← Learning Types

Machine Learning › Learning Types ›

Multi-Armed Bandits

1044 directly classified papers

Papers per year

Papers

Adaptation to Misspecified Kernel Regularity in Kernelised Bandits AISTATS 2023

Overcoming Prior Misspecification in Online Learning to Rank AISTATS 2023

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments AISTATS 2023

Stochastic Contextual Bandits with Long Horizon Rewards AAAI 2023

Delayed Feedback in Generalised Linear Bandits Revisited AISTATS 2023

Online Restless Bandits with Unobserved States ICML 2023

Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles AISTATS 2023

Learning Revenue Maximization Using Posted Prices for Stochastic Strategic Patient Buyers AAAI 2023

Optimal Algorithms for Latent Bandits with Cluster Structure AISTATS 2023

Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets AAAI 2023

One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits AISTATS 2023

Optimistic Whittle Index Policy: Online Learning for Restless Bandits AAAI 2023

Revisiting Weighted Strategy for Non-stationary Parametric Bandits AISTATS 2023

Towards Efficient and Domain-Agnostic Evasion Attack with High-Dimensional Categorical Inputs AAAI 2023

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference AISTATS 2023

Fairness and Welfare Quantification for Regret in Multi-Armed Bandits AAAI 2023

Meta-Learning for Simple Regret Minimization AAAI 2023

Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits AAAI 2023

Interactive Learning with Pricing for Optimal and Stable Allocations in Markets AISTATS 2023

Revisiting Simple Regret: Fast Rates for Returning a Good Arm ICML 2023

TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation AISTATS 2023

Stochastic Gradient Succeeds for Bandits ICML 2023

Almost Cost-Free Communication in Federated Best Arm Identification AAAI 2023

Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits AAAI 2023

Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health AAAI 2023