Research Explorer
Multi-Armed Bandits
1044 directly classified papers
Papers per year
2002: 1
2006: 2
2007: 3
2008: 5
2009: 3
2010: 5
2011: 23
2012: 16
2013: 32
2014: 42
2015: 27
2016: 33
2017: 46
2018: 55
2019: 80
2020: 87
2021: 124
2022: 160
2023: 136
2024: 126
2025: 38
Papers
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits
AISTATS 2023
Overcoming Prior Misspecification in Online Learning to Rank
AISTATS 2023
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
AISTATS 2023
Stochastic Contextual Bandits with Long Horizon Rewards
AAAI 2023
Delayed Feedback in Generalised Linear Bandits Revisited
AISTATS 2023
Online Restless Bandits with Unobserved States
ICML 2023
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
AISTATS 2023
Learning Revenue Maximization Using Posted Prices for Stochastic Strategic Patient Buyers
AAAI 2023
Optimal Algorithms for Latent Bandits with Cluster Structure
AISTATS 2023
Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets
AAAI 2023
One Arrow, Two Kills: A Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits
AISTATS 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
AAAI 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
AISTATS 2023
Towards Efficient and Domain-Agnostic Evasion Attack with High-Dimensional Categorical Inputs
AAAI 2023
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
AISTATS 2023
Fairness and Welfare Quantification for Regret in Multi-Armed Bandits
AAAI 2023
Meta-Learning for Simple Regret Minimization
AAAI 2023
Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits
AAAI 2023
Interactive Learning with Pricing for Optimal and Stable Allocations in Markets
AISTATS 2023
Revisiting Simple Regret: Fast Rates for Returning a Good Arm
ICML 2023
TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation
AISTATS 2023
Stochastic Gradient Succeeds for Bandits
ICML 2023
Almost Cost-Free Communication in Federated Best Arm Identification
AAAI 2023
Efficient Explorative Key-Term Selection Strategies for Conversational Contextual Bandits
AAAI 2023
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health
AAAI 2023