Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Armed Bandits
1044 directly classified papers
Papers per year
2002: 1
2006: 2
2007: 3
2008: 5
2009: 3
2010: 5
2011: 23
2012: 16
2013: 32
2014: 42
2015: 27
2016: 33
2017: 46
2018: 55
2019: 80
2020: 87
2021: 124
2022: 160
2023: 136
2024: 126
2025: 38
Papers
Deep Hierarchy in Bandits
ICML 2022
Smoothed Adversarial Linear Contextual Bandits with Knapsacks
ICML 2022
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
ICML 2022
Safe Exploration for Efficient Policy Evaluation and Comparison
ICML 2022
Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences
ICML 2022
Off-Policy Evaluation for Large Action Spaces via Embeddings
ICML 2022
Instance Dependent Regret Analysis of Kernelized Bandits
ICML 2022
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces
ICML 2022
Socially Fair Mitigation of Misinformation on Social Networks via Constraint Stochastic Optimization
AAAI 2022
Bandit Limited Discrepancy Search and Application to Machine Learning Pipeline Optimization
AAAI 2022
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-profits in Improving Maternal and Child Health
AAAI 2022
Adversarial Attacks on Gaussian Process Bandits
ICML 2022
A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds
ICML 2022
Gaussian Process Bandits with Aggregated Feedback
AAAI 2022
No Weighted-Regret Learning in Adversarial Bandits with Delays
JMLR 2022
KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints
JMLR 2022
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
ICML 2022
Towards Off-Policy Learning for Ranking Policies with Logged Feedback
AAAI 2022
Contextual Information-Directed Sampling
ICML 2022
Distributionally-Aware Kernelized Bandit Problems for Risk Aversion
ICML 2022
Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits
AAAI 2022
A Simple Unified Framework for High Dimensional Bandit Problems
ICML 2022
Multi-slots Online Matching with High Entropy
ICML 2022
An Online Learning Approach to Sequential User-Centric Selection Problems
AAAI 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms
ICML 2022
<
1
…
13
14
15
…
42
>