Papers
Bandit problems with fidelity rewards
JMLR 2023
On the Convergence of Distributed Stochastic Bilevel Optimization Algorithms over a Network
AISTATS 2023
Primal-Dual Stochastic Mirror Descent for MDPs
AISTATS 2022
Rotting Infinitely Many-Armed Bandits
ICML 2022
Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences
ICML 2022
Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence
ICML 2022