Papers
1,396 papers found
Beyond No Regret: Instance-Dependent PAC Reinforcement Learning
Andrew J Wagenmaker, Max Simchowitz, Kevin Jamieson
Big-Step-Little-Step: Efficient Gradient Methods for Objectives with Multiple Scales
Jonathan Kelner, Annie Marsden, Vatsal Sharan et al.
Can Q-learning be Improved with Advice?
Noah Golowich, Ankur Moitra
Chained generalisation bounds
Eugenio Clerico, Amitis Shidani, George Deligiannidis et al.
Chasing Convex Bodies and Functions with Black-Box Advice
Nicolas Christianson, Tinashe Handina, Adam Wierman
Clustering with Queries under Semi-Random Noise
Alberto Del Pia, Mingchen Ma, Christos Tzamos
Community Recovery in the Degree-Heterogeneous Stochastic Block Model
Vincent Cohen-Addad, Frederik Mallmann-Trenn, David Saulpic
Complete Policy Regret Bounds for Tallying Bandits
Dhruv Malik, Yuanzhi Li, Aarti Singh
Computational-Statistical Gap in Reinforcement Learning
Daniel Kane, Sihan Liu, Shachar Lovett et al.
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo, Mengxiao Zhang, Peng Zhao et al.
Corruption-Robust Contextual Search through Density Updates
Renato Paes Leme, Chara Podimata, Jon Schneider
Damped Online Newton Step for Portfolio Selection
Zakaria Mhammedi, Alexander Rakhlin
Depth and Feature Learning are Provably Beneficial for Neural Network Discriminators
Carles Domingo-Enrich
Derivatives and residual distribution of regularized M-estimators with application to adaptive tuning
Pierre C Bellec, Yiwei Shen
Differential privacy and robust statistics in high dimensions
Xiyang Liu, Weihao Kong, Sewoong Oh
Dimension-free convergence rates for gradient Langevin dynamics in RKHS
Boris Muzellec, Kanji Sato, Mathurin Massias et al.
Efficient Convex Optimization Requires Superlinear Memory
Annie Marsden, Vatsal Sharan, Aaron Sidford et al.
Efficient decentralized multi-agent learning in asymmetric queuing systems
Daniel Freund, Thodoris Lykouris, Wentao Weng
Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics
Asaf B Cassel, Alon Cohen, Tomer Koren
EM’s Convergence in Gaussian Latent Tree Models
Yuval Dagan, Vardis Kandiros, Constantinos Daskalakis
Exact Community Recovery in Correlated Stochastic Block Models
Julia Gaudio, Miklos Z. Racz, Anirudh Sridhar
Fast algorithm for overcomplete order-3 tensor decomposition
Jingqiu Ding, Tommaso d’Orsi, Chih-Hung Liu et al.
Faster online calibration without randomization: interval forecasts and the power of two choices
Chirag Gupta, Aaditya Ramdas