Papers
No-Regret Reinforcement Learning with Heavy-Tailed Rewards
Vincent Zhuang, Yanan Sui
Novel Change of Measure Inequalities with Applications to PAC-Bayesian Bounds and Monte Carlo Estimation
Yuki Ohnishi, Jean Honorio
Offline detection of change-points in the mean for stationary graph signals.
Alejandro de la Concha Duarte, Nicolas Vayatis, Argyris Kalogeratos
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders
Andrew Bennett, Nathan Kallus, Lihong Li et al.
On Data Efficiency of Meta-learning
Maruan Al-Shedivat, Liam Li, Eric Xing et al.
One-pass Stochastic Gradient Descent in overparametrized two-layer neural networks
Hanjing Zhu, Jiaming Xu
One-Round Communication Efficient Distributed M-Estimation
Yajie Bao, Weijia Xiong
On Information Gain and Regret Bounds in Gaussian Process Bandits
Sattar Vakili, Kia Khezeli, Victor Picheny
On Learning Continuous Pairwise Markov Random Fields
Abhin Shah, Devavrat Shah, Gregory Wornell
Online Active Model Selection for Pre-trained Classifiers
Mohammad Reza Karimi, Nezihe Merve Gürel, Bojan Karlaš et al.
Online Forgetting Process for Linear Regression Models
Yuantong Li, Chi-Hua Wang, Guang Cheng
Online k-means Clustering
Vincent Cohen-Addad, Benjamin Guedj, Varun Kanade et al.
Online Model Selection for Reinforcement Learning with Function Approximation
Jonathan Lee, Aldo Pacchiano, Vidya Muthukumar et al.
Online probabilistic label trees
Kalina Jasinska-Kobus, Marek Wydmuch, Devanathan Thiruvenkatachari et al.
Online Robust Control of Nonlinear Systems with Large Uncertainty
Dimitar Ho, Hoang Le, John Doyle et al.
Online Sparse Reinforcement Learning
Botao Hao, Tor Lattimore, Csaba Szepesvari et al.
On Multilevel Monte Carlo Unbiased Gradient Estimation for Deep Latent Variable Models
Yuyang Shi, Rob Cornish
On Projection Robust Optimal Transport: Sample Complexity and Model Misspecification
Tianyi Lin, Zeyu Zheng, Elynn Chen et al.
On Riemannian Stochastic Approximation Schemes with Fixed Step-Size
Alain Durmus, Pablo Jiménez, Eric Moulines et al.
On the Absence of Spurious Local Minima in Nonlinear Low-Rank Matrix Recovery Problems
Yingjie Bi, Javad Lavaei
On the Consistency of Metric and Non-Metric K-Medoids
He Jiang, Ery Arias-Castro
On the Convergence of Gradient Descent in GANs: MMD GAN As a Gradient Flow
Youssef Mroueh, Truyen Nguyen
On the convergence of the Metropolis algorithm with fixed-order updates for multivariate binary probability distributions
Kai Brügge, Asja Fischer, Christian Igel
On the Effect of Auxiliary Tasks on Representation Dynamics
Clare Lyle, Mark Rowland, Georg Ostrovski et al.