Papers
Online Learning Rate Adaptation with Hypergradient Descent
Atilim Gunes Baydin, Robert Cornish, David Martinez Rubio et al.
Online Learning: Sufficient Statistics and the Burkholder Method
Dylan J. Foster, Alexander Rakhlin, Karthik Sridharan
Online Learning with Abstention
Corinna Cortes, Giulia DeSalvo, Claudio Gentile et al.
Online Learning with an Unknown Fairness Metric
Stephen Gillen, Christopher Jung, Michael Kearns et al.
Online Learning with Non-Convex Losses and Non-Stationary Regret
Xiand Gao, Xiaobo Li, Shuzhong Zhang
Online Linear Quadratic Control
Alon Cohen, Avinatan Hasidim, Tomer Koren et al.
Online Multi-Object Tracking with Dual Matching Attention Networks
Ji Zhu, Hua Yang, Nian Liu et al.
Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations
Weichao Mao, Zhenzhe Zheng, Fan Wu et al.
Online Reciprocal Recommendation with Theoretical Performance Guarantees
Fabio Vitale, Nikos Parotsidis, Claudio Gentile
Online Regression with Partial Information: Generalization and Linear Projection
Shinji Ito, Daisuke Hatano, Hanna Sumita et al.
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron Havens, Zhanhong Jiang, Soumik Sarkar
Online Speech Translation System for Tamil
Madhavaraj Ayyavu, Shiva Kumar H R, Ramakrishnan A G
Online Structured Laplace Approximations for Overcoming Catastrophic Forgetting
Hippolyt Ritter, Aleksandar Botev, David Barber
Online Structure Learning for Feed-Forward and Recurrent Sum-Product Networks
Agastya Kalra, Abdullah Rashwan, Wei-Shou Hsu et al.
Online User Assessment for Minimal Intervention During Task-Based Robotic Assistance
Aleksandra Kalinowska, Kathleen Fitzsimons, Julius Dewald et al.
Online Variance Reduction for Stochastic Optimization
Zalan Borsos, Andreas Krause, Kfir Y. Levy
On Markov Chain Gradient Descent
Tao Sun, Yuejiao Sun, Wotao Yin
On Matching Pursuit and Coordinate Descent
Francesco Locatello, Anant Raj, Sai Praneeth Karimireddy et al.
On Misinformation Containment in Online Social Networks
Amo Tong, Ding-Zhu Du, Weili Wu
On Nesting Monte Carlo Estimators
Tom Rainforth, Rob Cornish, Hongseok Yang et al.
On Neuronal Capacity
Pierre Baldi, Roman Vershynin
On Offline Evaluation of Vision-based Driving Models
Felipe Codevilla, Antonio M. Lopez, Vladlen Koltun et al.
On Oracle-Efficient PAC RL with Rich Observations
Christoph Dann, Nan Jiang, Akshay Krishnamurthy et al.
On preserving non-discrimination when combining expert advice
Avrim Blum, Suriya Gunasekar, Thodoris Lykouris et al.
On Q-learning Convergence for Non-Markov Decision Processes
Sultan Javed Majeed, Marcus Hutter