Papers
NysADMM: faster composite convex optimization via low-rank approximation
Shipu Zhao, Zachary Frangella, Madeleine Udell
Nyström Kernel Mean Embeddings
Antoine Chatalic, Nicolas Schreuder, Lorenzo Rosasco et al.
Object Permanence Emerges in a Random Walk along Memory
Pavel Tokmakov, Allan Jabri, Jie Li et al.
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang, An Yang, Rui Men et al.
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H Pong, Ashvin V Nair, Laura M Smith et al.
Offline RL Policies Should Be Trained to be Adaptive
Dibya Ghosh, Anurag Ajay, Pulkit Agrawal et al.
Off-Policy Evaluation for Large Action Spaces via Embeddings
Yuta Saito, Thorsten Joachims
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni et al.
Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han, Zhizhou Ren, Zuofan Wu et al.
Omni-Granular Ego-Semantic Propagation for Self-Supervised Graph Representation Learning
Ling Yang, Shenda Hong
On Collective Robustness of Bagging Against Data Poisoning
Ruoxin Chen, Zenan Li, Jie Li et al.
On Convergence of Gradient Descent Ascent: A Tight Local Analysis
Haochuan Li, Farzan Farnia, Subhro Das et al.
On Distribution Shift in Learning-based Bug Detectors
Jingxuan He, Luca Beurer-Kellner, Martin Vechev
One-Pass Algorithms for MAP Inference of Nonsymmetric Determinantal Point Processes
Aravind Reddy, Ryan A. Rossi, Zhao Song et al.
One-Pass Diversified Sampling with Application to Terabyte-Scale Genomic Sequence Streams
Benjamin Coleman, Benito Geordie, Li Chou et al.
On Implicit Bias in Overparameterized Bilevel Optimization
Paul Vicol, Jonathan P Lorraine, Fabian Pedregosa et al.
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning
Weichao Mao, Lin Yang, Kaiqing Zhang et al.
On Last-Iterate Convergence Beyond Zero-Sum Games
Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina et al.
On Learning Mixture of Linear Regressions in the Non-Realizable Setting
Soumyabrata Pal, Arya Mazumdar, Rajat Sen et al.
Online Active Regression
Cheng Chen, Yi Li, Yiming Sun
Online Algorithms with Multiple Predictions
Keerti Anand, Rong Ge, Amit Kumar et al.
Online and Consistent Correlation Clustering
Vincent Cohen-Addad, Silvio Lattanzi, Andreas Maggiori et al.
Online Balanced Experimental Design
David Arbour, Drew Dimmery, Tung Mai et al.
Online Continual Learning through Mutual Information Maximization
Yiduo Guo, Bing Liu, Dongyan Zhao