Papers
Online Composite Optimization Between Stochastic and Adversarial Environments
Yibo Wang, Sijia Chen, Wei Jiang et al.
Online Consistency of the Nearest Neighbor Rule
Sanjoy Dasgupta, Geelon So
Online Control in Population Dynamics
Noah Golowich, Elad Hazan, Zhou Lu et al.
Online Control with Adversarial Disturbance for Continuous-time Linear Systems
Jingwei Li, Jing Dong, Can Chang et al.
Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously
Stephen Pasteris, Chris Hicks, Vasilios Mavroudis et al.
Online Estimation via Offline Estimation: An Information-Theoretic Framework
Dylan J. Foster, Yanjun Han, Jian Qian et al.
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
Ruihan Wu, Siddhartha Datta, Yi Su et al.
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye, Wei Xiong, Yuheng Zhang et al.
Online Learning of Delayed Choices
Recep Yusuf Bekci
Online Learning with Sublinear Best-Action Queries
Matteo Russo, Andrea Celli, Riccardo Colini-Baldeschi et al.
Online Non-convex Learning in Dynamic Environments
Zhipan Xu, Lijun Zhang
Online Posterior Sampling with a Diffusion Prior
Branislav Kveton, Boris N. Oreshkin, Youngsuk Park et al.
Online Relational Inference for Evolving Multi-agent Interacting Systems
Beomseok Kang, Priyabrata Saha, Sudarshan Sharma et al.
OnlineTAS: An Online Baseline for Temporal Action Segmentation
Qing Zhong, Guodong Ding, Angela Yao
Online Weighted Paging with Unknown Weights
Orin Levy, Noam Touitou, Aviv Rosenberg
Only Strict Saddles in the Energy Landscape of Predictive Coding Networks?
Francesco Innocenti, El Mehdi Achour, Ryan Singh et al.
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
Chenyu Zheng, Wei Huang, Rongzhen Wang et al.
On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models
Boyao Li, Alexander J. Thomson, Houssam Nassif et al.
On provable privacy vulnerabilities of graph representations
Ruofan Wu, Guanhua Fang, Mingyang Zhang et al.
On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance
Zhixiong Nan, Yilong Chen, Tianfei Zhou et al.
On Sampling Strategies for Spectral Model Sharding
Denis Korzhenkov, Christos Louizos
On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton, Noah Y. Siegel, János Kramár et al.
On Socially Fair Low-Rank Approximation and Column Subset Selection
Zhao Song, Ali Vakilian, David P. Woodruff et al.
On Softmax Direct Preference Optimization for Recommendation
Yuxin Chen, Junfei Tan, An Zhang et al.
On Sparse Canonical Correlation Analysis
Yongchun Li, Santanu S. Dey, Weijun Xie