Papers
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Junxiong Wang, Daniele Paliotta, Avner May et al.
The Many Faces of Optimal Weak-to-Strong Learning
Mikael Møller Høgsgaard, Kasper Green Larsen, Markus Engelund Mathiasen
The Map Equation Goes Neural: Mapping Network Flows with Graph Neural Networks
Christopher Blöcker, Chester Tan, Ingo Scholtes
The Minimax Rate of HSIC Estimation for Translation-Invariant Kernels
Florian Kalinke, Zoltán Szabó
The motion planning neural circuit in goal-directed navigation as Lie group operator search
Junfeng Zuo, Ying Nian Wu, Si Wu et al.
The Multimodal Universe: Enabling Large-Scale Machine Learning with 100 TB of Astronomical Scientific Data
Eirini Angeloudi, Jeroen Audenaert, Micah Bowles et al.
Theoretical Analysis of Weak-to-Strong Generalization
Hunter Lang, David Sontag, Aravindan Vijayaraghavan
Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks
Arjun Subramonian, Jian Kang, Yizhou Sun
Theoretical Characterisation of the Gauss Newton Conditioning in Neural Networks
Jim Zhao, Sidak Pal Singh, Aurelien Lucchi
Theoretical Foundations of Deep Selective State-Space Models
Nicola Muca Cirone, Antonio Orvieto, Benjamin Walker et al.
Theoretical guarantees in KL for Diffusion Flow Matching
Marta Gentiloni Silveri, Giovanni Conforti, Alain Durmus
Theoretical Investigations and Practical Enhancements on Tail Task Risk Minimization in Meta Learning
Yiqin Lv, Qi Wang, Dong Liang et al.
The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
Saravanan Kandasamy, Dheeraj Nagaraj
The Power of Extrapolation in Federated Learning
Hanmin Li, Kirill Acharya, Peter Richtárik
The Power of Hard Attention Transformers on Data Sequences: A formal language theoretic perspective
Pascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin et al.
The Power of Resets in Online Reinforcement Learning
Zakaria Mhammedi, Dylan J. Foster, Alexander Rakhlin
The Prevalence of Neural Collapse in Neural Multivariate Regression
George Andriopoulos, Zixuan Dong, Li Guo et al.
The Price of Implicit Bias in Adversarially Robust Generalization
Nikolaos Tsilivis, Natalie S. Frank, Nathan Srebro et al.
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
Hannah Rose Kirk, Alexander Whitefield, Paul Röttger et al.
The Reliability of OKRidge Method in Solving Sparse Ridge Regression Problems
Xiyuan Li, Youjun Wang, Weiwei Liu
The Representation Landscape of Few-Shot Learning and Fine-Tuning in Large Language Models
Diego Doimo, Alessandro Serra, Alessio Ansuini et al.
The Road Less Scheduled
Aaron Defazio, Xingyu (Alice) Yang, Harsh Mehta et al.
The Sample-Communication Complexity Trade-off in Federated Q-Learning
Sudeep Salgia, Yuejie Chi
The Sample Complexity of Gradient Descent in Stochastic Convex Optimization
Roi Livni, Amir, Koren et al.
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Kenneth Enevoldsen, Márton Kardos, Niklas Muennighoff et al.