Papers
Approximation Algorithms for Fair Range Clustering
Sedjro Salomon Hotegni, Sepideh Mahabadi, Ali Vakilian
Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input
Shokichi Takakura, Taiji Suzuki
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Siyuan Li, Di Wu, Fang Wu et al.
Are Diffusion Models Vulnerable to Membership Inference Attacks?
Jinhao Duan, Fei Kong, Shiqi Wang et al.
Are Equivariant Equilibrium Approximators Beneficial?
Zhijian Duan, Yunxuan Ma, Xiaotie Deng
Are Gaussian Data All You Need? The Extents and Limits of Universality in High-Dimensional Generalized Linear Estimation
Luca Pesce, Florent Krzakala, Bruno Loureiro et al.
A Reinforcement Learning Framework for Dynamic Mediation Analysis
Lin Ge, Jitao Wang, Chengchun Shi et al.
Are labels informative in semi-supervised learning? Estimating and leveraging the missing-data mechanism.
Aude Sportisse, Hugo Schmutz, Olivier Humbert et al.
Are Large Kernels Better Teachers than Transformers for ConvNets?
Tianjin Huang, Lu Yin, Zhenyu Zhang et al.
Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations
Yongyi Yang, Jacob Steinhardt, Wei Hu
Are Random Decompositions all we need in High Dimensional Bayesian Optimisation?
Juliusz Krzysztof Ziomek, Haitham Bou Ammar
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Luke Vilnis, Yury Zemlyanskiy, Patrick Murray et al.
A Robust Optimisation Perspective on Counterexample-Guided Repair of Neural Networks
David Boetius, Stefan Leue, Tobias Sutter
A Robust Test for the Stationarity Assumption in Sequential Decision Making
Jitao Wang, Chengchun Shi, Zhenke Wu
A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP
Chi Bach Pham, Wynita Griggs, James Saunderson
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models
James Urquhart Allingham, Jie Ren, Michael W Dusenberry et al.
A Statistical Perspective on Retrieval-Based Models
Soumya Basu, Ankit Singh Rawat, Manzil Zaheer
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff, Minqi Jiang, Roberta Raileanu
A Study on Transformer Configuration and Training Objective
Fuzhao Xue, Jianghai Chen, Aixin Sun et al.
Atari-5: Distilling the Arcade Learning Environment down to Five Games
Matthew Aitchison, Penny Sweetser, Marcus Hutter
A Theoretical Analysis of the Learning Dynamics under Class Imbalance
Emanuele Francazi, Marco Baity-Jesi, Aurelien Lucchi
A theory of continuous generative flow networks
Salem Lahlou, Tristan Deleu, Pablo Lemos et al.
A theory of representation learning gives a deep generalisation of kernel methods
Adam X. Yang, Maxime Robeyns, Edward Milsom et al.
A Three-regime Model of Network Pruning
Yefan Zhou, Yaoqing Yang, Arin Chang et al.
A Toy Model of Universality: Reverse Engineering how Networks Learn Group Operations
Bilal Chughtai, Lawrence Chan, Neel Nanda