Papers
11,955 papers found
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
Michael Zhang, Kush Bhatia, Hermann Kumbong et al.
The Hidden Language of Diffusion Models
Hila Chefer, Oran Lang, Mor Geva et al.
The Human-AI Substitution game: active learning from a strategic labeler
Tom Yan, Chicheng Zhang
The Impact of Digital Editing on the Study of Holocaust Survivors’ Testimonies in the context of Voci dall’Inferno Project
Angelo Mario Del Grosso, Marina Riccucci, Elvira Mercatanti
The importance of feature preprocessing for differentially private linear optimization
Ziteng Sun, Ananda Theertha Suresh, Aditya Krishna Menon
The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model
Daniel Goldfarb, Itay Evron, Nir Weinberger et al.
The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing
Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.
The LLM Surgeon
Tycho F. A. van der Ouderaa, Markus Nagel, Mart Van Baalen et al.
The Marginal Value of Momentum for Small Learning Rate SGD
Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.
The Need for Speed: Pruning Transformers with One Recipe
Samir Khaki, Konstantinos N Plataniotis
The optimality of kernel classifiers in Sobolev space
Jianfa Lai, zhifan Li, Dongming Huang et al.
Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach
Shaopeng Fu, Di Wang
Theoretical Understanding of Learning from Adversarial Perturbations
Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models
Saravanan Kandasamy, Dheeraj Nagaraj
The Prevalence of Neural Collapse in Neural Multivariate Regression
George Andriopoulos, Zixuan Dong, Li Guo et al.
The Production of Contrastive Focus by 7 to 13-year-olds Learning Mandarin Chinese
Zimeng Li, Zhongxuan Mao, Shengting Shen et al.
The Reasonableness Behind Unreasonable Translation Capability of Large Language Model
Tingchen Fu, Lemao Liu, Deng Cai et al.
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
The Trickle-down Impact of Reward Inconsistency on RLHF
Lingfeng Shen, Sihao Chen, Linfeng Song et al.
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pratyusha Sharma, Jordan T. Ash, Dipendra Misra
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu et al.
The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric
Daniel Severo, Lucas Theis, Jona Ballé
The Update-Equivalence Framework for Decision-Time Planning
Samuel Sokota, Gabriele Farina, David J Wu et al.
The Use of Modifiers and f0 in Remote Referential Communication with Human and Computer Partners
Iona Gessinger, Bistra Andreeva, Benjamin R. Cowan