Papers - Conftrace

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry

Michael Zhang, Kush Bhatia, Hermann Kumbong et al.

2024 ICLR

The Hidden Language of Diffusion Models

Hila Chefer, Oran Lang, Mor Geva et al.

2024 ICLR

The Human-AI Substitution game: active learning from a strategic labeler

Tom Yan, Chicheng Zhang

2024 ICLR

The Impact of Digital Editing on the Study of Holocaust Survivors’ Testimonies in the context of Voci dall’Inferno Project

Angelo Mario Del Grosso, Marina Riccucci, Elvira Mercatanti

2024 COLING

The importance of feature preprocessing for differentially private linear optimization

Ziteng Sun, Ananda Theertha Suresh, Aditya Krishna Menon

2024 ICLR

The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model

Daniel Goldfarb, Itay Evron, Nir Weinberger et al.

2024 ICLR

The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing

Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.

2024 ICLR

The LLM Surgeon

Tycho F. A. van der Ouderaa, Markus Nagel, Mart Van Baalen et al.

2024 ICLR

The Marginal Value of Momentum for Small Learning Rate SGD

Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.

2024 ICLR

The mechanistic basis of data dependence and abrupt learning in an in-context classification task

Gautam Reddy

2024 ICLR

The Need for Speed: Pruning Transformers with One Recipe

Samir Khaki, Konstantinos N Plataniotis

2024 ICLR

The optimality of kernel classifiers in Sobolev space

Jianfa Lai, zhifan Li, Dongming Huang et al.

2024 ICLR

Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach

Shaopeng Fu, Di Wang

2024 ICLR

Theoretical Understanding of Learning from Adversarial Perturbations

Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki

2024 ICLR

The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models

Saravanan Kandasamy, Dheeraj Nagaraj

2024 NIPS

The Prevalence of Neural Collapse in Neural Multivariate Regression

George Andriopoulos, Zixuan Dong, Li Guo et al.

2024 NIPS

The Production of Contrastive Focus by 7 to 13-year-olds Learning Mandarin Chinese

Zimeng Li, Zhongxuan Mao, Shengting Shen et al.

2024 INTERSPEECH

The Reasonableness Behind Unreasonable Translation Capability of Large Language Model

Tingchen Fu, Lemao Liu, Deng Cai et al.

2024 ICLR

The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”

Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.

2024 ICLR

The Trickle-down Impact of Reward Inconsistency on RLHF

Lingfeng Shen, Sihao Chen, Linfeng Song et al.

2024 ICLR

The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Pratyusha Sharma, Jordan T. Ash, Dipendra Misra

2024 ICLR

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu et al.

2024 ICLR

The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric

Daniel Severo, Lucas Theis, Jona Ballé

2024 ICLR

The Update-Equivalence Framework for Decision-Time Planning

Samuel Sokota, Gabriele Farina, David J Wu et al.

2024 ICLR

The Use of Modifiers and f0 in Remote Referential Communication with Human and Computer Partners

Iona Gessinger, Bistra Andreeva, Benjamin R. Cowan

2024 INTERSPEECH