Papers
The Hidden Language of Diffusion Models
Hila Chefer, Oran Lang, Mor Geva et al.
The Human-AI Substitution game: active learning from a strategic labeler
Tom Yan, Chicheng Zhang
The importance of feature preprocessing for differentially private linear optimization
Ziteng Sun, Ananda Theertha Suresh, Aditya Krishna Menon
The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model
Daniel Goldfarb, Itay Evron, Nir Weinberger et al.
The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing
Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.
The LLM Surgeon
Tycho F. A. van der Ouderaa, Markus Nagel, Mart Van Baalen et al.
The Marginal Value of Momentum for Small Learning Rate SGD
Runzhe Wang, Sadhika Malladi, Tianhao Wang et al.
The Need for Speed: Pruning Transformers with One Recipe
Samir Khaki, Konstantinos N Plataniotis
The optimality of kernel classifiers in Sobolev space
Jianfa Lai, Zhifan Li, Dongming Huang et al.
Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach
Shaopeng Fu, Di Wang
Theoretical Understanding of Learning from Adversarial Perturbations
Soichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki
The Reasonableness Behind Unreasonable Translation Capability of Large Language Model
Tingchen Fu, Lemao Liu, Deng Cai et al.
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
The Trickle-down Impact of Reward Inconsistency on RLHF
Lingfeng Shen, Sihao Chen, Linfeng Song et al.
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pratyusha Sharma, Jordan T. Ash, Dipendra Misra
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu et al.
The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric
Daniel Severo, Lucas Theis, Jona Ballé
The Update-Equivalence Framework for Decision-Time Planning
Samuel Sokota, Gabriele Farina, David J Wu et al.
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Raphaël Avalos, Florent Delgrange, Ann Nowe et al.
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal, Ziwei Ji, Ankit Singh Rawat et al.
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph
Jiashuo Sun, Chengjin Xu, Lumingyuan Tang et al.
Thin-Shell Object Manipulations With Differentiable Physics Simulations
Yian Wang, Juntian Zheng, Zhehuan Chen et al.
Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models
Junchi Yu, Ran He, Zhitao Ying
Threaten Spiking Neural Networks through Combining Rate and Temporal Information
Zecheng Hao, Tong Bu, Xinyu Shi et al.