Papers

79 papers found
Scaling Laws for Associative Memories
Vivien Cabannes, Elvis Dohmatob, Alberto Bietti
2024 ICLR
Scaling Laws of RoPE-based Extrapolation
Xiaoran Liu, Hang Yan, Chenxin An et al.
2024 ICLR
Scaling Laws for Sparsely-Connected Foundation Models
Elias Frantar, Carlos Riquelme Ruiz, Neil Houlsby et al.
2024 ICLR
Scaling Laws for Precision
Tanishq Kumar, Zachary Ankner, Benjamin Frederick Spector et al.
2025 ICLR
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao, Chao-Han Huck Yang, Renhe Jiang et al.
2025 ICLR
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
Muhammed Emrullah Ildiz, Halil Alperen Gozeten, Ege Onur Taga et al.
2025 ICLR
Data Scaling Laws in Imitation Learning for Robotic Manipulation
Fanqi Lin, Yingdong Hu, Pingyue Sheng et al.
2025 ICLR
A Solvable Attention for Neural Scaling Laws
Bochen Lyu, Di Wang, Zhanxing Zhu
2025 ICLR
2025 ICLR
Scaling Laws for Downstream Task Performance in Machine Translation
Berivan Isik, Natalia Ponomareva, Hussein Hazimeh et al.
2025 ICLR
How Much is a Noisy Image Worth? Data Scaling Laws for Ambient Diffusion.
Giannis Daras, Yeshwanth Cherapanamjeri, Constantinos Costis Daskalakis
2025 ICLR
How Feature Learning Can Improve Neural Scaling Laws
Blake Bordelon, Alexander Atanasov, Cengiz Pehlevan
2025 ICLR
Breaking Neural Network Scaling Laws with Modularity
Akhilan Boopathy, Sunshine Jiang, William Yue et al.
2025 ICLR
Data Scaling Laws in NMT: The Effect of Noise and Architecture
Yamini Bansal, Behrooz Ghorbani, Ankush Garg et al.
2022 ICML
Unified Scaling Laws for Routed Language Models
Aidan Clark, Diego De Las Casas, Aurelia Guy et al.
2022 ICML
Scaling Laws for Generative Mixed-Modal Language Models
Armen Aghajanyan, Lili Yu, Alexis Conneau et al.
2023 ICML
2023 ICML
Scaling Laws for Multilingual Neural Machine Translation
Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia et al.
2023 ICML
Scaling Laws for Reward Model Overoptimization
Leo Gao, John Schulman, Jacob Hilton
2023 ICML
TAN Without a Burn: Scaling Laws of DP-SGD
Tom Sander, Pierre Stock, Alexandre Sablayrolles
2023 ICML