Papers
11,951 papers found
A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts
Suyu Ge, Xihui Lin, Yunan Zhang et al.
ALLaM: Large Language Models for Arabic and English
M Saiful Bari, Yazeed Alnumay, Norah A. Alzahrani et al.
Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits
Zihan Zhang, Xiangyang Ji, Yuan Zhou
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Junfeng Fang, Houcheng Jiang, Kun Wang et al.
Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models Trained on Corrupted Data
Asad Aali, Giannis Daras, Brett Levac et al.
A Meta-Learning Approach to Bayesian Causal Discovery
Anish Dhir, Matthew Ashman, James Requeima et al.
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series
Byoungwoo Park, Hyungi Lee, Juho Lee
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Zhaowei Zhang, Fengshuo Bai, Qizhi Chen et al.
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Kairong Luo, Haodong Wen, Shengding Hu et al.
A Multiscale Frequency Domain Causal Framework for Enhanced Pathological Analysis
Xiaoyu Cui, Weixing Chen, Jiandong Su
ANaGRAM: A Natural Gradient Relative to Adapted Model for efficient PINNs learning
Nilo Schwencke, Cyril Furtlehner
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit Topologies
Jian Gao, Weidong Cao, Junyi Yang et al.
Analysis of Linear Mode Connectivity via Permutation-Based Weight Matching: With Insights into Other Permutation Search Methods
Akira Ito, Masanori Yamada, Atsutoshi Kumagai
Analytic DAG Constraints for Differentiable DAG Learning
Zhen Zhang, Ignavier Ng, Dong Gong et al.
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models
Hulingxiao He, Geng Li, Zijun Geng et al.
Analyzing Neural Scaling Laws in Two-Layer Networks with Power-Law Data Spectra
Roman Worschech, Bernd Rosenow
An Asynchronous Bundle Method for Distributed Learning Problems
Daniel Cederberg, Xuyang Wu, Stephen P. Boyd et al.
An Auditing Test to Detect Behavioral Shift in Language Models
Leo Richter, Xuanli He, Pasquale Minervini et al.
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Christopher Rawles, Sarah Clinckemaillie, Yifan Chang et al.
An Effective Theory of Bias Amplification
Arjun Subramonian, Samuel Bell, Levent Sagun et al.
An Efficient Framework for Crediting Data Contributors of Diffusion Models
MingYu Lu, Chris Lin, Chanwoo Kim et al.
An Empirical Analysis of Uncertainty in Large Language Model Evaluations
Qiujie Xie, Qingqiu Li, Zhuohao Yu et al.
An Engorgio Prompt Makes Large Language Model Babble on
Jianshuo Dong, Ziyuan Zhang, Qingjie Zhang et al.
An Evolved Universal Transformer Memory
Edoardo Cetin, Qi Sun, Tianyu Zhao et al.