Papers

5,479 papers found

MagicPIG: LSH Sampling for Efficient LLM Generation

Zhuoming Chen, Ranajoy Sadhukhan, Zihao Ye et al.

2025 ICLR

Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs

Haowen Pan, Xiaozhi Wang, Yixin Cao et al.

2025 ICLR

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver

Zhenting Qi, Mingyuan MA, Jiahang Xu et al.

2025 ICLR

Is Factuality Enhancement a Free Lunch For LLMs? Better Factuality Can Lead to Worse Context-Faithfulness

Baolong Bi, Shenghua Liu, Yiwei Wang et al.

2025 ICLR

Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective

Zeyu Gan, Yong Liu

2025 ICLR

Learning to Contextualize Web Pages for Enhanced Decision Making by LLM Agents

Dongjun Lee, Juyong Lee, Kyuyoung Kim et al.

2025 ICLR

Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection

Ziqing Fan, Siyuan Du, Shengchao Hu et al.

2025 ICLR

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Sihang Li, Jin Huang, Jiaxi Zhuang et al.

2025 ICLR

Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View

Xuan Liu, Jie ZHANG, HaoYang Shang et al.

2025 ICLR

Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives

Wang Qinsi, Jinghan Ke, Masayoshi Tomizuka et al.

2025 ICLR

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Jixuan Leng, Chengsong Huang, Banghua Zhu et al.

2025 ICLR

A Benchmark for Semantic Sensitive Information in LLMs Outputs

Qingjie Zhang, Han Qiu, Di Wang et al.

2025 ICLR

PiCO: Peer Review in LLMs based on Consistency Optimization

Kun-Peng Ning, Shuo Yang, Yuyang Liu et al.

2025 ICLR

Uncovering Gaps in How Humans and LLMs Interpret Subjective Language

Erik Jones, Arjun Patrawala, Jacob Steinhardt

2025 ICLR

Ada-K Routing: Boosting the Efficiency of MoE-based LLMs

Tongtian Yue, Longteng Guo, Jie Cheng et al.

2025 ICLR

MallowsPO: Fine-Tune Your LLM with Preference Dispersions

Haoxian Chen, Hanyang Zhao, Henry Lam et al.

2025 ICLR

Catastrophic Failure of LLM Unlearning via Quantization

Zhiwei Zhang, Fali Wang, Xiaomin Li et al.

2025 ICLR

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Jihan Yao, Wenxuan Ding, Shangbin Feng et al.

2025 ICLR

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Guangxuan Xiao, Jiaming Tang, Jingwei Zuo et al.

2025 ICLR

Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs

Xuandong Zhao, Lei Li, Yu-Xiang Wang

2025 ICLR

Aligned LLMs Are Not Aligned Browser Agents

Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.

2025 ICLR

DS-LLM: Leveraging Dynamical Systems to Enhance Both Training and Inference of Large Language Models

Ruibing Song, Chuan Liu, Chunshu Wu et al.

2025 ICLR

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara et al.

2025 ICLR

Do LLMs ``know'' internally when they follow instructions?

Juyeon Heo, Christina Heinze-Deml, Oussama Elachqar et al.

2025 ICLR

Optimized Multi-Token Joint Decoding With Auxiliary Model for LLM Inference

Zongyue Qin, Ziniu Hu, Zifan He et al.

2025 ICLR