Papers

2,781 papers found

Persistent Pre-training Poisoning of LLMs

Yiming Zhang, Javier Rando, Ivan Evtimov et al.

2025 ICLR

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts

Guorui Zheng, Xidong Wang, Juhao Liang et al.

2025 ICLR

Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors

Weixuan Wang, JINGYUAN YANG, Wei Peng

2025 ICLR

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

Mingyang Chen, sunhaoze, Tianpeng Li et al.

2025 ICLR

Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?

Sravanti Addepalli, Yerram Varun, Arun Suggala et al.

2025 ICLR

Density estimation with LLMs: a geometric investigation of in-context learning trajectories

Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati et al.

2025 ICLR

Can Watermarked LLMs be Identified by Users via Crafted Prompts?

Aiwei Liu, Sheng Guan, Yiming Liu et al.

2025 ICLR

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Zhuoqun Li, Xuanang Chen, Haiyang Yu et al.

2025 ICLR

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation

Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu et al.

2025 ICLR

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Chankyu Lee, Rajarshi Roy, Mengyao Xu et al.

2025 ICLR

Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning

Shengyuan Hu, Yiwei Fu, Steven Wu et al.

2025 ICLR

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Peng Xu, Wei Ping, Xianchao Wu et al.

2025 ICLR

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

Changle Qu, Sunhao Dai, Xiaochi Wei et al.

2025 ICLR

Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension ability

Yujin Han, Lei Xu, Sirui Chen et al.

2025 ICLR

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity

Wentao Guo, Jikai Long, Yimeng Zeng et al.

2025 ICLR

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Sheryl Hsu, Omar Khattab, Chelsea Finn et al.

2025 ICLR

Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration

Qintong Li, Jiahui Gao, Sheng Wang et al.

2025 ICLR

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

XIANGYU PENG, Congying Xia, Xinyi Yang et al.

2025 ICLR

Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning

Zenan Li, Zhaoyu Li, Wen Tang et al.

2025 ICLR

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Xiangyu Qi, Boyi Wei, Nicholas Carlini et al.

2025 ICLR

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer

Yanjun Zhao, Sizhe Dang, Haishan Ye et al.

2025 ICLR

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Zhangchen Xu, Fengqing Jiang, Luyao Niu et al.

2025 ICLR

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling

Zhicheng Yang, Yiwei Wang, Yinya Huang et al.

2025 ICLR

Transformer Block Coupling and its Correlation with Generalization in LLMs

Murdock Aubry, Haoming Meng, Anton Sugolov et al.

2025 ICLR

Tamper-Resistant Safeguards for Open-Weight LLMs

Rishub Tamirisa, Bhrugu Bharathi, Long Phan et al.

2025 ICLR