Papers

5,479 papers found
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
Souradip Chakraborty, Sujay Bhatt, Udari Madhushani Sehwag et al.
2025 ICLR
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour, David Harrison, Maxwell Horton et al.
2025 ICLR
ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration
Andrew Estornell, Jean-Francois Ton, Yuanshun Yao et al.
2025 ICLR
Can Watermarks be Used to Detect LLM IP Infringement For Free?
Zhengyue Zhao, Xiaogeng Liu, Somesh Jha et al.
2025 ICLR
Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors
Tianchun Wang, Yuanzhou Chen, Zichuan Liu et al.
2025 ICLR
Human-inspired Episodic Memory for Infinite Context LLMs
Zafeirios Fountas, Martin Benfeghoul, Adnan Oomerjee et al.
2025 ICLR
BingoGuard: LLM Content Moderation Tools with Risk Levels
Fan Yin, Philippe Laban, XIANGYU PENG et al.
2025 ICLR
SFS: Smarter Code Space Search improves LLM Inference Scaling
Jonathan Light, Yue Wu, Yiyou Sun et al.
2025 ICLR
2025 ICLR
LLMs Can Plan Only If We Tell Them
Bilgehan Sel, Ruoxi Jia, Ming Jin
2025 ICLR
Do LLM Agents Have Regret? A Case Study in Online Learning and Games
Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar et al.
2025 ICLR
Benchmarking LLMs' Judgments with No Gold Standard
Shengwei Xu, Yuxuan Lu, Grant Schoenebeck et al.
2025 ICLR
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
Bill Yuchen Lin, Yuntian Deng, Khyathi Chandu et al.
2025 ICLR
Decision Tree Induction Through LLMs via Semantically-Aware Evolution
Tennison Liu, Nicolas Huynh, Mihaela van der Schaar
2025 ICLR
R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
Zhenyu Zhang, Zechun Liu, Yuandong Tian et al.
2025 ICLR
MM-EMBED: UNIVERSAL MULTIMODAL RETRIEVAL WITH MULTIMODAL LLMS
Sheng-Chieh Lin, Chankyu Lee, Mohammad Shoeybi et al.
2025 ICLR
Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs
Nguyen Nhat Minh, Andrew Baker, Clement Neo et al.
2025 ICLR
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
Simran Kaur, Simon Park, Anirudh Goyal et al.
2025 ICLR
Better autoregressive regression with LLMs via regression-aware fine-tuning
Michal Lukasik, Zhao Meng, Harikrishna Narasimhan et al.
2025 ICLR
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
HyoJung Han, Akiko Eriguchi, Haoran Xu et al.
2025 ICLR