Papers
5,479 papers found
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs
Junpeng Ma, Qizhe Zhang, Ming Lu et al.
Prototype Entropy Alignment: Reinforcing Structured Uncertainty in LLM Reasoning
Zhengyuan Pan, Yanhao Chen, Zhongquan Jian et al.
What Makes a Good Generated Image? Investigating Human and Multimodal LLM Image Preference Alignment
Rishab Parthasarathy, Jasmine Collins, Cory Stephenson
Online Multi-LLM Selection via Contextual Bandits Under Unstructured Context Evolution
Manhin Poon, Xiangxiang Dai, Xutong Liu et al.
Next Generation Active Learning: Mixture of LLMs in the Loop
Yuanyuan Qi, Xiaohao Yang, Jueqing Lu et al.
BitDP: Ultra-low-bit Communication for Data Parallelism in LLM Training
Xiaozhe Ren, Qiong Luo
A Solver-in-the-Loop Framework for Improving LLMs on Answer Set Programming for Logic Puzzle Solving
Timo Pierre Schrader, Lukas Lange, Tobias Kaminski et al.
Low-Rank Curvature for Zeroth-Order Optimization in LLM Fine-tuning
Hyunseok Seung, Jaewoo Lee, Hyunsuk Ko
URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
Yongxin Shi, Jiapeng Wang, Zeyu Shan et al.
Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
Wenwen Si, Sooyong Jang, Insup Lee et al.
Learning to Collaborate: An Orchestrated-Decentralized Framework for Peer-to-Peer LLM Federation
Inderjeet Singh, Eleonore Vissol-Gaudin, Andikan Otung et al.
DAWN: Distributed LLM Multi-Agent Workflow Synthesis
Guancheng Wan, Mo Zhou, Ziyi Wang et al.
TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents
Dawei Wang, Chengming Zhou, Di Zhao et al.
MemeBQ:Memory Efficient Binary Quantization of LLMs
Yuanhui Wang, Kunlong Liu, Minnan Pei et al.
Making Sense of LLM Decisions: A Prototype-based Framework for Explainable Classification
Bowen Wei, Mehrdad Fazli, Ziwei Zhu
Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping
Tao Yu, Yongqi An, Kuan Zhu et al.
Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving
Hui Zeng, Daming Zhao, Pengfei Yang et al.
PRISM: Privacy-Aware Routing for Adaptive Cloud–Edge LLM Inference via Semantic Sketch Collaboration
Junfei Zhan, Haoxun Shen, Zheng Lin et al.
SpecQuant: Spectral Decomposition and Adaptive Truncation for Ultra-Low-Bit LLMs Quantization
Zhixiong Zhao, Fangxin Liu, Junjie Wang et al.
iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference
Wei Fan, JinYi Yoon, Bo Ji
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
Yuanhao Li, Mingshan Liu, Hongbo Wang et al.
From Text to Simulation: A Multi-Agent LLM Workflow for Automated Chemical Process Design
Xufei Tian, Wenli Du, Shaoyi Yang et al.
HiveMind: Contribution-Guided Online Prompt Optimization of LLM Multi-Agent Systems
Yihan Xia, Taotao Wang, Shengli Zhang et al.
RECoRD: A Multi-Agent LLM Framework for Reverse Engineering Codebase to Relational Diagram
Yuan Xue, Xiaoyu Lu, Yunfei Bai et al.