Papers
PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking
Jian Luo, Xuanang Chen, Ben He et al.
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
Qineng Wang, Zihao Wang, Ying Su et al.
Soft Knowledge Prompt: Help External Knowledge Become a Better Teacher to Instruct LLM in Knowledge-based VQA
Qunbo Wang, Ruyi Ji, Tianhao Peng et al.
The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities
David Stap, Eva Hasler, Bill Byrne et al.
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
Justin Chen, Swarnadeep Saha, Mohit Bansal
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents
Yifan Song, Da Yin, Xiang Yue et al.
MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs
Yavuz Faruk Bakman, Duygu Nur Yaldiz, Baturalp Buyukates et al.
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
Chuyi Kong, Yaxin Fan, Xiang Wan et al.
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang, Binyuan Hui, Min Yang et al.
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time
Yilong Chen, Guoxia Wang, Junyuan Shang et al.
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
Prince Jha, Raghav Jain, Konika Mandal et al.
Bypassing LLM Watermarks with Color-Aware Substitutions
Qilong Wu, Varun Chandrasekaran
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Haoxiang Wang, Yong Lin, Wei Xiong et al.
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction
Zixuan Li, Yutao Zeng, Yuxin Zuo et al.
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Le Zhuo, Zewen Chi, Minghao Xu et al.
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Peiyi Wang, Lei Li, Zhihong Shao et al.
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Jun Zhan, Junqi Dai, Jiasheng Ye et al.
BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
Yifei Wang, Dizhan Xue, Shengjie Zhang et al.
MERA: A Comprehensive LLM Evaluation in Russian
Alena Fenogenova, Artem Chervyakov, Nikita Martynov et al.
POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation
Shilong Pan, Zhiliang Tian, Liang Ding et al.
Quantifying the Persona Effect in LLM Simulations
Tiancheng Hu, Nigel Collier
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Nishant Balepur, Abhilasha Ravichander, Rachel Rudinger
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Charlotte Siska, Katerina Marazopoulou, Melissa Ailem et al.
Bridging the Preference Gap between Retrievers and LLMs
Zixuan Ke, Weize Kong, Cheng Li et al.