Papers
Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Expected Information Gain
Davide Mazzaccara, Alberto Testoni, Raffaella Bernardi
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM Agents
Zengqing Wu, Run Peng, Shuyuan Zheng et al.
From Test-Taking to Test-Making: Examining LLM Authoring of Commonsense Assessment Items
Melissa Roemmele, Andrew Gordon
Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media
Nikhil Mehta, Dan Goldwasser
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring
Jiazheng Li, Hainiu Xu, Zhaoyue Sun et al.
Toolken+: Improving LLM Tool Usage with Reranking and a Reject Option
Konstantin Yakovlev, Sergey Nikolenko, Andrey Bout
Can LLMs Recognize Toxicity? A Structured Investigation Framework and Toxicity Metric
Hyukhun Koh, Dohyung Kim, Minwoo Lee et al.
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
Ehsan Doostmohammadi, Oskar Holmström, Marco Kuhlmann
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Ruotong Liao, Max Erler, Huiyu Wang et al.
CEAMC: Corpus and Empirical Study of Argument Analysis in Education via LLMs
Yupei Ren, Hongyi Wu, Zhaoguang Long et al.
LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
Sihui Yang, Keping Bi, Wanqing Cui et al.
Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas
Salvatore Giorgi, Tingting Liu, Ankit Aich et al.
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Jessica Maria Echterhoff, Fartash Faghri, Raviteja Vemulapalli et al.
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
Sihang Zhao, Youliang Yuan, Xiaoying Tang et al.
RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
Xijie Huang, Zechun Liu, Shih-Yang Liu et al.
Insights into LLM Long-Context Failures: When Transformers Know but Don’t Tell
Muhan Gao, TaiMing Lu, Kuai Yu et al.
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz, Kartik Mehta, Yu-Hsiang Lin et al.
Is Compound Aspect-Based Sentiment Analysis Addressed by LLMs?
Yinhao Bai, Zhixin Han, Yuhua Zhao et al.
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
Seyed Mahed Mousavi, Simone Alghisi, Giuseppe Riccardi
Exploring the Capability of Multimodal LLMs with Yonkoma Manga: The YManga Dataset and Its Challenging Tasks
Qi Yang, Jingjie Zeng, Liang Yang et al.
PURE: Aligning LLM via Pluggable Query Reformulation for Enhanced Helpfulness
Wenjin Yao, Yidong Wang, Zhuohao Yu et al.
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMs
Minsang Kim, Cheoneum Park, Seung Jun Baek
LoRAExit: Empowering Dynamic Modulation of LLMs in Resource-limited Settings using Low-rank Adapters
Jiacheng Liu, Peng Tang, Xiaofeng Hou et al.
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions
Angana Borah, Rada Mihalcea
Do LLMs Think Fast and Slow? A Causal Study on Sentiment Analysis
Zhiheng Lyu, Zhijing Jin, Fernando Gonzalez Adauto et al.