Papers
Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors
Zhiyu Yang, Shuo Wang, Yukun Yan et al.
Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce
Arnav Attri, Anuj Attri, Suman Banerjee et al.
WiC Evaluation in Galician and Spanish: Effects of Dataset Quality and Composition
Marta Vázquez Abuín, Marcos Garcia
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
An-Lan Wang, Jingqun Tang, Lei Liao et al.
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning
Gagan Mundada, Yash Vishe, Amit Namburi et al.
Will Annotators Disagree? Identifying Subjectivity in Value-Laden Arguments
Amir Homayounirad, Enrico Liscio, Tong Wang et al.
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA
Sergey Pletenev, Maria Marina, Nikolay Ivanov et al.
WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification
Yiwen Jiang, Deval Mehta, Siyuan Yan et al.
WMT 2025 CreoleMT Systems Description : Martinican Creole and French
Ludovic Mompelat
WojoodOntology: Ontology-Driven LLM Prompting for Unified Information Extraction Tasks
Alaa Aljabari, Nagham Hamad, Mohammed Khalilia et al.
WojoodRelations: Arabic Relation Extraction Corpus and Modeling
Alaa Aljabari, Mohammed Khalilia, Mustafa Jarrar
Women, Infamous, and Exotic Beings: A Comparative Study of Honorific Usages in Wikipedia and LLMs for Bengali and Hindi
Sourabrata Mukherjee, Atharva Mehta, Sougata Saha et al.
Word Clouds as Common Voices: LLM-Assisted Visualization of Participant-Weighted Themes in Qualitative Interviews
Joseph T Colonel, Baihan Lin
Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
Wenya Xie, Shaochen Zhong, Hoang Anh Duy Le et al.
Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication
Jocelyn J Shen, Akhila Yerukola, Xuhui Zhou et al.
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
Ernesto Luis Estevanell Valladares, Suilan Estevez-Velarde, Yoan Gutierrez et al.
X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Jailbreak Attacks without Compromising Usability
Xiaoya Lu, Dongrui Liu, Yi Yu et al.
xCoRe: Cross-context Coreference Resolution
Giuliano Martinelli, Bruno Gatti, Roberto Navigli
X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning
Prasanna Reddy Pulakurthi, Jiamian Wang, Majid Rabbani et al.
X-FLoRA: Cross-modal Federated Learning with Modality-expert LoRA for Medical VQA
Min Hyuk Kim, Changheon Kim, Seok Bong Yoo
X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding
Wenqi Zhou, Kai Cao, Hao Zheng et al.
XLQA: A Benchmark for Locale-Aware Multilingual Open-Domain Question Answering
Keonwoo Roh, Yeong-Joon Ju, Seong-Whan Lee
XL-Suite: Cross-Lingual Synthetic Training and Evaluation Data for Open-Ended Generation
Vivek Iyer, Pinzhen Chen, Ricardo Rei et al.
XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression
Haoqi Yang, Yao Yao, Zuchao Li et al.
XRAG: Cross-lingual Retrieval-Augmented Generation
Wei Liu, Sony Trenous, Leonardo F. R. Ribeiro et al.