Papers
Is It JUST Semantics? A Case Study of Discourse Particle Understanding in LLMs
William Sheffield, Kanishka Misra, Valentina Pyatkin et al.
Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?
Seok Hwan Song, Mohna Chakraborty, Qi Li et al.
Is linguistically-motivated data augmentation worth it?
Ray Groshan, Michael Ginn, Alexis Palmer
Is LLM an Overconfident Judge? Unveiling the Capabilities of LLMs in Detecting Offensive Language with Annotation Disagreement
Junyu Lu, Kai Ma, Kaichun Wang et al.
Is Partial Linguistic Information Sufficient for Discourse Connective Disambiguation? A Case Study of Concession
Takuma Sato, Ai Kubota, Koji Mineshima
I Speak for the Árboles: Developing a Dependency Treebank for Spanish L2 and Heritage Speakers
Emiliana Pulido, Robert Pugh, Zoey Liu
ISR: Self-Refining Referring Expressions for Entity Grounding
Zhuocheng Yu, Bingchan Zhao, Yifan Song et al.
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering
William Jurayj, Jeffrey Cheng, Benjamin Van Durme
iTBLS: A Dataset of Interactive Conversations Over Tabular Information
Anirudh Sundar, Christopher Richardson, Adar Avsian et al.
Iterative Repair with Weak Verifiers for Few-shot Transfer in KBQA with Unanswerability
Riya Sawhney, Samrat Yadav, Indrajit Bhattacharya et al.
It’s Not a Walk in the Park! Challenges of Idiom Translation in Speech-to-text Systems
Iuliia Zaitova, Badr M. Abdullah, Wei Xue et al.
It’s Not Bragging If You Can Back It Up: Can LLMs Understand Braggings?
Jingjie Zeng, Huayang Li, Liang Yang et al.
ITUNLP at SemEval-2025 Task 8: Question-Answering over Tabular Data: A Zero-Shot Approach using LLM-Driven Code Generation
Atakan Site, Emre Erdemir, Gülşen Eryiğit
“I understand your perspective”: LLM Persuasion through the Lens of Communicative Action Theory
Esra Dönmez, Agnieszka Falenska
IUST_Champs at SemEval-2025 Task 8: Structured Prompting and Retry Policy for Tabular Question Answering
Arshia Hossein Zadeh, Aysa Mayahinia, Nafiseh Ahmadi
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
Hongcheng Guo, Wei Zhang, Junhao Chen et al.
IWSLT 2025 Indic Track System Description Paper: Speech-to-Text Translation from Low-Resource Indian Languages (Bengali and Tamil) to English
Sayan Das, Soham Chaudhuri, Dipanjan Saha et al.
Jailbreaking? One Step Is Enough!
Weixiong Zheng, Peijian Zeng, YiWei Li et al.
Jailbreak Large Vision-Language Models Through Multi-Modal Linkage
Yu Wang, Xiaofei Zhou, Yichen Wang et al.
JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
Junjie Chu, Yugeng Liu, Ziqing Yang et al.
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Muyao Li, Zihao Wang, Kaichen He et al.
JBBQ: Japanese Bias Benchmark for Analyzing Social Biases in Large Language Models
Hitomi Yanaka, Namgi Han, Ryoma Kumon et al.