Papers
Referential ambiguity and clarification requests: comparing human and LLM behaviour
Chris Madge, Matthew Purver, Massimo Poesio
Mention detection with LLMs in pair-programming dialogue
Cecilia Domingo, Paul Piwek, Svetlana Stoyanchev et al.
Findings of the Fourth Shared Task on Multilingual Coreference Resolution: Can LLMs Dethrone Traditional Approaches?
Michal Novák, Miloslav Konopik, Anna Nedoluzhko et al.
LLM as a Guide: an Approach for Unsupervised Economic Relation Discovery in Administrative Documents
Thomas Sebbag, Solen Quiniou, Emmanuel Morin
Zero-Shot Extraction of Stock Relationship Graphs with LLMs
Hao Zhou, Luis Felipe Costa Sperb, Tiejun Ma
A Self-Improving Method for Generating Descriptions of Financial Data Quality Grading Using LLMs
Yang Zhao, Yohei Ikawa, Bishwaranjan Bhattacharjee
From Earnings Calls to Investment Reports: Evaluating Role-based Multi-Agent LLM Systems
Ranjan Satapathy, Raphael Liew, Joyjit Chattorj et al.
Rethinking Search: A Study of University Students’ Perspectives on Using LLMs and Traditional Search Engines in Academic Problem Solving
Md. Faiyaz Abdullah Sayeedi, Md. Sadman Haque, Zobaer Ibn Razzaque et al.
Culturally-Aware Conversations: A Framework & Benchmark for LLMs
Shreya Havaldar, Young Min Cho, Sunny Rai et al.
MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf
Lingxiang Hu, Shurun Yuan, Xiaoting Qin et al.
Dialogue Acts as a Lens on Human–LLM Interaction: Analyzing Conversational Norms in Model-Generated Responses
Arunima Maitra, Dorothea French, Katharina von der Wense
Syntactic Blind Spots: How Misalignment Leads to LLMs’ Mathematical Errors
Dane A Williamson, Yangfeng Ji, Matthew B. Dwyer
BanglaMATH: A Bangla benchmark dataset for testing LLM mathematical reasoning at grades 6, 7, and 8
Tabia Tanzin Prama, Christopher M. Danforth, Peter Dodds
Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean
Mark Obozov, Michael Diskin, Aleksandr Beznosikov et al.
CoCo-CoLa: Evaluating and Improving Language Adherence in Multilingual LLMs
Elnaz Rahmati, Alireza Salkhordeh Ziabari, Morteza Dehghani
Unlocking LLM Safeguards for Low-Resource Languages via Reasoning and Alignment with Minimal Training Data
Zhuowei Chen, Bowei Zhang, Nankai Lin et al.
The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs
Lucas Bandarkar, Nanyun Peng
Reassessing Speech Translation for Low-Resource Languages: Do LLMs Redefine the State-of-the-Art Against Cascaded Models?
Jonah Dauvet, Min Ma, Jessica Ojo et al.
TenseLoC: Tense Localization and Control in a Multilingual LLM
Ariun-Erdene Tumurchuluun, Yusser Al Ghussin, David Mareček et al.
Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework
Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger
Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization
Eunjung Cho, Alexander Hoyle, Yoan Hermstrüwer
Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification
M. Mikail Demir, M Abdullah Canbaz