Papers
Teaching Small Language Models to Learn Logic through Meta-Learning
Leonardo Bertolazzi, Manuel Vargas Guzmán, Raffaella Bernardi et al.
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities
Yuxuan Zhu, Antony Kellermann, Akul Gupta et al.
TechING: Towards Real World Technical Image Understanding via VLMs
Tafazzul Nadeem, Bhavik Shangari, Manish Rai et al.
TELLME: Test-Enhanced Learning for Language Model Enrichment
Minjun Kim, Inho Won, HyeonSeok Lim et al.
TeluguEval: A Comprehensive Benchmark for Evaluating LLM Capabilities in Telugu
Revanth Kumar Gundam, Radhika Mamidi
TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models
Carolin Holtermann, Nina Krebs, Anne Lauscher
Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish
Cedric Lothritz, Jordi Cabot, Laura Bernardy
Test-time Corpus Feedback: From Retrieval to RAG
Mandeep Rathee, Venktesh V, Sean MacAvaney et al.
Test-Time Scaling of Reasoning Models for Machine Translation
Zihao Li, Shaoxiong Ji, Jörg Tiedemann
Text Classification Under Class Distribution Shift: A Survey
Adriana Valentina Costache, Silviu-Florin Gheorghe, Eduard Poesina et al.
Text Filter Based on Automatically Acquired Vocabularies for Multilingual Machine Translation
Kenji Imamura, Masao Utiyama
TextMineX: Data, Evaluation Framework and Ontology-guided LLM Pipeline for Humanitarian Mine Action
Chenyue Zhou, Gürkan Solmaz, Flavio Cirillo et al.
Text-to-Text Automatic Story Generation: A Survey
Yuan Ma, Hanna Suominen, Patrik Haslum et al.
The AI Committee: A Multi-Agent Framework for Automated Validation and Remediation of Web-Sourced Data
Sunith Vallabhaneni, Thomas Berkane, Maimuna S. Majumder
The Anthropology of Food: How NLP can Help us Unravel the Food cultures of the World
Arij Riabi, Sougata Saha, Monojit Choudhury
The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs
Asif Azad, Mohammad Sadat Hossain, MD Sadik Hossain Shanto et al.
The Automatic Verification of Image-Text Claims (AVerImaTeC) Shared Task
Rui Cao, Yulong Chen, Zhenyun Deng et al.
The Correlation Between Emotion in Text and Speech Segments is Limited: A Cross-Modal Study
David Lindevelt, Suzan Verberne, Joost Broekens
The Curse of Verbalization: How Presentation Order Constrains LLM Reasoning
Yue Zhou, Henry Peng Zou, Barbara Di Eugenio et al.
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Mingkai Tian, Guorong Li, Yuankai Qi et al.
The Doctor Will Agree With You Now: Sycophancy of Large Language Models in Multi-Turn Medical Conversations
Taeil Matthew Kim, Luyang Luo, Sung Eun Kim et al.
The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts
Sangmitra Madhusudan, Kaige Chen, Ali Emami
The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods
Arpit Singh Gautam, Kailash Talreja, Saurabh Jha
The Hidden Bias: A Study on Explicit and Implicit Political Stereotypes in Large Language Models
Konrad Löhr, Shuzhou Yuan, Michael Färber