Papers
The Art of Saying "Maybe": A Conformal Lens for Uncertainty Benchmarking in VLMs
Asif Azad, Mohammad Sadat Hossain, MD Sadik Hossain Shanto et al.
The Automatic Verification of Image-Text Claims (AVerImaTeC) Shared Task
Rui Cao, Yulong Chen, Zhenyun Deng et al.
The Avengers: A Routing Recipe for Collective Intelligence in Language Models
Yiqun Zhang, Hao Li, Chenxu Wang et al.
The Bidirectional Process Reward Model
Lingyin Zhang, Jun Gao, Xiaoxue Ren et al.
The Bitter Lesson of Diffusion Language Models for Agentic Workflows: A Comprehensive Reality Check
Qingyu Lu, Liang Ding, Kanjian Zhang et al.
The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents
Weihao Xuan, Qingcheng Zeng, Heli Qi et al.
The Confidence Trap: Gender Bias and Predictive Certainty in LLMs
Ahmed Sabir, Markus Kängsepp, Rajesh Sharma
The Confident Liar: Diagnosing Multi-Agent Debate with Log-Probabilities and LLM-as-Judge
Ali Keramati, Justin Cheok, Jacob Horne et al.
The Correlation Between Emotion in Text and Speech Segments is Limited: A Cross-Modal Study
David Lindevelt, Suzan Verberne, Joost Broekens
The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic
Bernardo Cuenca Grau, Eva Feng, Przemysław Andrzej Wałęga
The Cost and Complexity of Minimizing Envy in House Allocations (Abstract Reprint)
Jayakrishnan Madathil, Neeldhara Misra, Aditi Sethia
The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models
Taewhoo Lee, Minju Song, Chanwoong Yoon et al.
The Curse of Verbalization: How Presentation Order Constrains LLM Reasoning
Yue Zhou, Henry Peng Zou, Barbara Di Eugenio et al.
The Data Frontier for Large Language Models: Selection, Synthesis, and Tools
Lijun Wu, Wentao Zhang, Conghui He
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Mingkai Tian, Guorong Li, Yuankai Qi et al.
The Digital Dunning-Kruger Effect: Decoupling Hallucinations via Geometric Hidden-state Observation for Semantic Truthfulness
Yueheng Mao, Min Yu, Gengwang Li et al.
The Doctor Will Agree With You Now: Sycophancy of Large Language Models in Multi-Turn Medical Conversations
Taeil Matthew Kim, Luyang Luo, Sung Eun Kim et al.
The Dog the Cat Chased Stumped the Model: Measuring When Language Models Abandon Structure for Shortcuts
Sangmitra Madhusudan, Kaige Chen, Ali Emami
The Dominance of Text Space: Unveiling the Asymmetric Nature of Cross-Modal Alignment in Large Language Models
Linqing Chen, Hanmeng Zhong, Wentao Wu et al.
The Emotional Baby Is Truly Deadly: Does Your Multimodal Large Reasoning Model Have Emotional Flattery Towards Humans?
Yuan Xun, Xiaojun Jia, Xinwei Liu et al.
The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods
Arpit Singh Gautam, Kailash Talreja, Saurabh Jha
The Essentials of AI for Life and Society: A Full-Scale AI Literacy Course Accessible to All
Zifan Xu, Kristen Procko, Michael Munje et al.
The Evolution of Thought: Tracking LLM Overthinking via Reasoning Dynamics Analysis
Zihao Wei, Liang Pang, Jiahao Liu et al.
The Finer the Better: Towards Granular-aware Open-set Domain Generalization
Yunyun Wang, Zheng Duan, Xinyue Liao et al.