Papers
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
Samuel Lewis-Lim, Xingwei Tan, Zhixue Zhao et al.
Analysis of Automated Document Relevance Annotation for Information Retrieval in Oil and Gas Industry
João Vitor Mariano Correia, Murilo Missano Bell, João Vitor Robiatti Amorim et al.
Analyzing and Modeling LLM Response Lengths with Extreme Value Theory: Anchoring Effects and Hybrid Distributions
Liuxuan Jiao, Chen Gao, Yiqian Yang et al.
Analyzing Dialectical Biases in LLMs for Knowledge and Reasoning Benchmarks
Eileen Pan, Anna Seo Gyeong Choi, Maartje Ter Hoeve et al.
Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior
Manuel Couto, Marcos Fernández-Pichel, Mario Ezra Aragon et al.
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels
Junjie Ye, Yuming Yang, Yang Nan et al.
Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction
Huanxin Sheng, Xinyi Liu, Hangfeng He et al.
Analyzing values about gendered language reform in LLMs’ revisions
Jules Watson, Xi Wang, Raymond Liu et al.
Anatomy of a Feeling: Narrating Embodied Emotions via Large Vision-Language Models
Mohammad Saim, Phan Anh Duong, Cat Luong et al.
An Attention-Based Neural Translation System for English to Bodo
Subhash Wary, Birhang Borgoyary, Akher Ahmed et al.
AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity
Yu Zhang, Dong Guo, Fang Wu et al.
Anchoring-Guidance Fine-Tuning (AnGFT): Elevating Professional Response Quality in Role-Playing Conversational Agents
Qibin Li, Zhen Xu, Shengyuan Bai et al.
Anecdoctoring: Automated Red-Teaming Across Language and Place
Alejandro Cuevas, Saloni Dash, Bharat Kumar Nayak et al.
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations
Lingjun Zhao, Hal Daumé Iii
An Empirical Analysis of Machine Translation for Expanding Multilingual Benchmarks
Sara Rajaee, Rochelle Choenni, Ekaterina Shutova et al.
An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint
Yi Sun, Han Wang, Jiaqiang Li et al.
An Empirical Study of Position Bias in Modern Information Retrieval
Ziyang Zeng, Dun Zhang, Jiacheng Li et al.
An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation
Shubham Gandhi, Atharva Naik, Yiqing Xie et al.
An Evaluation Resource for Grounding Translation Errors
Sujin Chen, Kang Wang, Zixuan Zhou et al.
An Exploration of Knowledge Editing for Arabic
Basel Mousi, Nadir Durrani, Fahim Dalvi
Angular Dispersion Accelerates k-Nearest Neighbors Machine Translation
Evgeniia Tokarchuk, Sergey Troshin, Vlad Niculae
An Improved, Strong Baseline for Pre-Trained Large Language Models as Task-Oriented Dialogue Systems
Sebastian Steindl, André Kestler, Ulrich Schäfer et al.
An in-depth human study of the mathematical reasoning abilities in Large Language Models
Carolina Dias-Alexiou, Edison Marrese-Taylor, Yutaka Matsuo
An Interdisciplinary Approach to Human-Centered Machine Translation
Marine Carpuat, Omri Asscher, Kalika Bali et al.
An LLM-based Temporal-spatial Data Generation and Fusion Approach for Early Detection of Late Onset Alzheimer’s Disease (LOAD) Stagings Especially in Chinese and English-speaking Populations
Yang Han, Jacqueline C.k. Lam, Victor O.k. Li et al.