Papers
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models
Artem Kirsanov, Chi-Ning Chou, Kyunghyun Cho et al.
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
Yifan Song, Guoyin Wang, Sujian Li et al.
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR
Injy Hamed, Thang Vu, Nizar Habash
The Impact of Dialect Variation on Robust Automatic Speech Recognition for Catalan
Zachary Hopton, Eleanor Chodroff
The Impact of Domain-Specific Terminology on Machine Translation for Finance in European Languages
Arturo Oncevay, Charese Smiley, Xiaomo Liu
The Impact of Inference Acceleration on Bias of LLMs
Elisabeth Kirsten, Ivan Habernal, Vedant Nanda et al.
The Impact of Visual Information in Chinese Characters: Evaluating Large Models’ Ability to Recognize and Utilize Radicals
Xiaofeng Wu, Karl Stratos, Wei Xu
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units
Badr AlKhamissi, Greta Tuckute, Antoine Bosselut et al.
The Power of Bullet Lists: A Simple Yet Effective Prompting Approach to Enhancing Spatial Reasoning in Large Language Models
Ikhyun Cho, Changyeon Park, Julia Hockenmaier
The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
Longju Bai, Angana Borah, Oana Ignat et al.
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection
Tomáš Horych, Christoph Mandl, Terry Ruas et al.
The Role of Prosody in Spoken Question Answering
Jie Chi, Maureen de Seyssel, Natalie Schluter
The Russian-focused embedders’ exploration: ruMTEB benchmark and Russian embedding model design
Artem Snegirev, Maria Tikhonova, Maksimova Anna et al.
The State and Fate of Summarization Datasets: A Survey
Noam Dahan, Gabriel Stanovsky
The Stochastic Parrot on LLM’s Shoulder: A Summative Assessment of Physical Concept Understanding
Mo Yu, Lemao Liu, Junjie Wu et al.
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)
Abhijit Mishra, Shreya Shukla, Jose Torres et al.
ThoughtSculpt: Reasoning with Intermediate Revision and Search
Yizhou Chi, Kevin Yang, Dan Klein
THREAD: Thinking Deeper with Recursive Spawning
Philip Schroeder, Nathaniel W. Morgan, Hongyin Luo et al.
Threefold model for AI Readiness: A Case Study with Finnish Healthcare SMEs
Mohammed Alnajjar, Khalid Alnajjar, Mika Hämäläinen
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs
Jiancheng Dong, Lei Jiang, Wei Jin et al.
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images
Elisei Rykov, Kseniia Petrushina, Kseniia Titova et al.