Papers
The Effect of Model Capacity and Script Diversity on Subword Tokenization for Sorani Kurdish
Ali Salehi, Cassandra L. Jacobs
The Elephant in the Room: Ten Challenges of Computational Detection of Rhetorical Figures
Ramona Kühn, Jelena Mitrović
The Emergence of High-Level Semantics in a Signaling Game
Timothée Bernard, Timothee Mickus, Hiroya Takamura
The Impact of Depth on Compositional Generalization in Transformer Language Models
Jackson Petty, Sjoerd Steenkiste, Ishita Dasgupta et al.
The Impact of Differential Privacy on Group Disparity Mitigation
Victor Hansen, Atula Neerkaje, Ramit Sawhney et al.
The Impact of Language on Arithmetic Proficiency: A Multilingual Investigation with Cross-Agent Checking Computation
Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi et al.
The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing
Muzhi Li, Minda Hu, Irwin King et al.
The Mexican Gayze: A Computational Analysis of the Attitudes towards the LGBT+ Population in Mexico on Social Media Across a Decade
Scott Andersen, Segio-Luis Ojeda-Trueba, Juan Vásquez et al.
The Paradox of Preference: A Study on LLM Alignment Algorithms and Data Acquisition Methods
Rishikesh Devanathan, Varun Nathan, Ayush Kumar
The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels
Eve Fleisig, Su Lin Blodgett, Dan Klein et al.
The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education
Paiheng Xu, Jing Liu, Nathan Jones et al.
The Role of Adverbs in Language Variety Identification: The Case of Portuguese Multi-Word Adverbs
Izabela Müller, Nuno Mamede, Jorge Baptista
The Role of n-gram Smoothing in the Age of Neural Networks
Luca Malagutti, Andrius Buinovskij, Anej Svete et al.
The steerability of large language models toward data-driven personas
Junyi Li, Charith Peris, Ninareh Mehrabi et al.
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language
Jian Zhu, Changbing Yang, Farhan Samir et al.
The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification
Minh Duc Bui, Katharina Von Der Wense
The Uli Dataset: An Exercise in Experience Led Annotation of oGBV
Arnav Arora, Maha Jinadoss, Cheshta Arora et al.
The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation
Evgeniia Tokarchuk, Vlad Niculae
The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics
Nikolay Bogoychev, Pinzhen Chen, Barry Haddow et al.
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks
Yunqi Zhang, Songda Li, Chunyuan Deng et al.