Papers
The Effects of Data Quality on Named Entity Recognition
Divya Bhadauria, Alejandro Sierra Múnera, Ralf Krestel
The Future of Web Data Mining: Insights from Multimodal and Code-based Extraction Methods
Evan Fellman, Jacob Tyo, Zachary Lipton
The Generative AI Paradox in Evaluation: “What It Can Solve, It May Not Evaluate”
Juhyun Oh, Eunsu Kim, Inha Cha et al.
The Impact of Integration Step on Integrated Gradients
Masahiro Makino, Yuya Asazuma, Shota Sasaki et al.
The Impact of Language Adapters in Cross-Lingual Transfer for NLU
Jenny Kunz, Oskar Holmström
The KIND Dataset: A Social Collaboration Approach for Nuanced Dialect Data Collection
Asma Yamani, Raghad Alziyady, Reem AlYami et al.
The Kronieken Corpus: an Annotated Collection of Dutch/Flemish Chronicles from 1500-1850
Theo Dekker, Erika Kuijpers, Alie Lassche et al.
The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks
Anders Giovanni Møller, Arianna Pera, Jacob Dalsgaard et al.
The Queen of England is not England’s Queen: On the Lack of Factual Coherency in PLMs
Paul Youssef, Jörg Schlötterer, Christin Seifert
Therapist Self-Disclosure as a Natural Language Processing Task
Natalie Shapira, Tal Alfi-Yogev
The Role of Data Curation in Image Captioning
Wenyan Li, Jonas Lotz, Chen Qiu et al.
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova et al.
Thesis Proposal: Detecting Agency Attribution
Igor Ryazanov, Johanna Björklund
Thesis Proposal: Detecting Empathy Using Multimodal Language Model
Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna et al.
The Typology of Ellipsis: A Corpus for Linguistic Analysis and Machine Learning Applications
Damir Cavar, Ludovic Mompelat, Muhammad Abdo
Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models
Lukáš Mikula, Michal Štefánik, Marek Petrovič et al.
Threat Behavior Textual Search by Attention Graph Isomorphism
Chanwoo Bae, Guanhong Tao, Zhuo Zhang et al.
Timeline Extraction from Decision Letters Using ChatGPT
Femke Bakker, Ruben Van Heusden, Maarten Marx
T is for Treu, but how do you pronounce that? Using C-LARA to create phonetic texts for Kanak languages
Pauline Welby, Fabrice Wacalie, Manny Rayner
TL;DR Progress: Multi-faceted Literature Exploration in Text Summarization
Shahbaz Syed, Khalid Al Khatib, Martin Potthast
Topic Bias in Emotion Classification
Maximilian Wegge, Roman Klinger
Topic-guided Example Selection for Domain Adaptation in LLM-based Machine Translation
Seth Aycock, Rachel Bawden
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Bolei Ma, Ercong Nie, Shuzhou Yuan et al.
Towards Better Inclusivity: A Diverse Tweet Corpus of English Varieties
Nhi Pham, Lachlan Pham, Adam Meyers