Papers
The Challenges of Creating a Parallel Multilingual Hate Speech Corpus: An Exploration
Katerina Korre, Arianna Muti, Alberto Barrón-Cedeño
The Constant in HATE: Toxicity in Reddit across Topics and Languages
Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek T.J.M. Vossen
The Contextual Variability of English Nouns: The Impact of Categorical Specificity beyond Conceptual Concreteness
Giulia Rambelli, Marianna Bolognesi
The Corpus AIKIA: Using Ranking Annotation for Offensive Language Detection in Modern Greek
Stella Markantonatou, Vivian Stamou, Christina Christodoulou et al.
The dbpedia R Package: An Integrated Workflow for Entity Linking (for ParlaMint Corpora)
Christoph Leonhardt, Andreas Blaette
The Distracted Ear: How Listeners Shape Conversational Dynamics
Auriane Boudin, Stéphane Rauzy, Roxane Bertrand et al.
The EASIER Mobile Application and Avatar End-User Evaluation Methodology
Frankie Picron, Davy Van Landuyt, Rehana Omardeen et al.
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
Maja Pavlovic, Massimo Poesio
The Effects of Pretraining in Video-Guided Machine Translation
Ammon Shurtz, Lawry Sorenson, Stephen D. Richardson
The ELCo Dataset: Bridging Emoji and Lexical Composition
Zi Yun Yang, Ziqing Zhang, Yisong Miao
The Emergence of Semantic Units in Massively Multilingual Models
Andrea Gregor de Varda, Marco Marelli
The Ethical Question – Use of Indigenous Corpora for Large Language Models
Linda Wiechetek, Flammie Pirinen, Maja Lisa Kappfjell et al.
The Extraction and Fine-grained Classification of Written Cantonese Materials through Linguistic Feature Detection
Chaak-ming Lau, Mingfei Lau, Ann Wai Huen To
The First Parallel Corpus and Neural Machine Translation Model of Western Armenian and English
Ari Nubar Boyacıoğlu, Jan Niehues
The First Universal Dependency Treebank for Tswana: Tswana-Popapolelo
Tanja Gaustad, Ansu Berg, Rigardt Pretorius et al.
The IgboAPI Dataset: Empowering Igbo Language Technologies through Multi-dialectal Enrichment
Chris Chinenye Emezue, Ifeoma Okoh, Chinedu Emmanuel Mbonu et al.
The Impact of Digital Editing on the Study of Holocaust Survivors’ Testimonies in the context of Voci dall’Inferno Project
Angelo Mario Del Grosso, Marina Riccucci, Elvira Mercatanti
The Impact of Stance Object Type on the Quality of Stance Detection
Maxwell A. Weinzierl, Sanda M. Harabagiu
The Influence of Automatic Speech Recognition on Linguistic Features and Automatic Alzheimer’s Disease Detection from Spontaneous Speech
Jonathan Heitz, Gerold Schneider, Nicolas Langer
The Key Points: Using Feature Importance to Identify Shortcomings in Sign Language Recognition Models
Ruth M. Holmes, Ellen Rushe, Anthony Ventresque
The Low Saxon LSDC Dataset at Universal Dependencies
Janine Siewert, Jack Rueter
The MEET Corpus: Collocated, Distant and Hybrid Three-party Meetings with a Ranking Task
Ghazaleh Esfandiari-Baiat, Jens Edlund
The Mental Lexicon of Communicative Fragments and Contours: The Remix N-gram Method
Emese K. Molnár, Andrea Dömötör
The MOLOR Lemma Bank: a New LLOD Resource for Old Irish
Theodorus Fransen, Cormac Anderson, Sacha Beniamine et al.