Papers
Team Curie at HSD-2Lang 2024: Hate Speech Detection in Turkish and Arabic Tweets using BERT-based models
Ehsan Barkhodar, Işık Topçu, Ali Hürriyetoğlu
TechWhiz@DravidianLangTech 2024: Fake News Detection Using Deep Learning Models
Madhumitha M, Kunguma M, Tejashri J et al.
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae et al.
Tewodros@DravidianLangTech 2024: Hate Speech Recognition in Telugu Codemixed Text
Tewodros Achamaleh, Lemlem Kawo, Ildar Batyrshini et al.
TextBI: An Interactive Dashboard for Visualizing Multidimensional NLP Annotations in Social Media Data
Maxime Masson, Christian Sallaberry, Marie-Noelle Bessagnet et al.
Text-Guided Image Clustering
Andreas Stephan, Lukas Miklautz, Kevin Sidak et al.
Text or Image? What is More Important in Cross-Domain Generalization Capabilities of Hate Meme Detection Models?
Piush Aggarwal, Jawar Mehrabanian, Weigang Huang et al.
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou, Guchun Zhang, Gerasimos Lampouras
The ARRAU 3.0 Corpus
Massimo Poesio, Maris Camilleri, Paloma Carretero Garcia et al.
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese
Ajinkya Kulkarni, Anna Tokareva, Rameez Qureshi et al.
The DURel Annotation Tool: Human and Computational Measurement of Semantic Proximity, Sense Clusters and Semantic Change
Dominik Schlechtweg, Shafqat Mumtaz Virk, Pauline Sander et al.
The Effects of Data Quality on Named Entity Recognition
Divya Bhadauria, Alejandro Sierra Múnera, Ralf Krestel
The Future of Web Data Mining: Insights from Multimodal and Code-based Extraction Methods
Evan Fellman, Jacob Tyo, Zachary Lipton
The Generative AI Paradox in Evaluation: “What It Can Solve, It May Not Evaluate”
Juhyun Oh, Eunsu Kim, Inha Cha et al.
The Impact of Integration Step on Integrated Gradients
Masahiro Makino, Yuya Asazuma, Shota Sasaki et al.
The Impact of Language Adapters in Cross-Lingual Transfer for NLU
Jenny Kunz, Oskar Holmström
The KIND Dataset: A Social Collaboration Approach for Nuanced Dialect Data Collection
Asma Yamani, Raghad Alziyady, Reem AlYami et al.
The Kronieken Corpus: an Annotated Collection of Dutch/Flemish Chronicles from 1500-1850
Theo Dekker, Erika Kuijpers, Alie Lassche et al.
The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks
Anders Giovanni Møller, Arianna Pera, Jacob Dalsgaard et al.
The Queen of England is not England’s Queen: On the Lack of Factual Coherency in PLMs
Paul Youssef, Jörg Schlötterer, Christin Seifert
Therapist Self-Disclosure as a Natural Language Processing Task
Natalie Shapira, Tal Alfi-Yogev