Papers
DACL: Disfluency Augmented Curriculum Learning for Fluent Text Generation
Rohan Chaudhury, Maria Teleki, Xiangjue Dong et al.
DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
Lorenzo Lupo, Paul Bose, Mahyar Habibi et al.
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan et al.
DanteLLM: Let’s Push Italian LLM Research Forward!
Andrea Bacciu, Cesare Campagnano, Giovanni Trappolini et al.
DARES: Dataset for Arabic Readability Estimation of School Materials
Mo El-Haj, Sultan Almujaiwel, Damith Premasiri et al.
DARIUS: A Comprehensive Learner Corpus for Argument Mining in German-Language Essays
Nils-Jonathan Schaller, Andrea Horbach, Lars Ingver Höft et al.
Data Collection Pipeline for Low-Resource Languages: A Case Study on Constructing a Tetun Text Corpus
Gabriel de Jesus, Sérgio Sobral Nunes
Data Drift in Clinical Outcome Prediction from Admission Notes
Paul Grundmann, Jens-Michalis Papaioannou, Tom Oberhauser et al.
Data Driven Approach for Mathematical Problem Solving
Byungju Kim, Wonseok Lee, Jaehong Kim et al.
Data-Envelopes for Cultural Heritage: Going beyond Datasheets
Mrinalini Luthra, Maria Eskevich
Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks
Ileana Rugina, Rumen Dangovski, Li Jing et al.
Data Integration, Annotation, and Transcription Methods for Sign Language Dialogue with Latency in Videoconferencing
Mayumi Bono, Tomohiro Okada, Victor Skobov et al.
Dataset for Identification of Homophobia and Transphobia for Telugu, Kannada, and Gujarati
Prasanna Kumar Kumaresan, Rahul Ponnusamy, Dhruv Sharma et al.
Dataset of Quotation Attribution in German News Articles
Fynn Petersen-Frey, Chris Biemann
Dates and places as points of attachment for memorial contents in the ISW corpus: 1938 as a turning point
Carolina Flinz, Simona Leonardi
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan, Jin Xu, Fandong Meng et al.
DDxGym: Online Transformer Policies in a Knowledge Graph Based Natural Language Environment
Benjamin Winter, Alexei Gustavo Figueroa Rosero, Alexander Loeser et al.
Dealing with Data Scarcity in Spoken Question Answering
Merve Ünlü Menevşe, Yusufcan Manav, Ebru Arisoy et al.
Debiasing Multi-Entity Aspect-Based Sentiment Analysis with Norm-Based Data Augmentation
Scott Friedman, Joan Zheng, Hillel Steinmetz
Deciphering Emotional Landscapes in the Iliad: A Novel French-Annotated Dataset for Emotion Recognition
Davide Picca, John Pavlopoulos
Deciphering Political Entity Sentiment in News with Large Language Models: Zero-Shot and Few-Shot Strategies
Alapan Kuila, Sudeshna Sarkar
DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark
Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs
Chenxi Sun, Hongzhi Zhang, Zijia Lin et al.
Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models Using Minimal Pairs
Linyang He, Peili Chen, Ercong Nie et al.
Decoding Sign Languages: The SL-FE Framework for Phonological Analysis and Automated Annotation
Karahan Şahin, Kadir Gökgöz