Papers
DocSplit: Simple Contrastive Pretraining for Large Document Embeddings
Yujie Wang, Mike Izbicki
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading
Hao Wang, Qingxuan Wang, Yue Li et al.
Document-Level Language Models for Machine Translation
Frithjof Petrick, Christian Herold, Pavel Petrushkov et al.
Document-Level Machine Translation with Large Language Models
Longyue Wang, Chenyang Lyu, Tianbo Ji et al.
Document-level Relationship Extraction by Bidirectional Constraints of Beta Rules
Yichun Liu, Zizhong Zhu, Xiaowang Zhang et al.
DocumentNet: Bridging the Data Gap in Document Pre-training
Lijun Yu, Jin Miao, Xiaoyu Sun et al.
Do Differences in Values Influence Disagreements in Online Discussions?
Michiel van der Meer, Piek Vossen, Catholijn M. Jonker et al.
Do “English” Named Entity Recognizers Work Well on Global Englishes?
Alexander Shan, John Bauer, Riley Carlson et al.
Does Listener Gaze in Face-to-Face Interaction Follow the Entropy Rate Constancy Principle: An Empirical Study
Yu Wang, Hendrik Buschmeier
Does Named Entity Recognition Truly Not Scale Up to Real-world Product Attribute Extraction?
Wei-Te Chen, Keiji Shinzato, Naoki Yoshinaga et al.
Does the Correctness of Factual Knowledge Matter for Factual Knowledge-Enhanced Pre-trained Language Models?
Boxi Cao, Qiaoyu Tang, Hongyu Lin et al.
Do Language Models Have a Common Sense regarding Time? Revisiting Temporal Commonsense Reasoning in the Era of Large Language Models
Raghav Jain, Daivik Sojitra, Arkadeep Acharya et al.
Do Language Models Learn about Legal Entity Types during Pretraining?
Claire Barale, Michael Rovatsos, Nehal Bhuta
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Minje Choi, Jiaxin Pei, Sagar Kumar et al.
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
El Moatez Billah Nagoudi, AbdelRahim Elmadany, Ahmed Oumar El-Shangiti et al.
Domain Adaptation for Conversational Query Production with the RAG Model Feedback
Ante Wang, Linfeng Song, Ge Xu et al.
Domain Adaptation for Sentiment Analysis Using Robust Internal Representations
Mohammad Rostami, Digbalay Bose, Shrikanth Narayanan et al.
Domain-Adapting BERT for Attributing Manuscript, Century and Region in Pre-Modern Slavic Texts
Piroska Lendvai, Uwe Reichel, Anna Jouravel et al.
Domain Private Transformers for Multi-Domain Dialog Systems
Anmol Kabra, Ethan Elenberg
Domain Terminology Integration into Machine Translation: Leveraging Large Language Models
Yasmin Moslem, Gianfranco Romani, Mahdi Molaei et al.
Don’t Add, don’t Miss: Effective Content Preserving Generation from Pre-Selected Text Spans
Aviv Slobodkin, Avi Caciularu, Eran Hirsch et al.
‘Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism
Ronald Cardenas, Bingsheng Yao, Dakuo Wang et al.
Don’t Take This Out of Context!: On the Need for Contextual Models and Evaluations for Stylistic Rewriting
Akhila Yerukola, Xuhui Zhou, Elizabeth Clark et al.
Don’t Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs
Xiang Zhang, Senyu Li, Bradley Hauer et al.
Don’t waste a single annotation: improving single-label classifiers through soft labels
Ben Wu, Yue Li, Yida Mu et al.