Papers
16,749 papers found
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
Ziang Ye, Zhenru Zhang, Yang Zhang et al.
Disentangling Text and Math in Word Problems: Evidence for the Bidimensional Structure of Large Language Models’ Reasoning
Pedro Calais, Gabriel Franco, Zilu Tang et al.
Disentangling the Roles of Representation and Selection in Data Pruning
Yupei Du, Yingjin Song, Hugh Mee Wong et al.
(Dis)improved?! How Simplified Language Affects Large Language Model Performance across Languages
Miriam Anschütz, Anastasiya Damaratskaya, Chaeeun Joy Lee et al.
DISPUTool 3.0: Fallacy Detection and Repairing in Argumentative Political Debates
Pierpaolo Goffredo, Deborah Dore, Elena Cabrio et al.
Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Runchu Tian, Yanghao Li, Yuepeng Fu et al.
Distilling an End-to-End Voice Assistant Without Instruction Training Data
William Held, Yanzhe Zhang, Minzhi Li et al.
DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models
Chengyu Wang, Junbing Yan, Yuanhao Yue et al.
DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts
Yuchen Feng, Bowen Shen, Naibin Gu et al.
Diversification Catalyzes Language Models’ Instruction Generalization To Unseen Semantics
Dylan Zhang, Justin Wang, Francois Charton
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts
Zeliang Zhang, Xiaodong Liu, Hao Cheng et al.
Diversity Explains Inference Scaling Laws: Through a Case Study of Minimum Bayes Risk Decoding
Hidetaka Kamigaito, Hiroyuki Deguchi, Yusuke Sakai et al.
Diversity-oriented Data Augmentation with Large Language Models
Zaitian Wang, Jinghan Zhang, Xinhao Zhang et al.
Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
Dongsheng Zhu, Weixian Shi, Zhengliang Shi et al.
Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Xin Sun, Jianan Xie, Zhongqi Chen et al.
Divide-Verify-Refine: Can LLMs Self-align with Complex Instructions?
Xianren Zhang, Xianfeng Tang, Hui Liu et al.
DKITNLP at ArchEHR-QA 2025: A Retrieval Augmented LLM Pipeline for Evidence-Based Patient Question Answering
Provia Kadusabe, Abhishek Kaushik, Fiona Lawless
DLSU at BEA 2025 Shared Task: Towards Establishing Baseline Models for Pedagogical Response Evaluation Tasks
Maria Monica Manlises, Mark Edward Gonzales, Lanz Lim
DLU: Dictionary Look-Up Data and Prediction
David Strohmaier, Gladys Tyen, Hongyi Gu et al.
DMIS Lab at ArchEHR-QA 2025: Evidence-Grounded Answer Generation for EHR-based QA via a Multi-Agent Framework
Hyeon Hwang, Hyeongsoon Hwang, Jongmyung Jung et al.
DNASpeech: A Contextualized and Situated Text-to-Speech Dataset with Dialogues, Narratives and Actions
Chuanqi Cheng, Hongda Sun, Bo Du et al.
DNB-AI-Project at SemEval-2025 Task 5: An LLM-Ensemble Approach for Automated Subject Indexing
Lisa Kluge, Maximilian Kähler
DNCASR: End-to-End Training for Speaker-Attributed ASR
Xianrui Zheng, Chao Zhang, Phil Woodland