Papers
Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models
Stephan Linzbach, Dimitar Dimitrov, Laura Kallmeyer et al.
Distilling Text Style Transfer With Self-Explanation From LLMs
Chiyu Zhang, Honglong Cai, Yuezhang Li et al.
Divergent Token Metrics: Measuring degradation to prune away LLM components – and optimize quantization
Björn Deiseroth, Max Meuer, Nikolas Gritsch et al.
Diverse Perspectives, Divergent Models: Cross-Cultural Evaluation of Depression Detection on Twitter
Nuredin Ali Abdelkadir, Charles Zhang, Ned Mayo et al.
DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text
Wenting Zhao, Ye Liu, Tong Niu et al.
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Weihao Zeng, Dayuan Fu, Keqing He et al.
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness
Yuqi Wang, Zeqiang Wang, Wei Wang et al.
DLM: A Decoupled Learning Model for Long-tailed Polyphone Disambiguation in Mandarin
Beibei Gao, Yangsen Zhang, Ga Xiang et al.
DOCMASTER: A Unified Platform for Annotation, Training, & Inference in Document Question-Answering
Alex Nguyen, Zilong Wang, Jingbo Shang et al.
Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling
Yupu Liang, Yaping Zhang, Cong Ma et al.
Does Fine-tuning a Classifier Help in Low-budget Scenarios? Not Much
Cesar Gonzalez - Gutierrez, Audi Primadhanty, Francesco Cazzaro et al.
Does GPT-4 pass the Turing test?
Cameron R. Jones, Benjamin K. Bergen
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi et al.
Does Whisper Understand Swiss German? An Automatic, Qualitative, and Human Evaluation
Eyal Dolev, Clemens Lutz, Noëmi Aepli
DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping
Yongrui Chen, Haiyun Jiang, Xinting Huang et al.
Do large language models and humans have similar behaviours in causal inference with script knowledge?
Xudong Hong, Margarita Ryzhova, Daniel Biondi et al.
Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers
Yuan Wang, Xuyang Wu, Hsin-Tai Wu et al.
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks
Ting-Yun Chang, Jesse Thomason, Robin Jia
Do Multilingual Language Models Think Better in English?
Julen Etxaniz, Gorka Azkune, Aitor Soroa et al.
Don’t be a Fool: Pooling Strategies in Offensive Language Detection from User-Intended Adversarial Attacks
Seunguk Yu, Juhwan Choi, YoungBin Kim
Do Prompt Positions Really Matter?
Junyu Mao, Stuart E. Middleton, Mahesan Niranjan
DoubleLingo: Causal Estimation with Large Language Models
Marko Veljanovski, Zach Wood-Doughty
Do Vision-Language Models Understand Compound Nouns?
Sonal Kumar, Sreyan Ghosh, S Sakshi et al.
DriftWatch: A Tool that Automatically Detects Data Drift and Extracts Representative Examples Affected by Drift
Myeongjun Jang, Antonios Georgiadis, Yiyun Zhao et al.