Damai Dai

26 papers · 2019–2026 · 8 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (6) 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (7)

🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (68) 🤝 Dynamic Duo (18) 🧬 Topic Evolution ⚡ Prolific Year (6) 🚀 Conference Pioneer 🔥 Unstoppable (5) 💎 Century Club (24) 🗃️ Keyword Collector (121) ❓ The Questioner

Conferences

ACL (11) EMNLP (7) AAAI (3) COLING (1) IJCAI (1) IJCNLP (1) NAACL (1) NIPS (1)

Top co-authors

Zhifang Sui (19) Tianyu Liu (9) Baobao Chang (9) Lei Li (6) Deli Chen (5) Xu Sun (5) Furu Wei (5) Li Dong (4) Shuming Ma (4) Hua Zheng (4)

Keywords

large language model (8) mixture of expert (5) in-context learning (4) language model (4) pretrained language model (3) transfer learning (2) word formation (2) entity representation (2) knowledge graph embedding (2) model scaling (2) few-shot learning (2) factual knowledge (2) mathematical reasoning (2) attention mechanism (2) knowledge editing (2) sentiment analysis (2) representation learning (2) text generation (2) video understanding (2) expert specialization (2)

Papers

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models ACL 2026 Large Language Models Struggle with Unreasonability in Math Problems AAAI 2026 Language Models Encode the Value of Numbers Linearly COLING 2025 Exploring Activation Patterns of Parameters in Language Models AAAI 2025 Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention ACL 2025 DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ACL 2024 Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations ACL 2024 A Survey on In-context Learning EMNLP 2024 Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models EMNLP 2024 Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning EMNLP 2023 Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion ACL 2023 Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers ACL 2023 Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning EMNLP 2023 Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization EMNLP 2023 StableMoE: Stable Routing Strategy for Mixture of Experts ACL 2022 Knowledge Neurons in Pretrained Transformers ACL 2022 Hierarchical Curriculum Learning for AMR Parsing ACL 2022 On the Representation Collapse of Sparse Mixture of Experts NIPS 2022 Calibrating Factual Knowledge in Pretrained Language Models EMNLP 2022 Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances IJCAI 2022 Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions ACL 2021 Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions IJCNLP 2021 Decompose, Fuse and Generate: A Formation-Informed Method for Chinese Definition Generation NAACL 2021 Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation EMNLP 2021 LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts AAAI 2019 Learning to Control the Fine-grained Sentiment for Story Ending Generation ACL 2019