Damai Dai
26 papers · 2019–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Academic Marathon (6) π Conference Polyglot (8) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (7)
π
Cross-Pollinator
(7)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(68)
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
β‘
Prolific Year
(6)
π
Conference Pioneer
π₯
Unstoppable
(5)
π
Century Club
(24)
ποΈ
Keyword Collector
(121)
β
The Questioner
Conferences
ACL (11)
EMNLP (7)
AAAI (3)
COLING (1)
IJCAI (1)
IJCNLP (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
large language model
(8)
mixture of expert
(5)
in-context learning
(4)
language model
(4)
pretrained language model
(3)
transfer learning
(2)
word formation
(2)
entity representation
(2)
knowledge graph embedding
(2)
model scaling
(2)
few-shot learning
(2)
factual knowledge
(2)
mathematical reasoning
(2)
attention mechanism
(2)
knowledge editing
(2)
sentiment analysis
(2)
representation learning
(2)
text generation
(2)
video understanding
(2)
expert specialization
(2)
Papers
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
ACL 2026
Large Language Models Struggle with Unreasonability in Math Problems
AAAI 2026
Language Models Encode the Value of Numbers Linearly
COLING 2025
Exploring Activation Patterns of Parameters in Language Models
AAAI 2025
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
ACL 2025
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
ACL 2024
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
ACL 2024
A Survey on In-context Learning
EMNLP 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
EMNLP 2024
Not All Demonstration Examples are Equally Beneficial: Reweighting Demonstration Examples for In-Context Learning
EMNLP 2023
Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion
ACL 2023
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers
ACL 2023
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
EMNLP 2023
Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization
EMNLP 2023
StableMoE: Stable Routing Strategy for Mixture of Experts
ACL 2022
Knowledge Neurons in Pretrained Transformers
ACL 2022
Hierarchical Curriculum Learning for AMR Parsing
ACL 2022
On the Representation Collapse of Sparse Mixture of Experts
NIPS 2022
Calibrating Factual Knowledge in Pretrained Language Models
EMNLP 2022
Robust Fine-tuning via Perturbation and Interpolation from In-batch Instances
IJCAI 2022
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions
ACL 2021
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions
IJCNLP 2021
Decompose, Fuse and Generate: A Formation-Informed Method for Chinese Definition Generation
NAACL 2021
Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation
EMNLP 2021
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts
AAAI 2019
Learning to Control the Fine-grained Sentiment for Story Ending Generation
ACL 2019