Ming Dai
6 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(14)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (3)
ICCV (2)
NIPS (1)
Top co-authors
Keywords
visual grounding
(3)
multimodal large language model
(2)
attention mechanism
(2)
referring image segmentation
(2)
multimodal learning
(1)
multi-modal learning
(1)
object localization
(1)
vision-language model
(1)
inference optimization
(1)
multimodal representation
(1)
multimodal fusion
(1)
inference acceleration
(1)
referring expression comprehension
(1)
multi-modal fusion
(1)
key-value cache
(1)
visual token pruning
(1)
object proposal
(1)
consistency constraint
(1)
referring expression segmentation
(1)
image-text comprehension
(1)
Papers
Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model
AAAI 2026
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
AAAI 2025
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
AAAI 2025
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
ICCV 2025
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
ICCV 2025
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
NIPS 2024