Minuk Ma
4 papers · 2019–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Conference Polyglot (3) π Academic Marathon (6) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
CVPR (2)
ECCV (1)
EMNLP (1)
Top co-authors
Keywords
multimodal learning
(3)
video understanding
(2)
attention mechanism
(2)
question answering
(2)
video question answering
(2)
audio-visual representation
(1)
memory network
(1)
dynamic fusion
(1)
caption generation
(1)
temporal localization
(1)
neural network
(1)
automated audio captioning
(1)
cross-modal supervision
(1)
progressive attention
(1)
modality shifting
(1)
visual question answering
(1)
heterogeneous reasoning
(1)
cross-modal learning
(1)
multi-modal learning
(1)
visual grounding
(1)
Papers
Learning to See through Sound: From VggCaps to Multi2Cap for Richer Automated Audio Captioning
EMNLP 2025
Modality Shifting Attention Network for Multi-Modal Video Question Answering
CVPR 2020
VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval
ECCV 2020
Progressive Attention Memory Network for Movie Story Question Answering
CVPR 2019