Yingsen Zeng
4 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🌈
Renaissance Researcher
(5)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
ICCV (2)
AAAI (1)
ECCV (1)
Top co-authors
Keywords
video understanding
(2)
visual question answering
(1)
text-to-image generation
(1)
probability distribution
(1)
multi-modal large language model
(1)
vision-language model
(1)
video language model
(1)
temporal representation
(1)
visual segmentation
(1)
multimodal diffusion
(1)
temporal localization
(1)
temporal embedding
(1)
visual text rendering
(1)
distribution-based modeling
(1)
glyph-aware diffusion
(1)
semantic segmentation
(1)
image aesthetic refinement
(1)
Papers
ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal Diffusion
AAAI 2026
DisTime: Distribution-based Time Representation for Video Large Language Models
ICCV 2025
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
ECCV 2024