Muyan Zhong
3 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (3)
🐝 Cross-Pollinator (12)
🌉 Interdisciplinary Bridge
🐣 Hot Topic Early Bird
Conferences
CVPR (1)
ICML (1)
NeurIPS (1)
Keywords
zero-shot learning (1)
image generation (1)
pose estimation (1)
visual question answering (1)
vision-language alignment (1)
multi-modal learning (1)
image editing (1)
object localization (1)
zero-shot image classification (1)
foundation model (1)
vision-language model (1)
multimodal large language model (1)
image-text retrieval (1)
vision foundation model (1)
end-to-end training (1)
vision-language task (1)
visual understanding (1)
large-scale training (1)
multi-modal dialogue system (1)
Papers
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
ICML 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024