Zhuofan Zong
11 papers · 2020–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (22) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (5) π Renaissance Researcher (5)
π
Conference Polyglot
(5)
π
Academic Marathon
(5)
π€
Dynamic Duo
(10)
π
Century Club
(11)
Conferences
NIPS (6)
ICCV (2)
AAAI (1)
ECCV (1)
ICML (1)
Top co-authors
Keywords
text-to-image generation
(3)
diffusion model
(3)
mixture of expert
(2)
object detection
(2)
autonomous driving
(1)
3d vision
(1)
visual question answering
(1)
bird's eye view
(1)
multimodal learning
(1)
visual reasoning
(1)
instance segmentation
(1)
model adaptation
(1)
graph attention
(1)
3d object detection
(1)
multi-modal large language model
(1)
video understanding
(1)
chain-of-thought reasoning
(1)
vision-language model
(1)
graph attention network
(1)
temporal modeling
(1)
Papers
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
ICML 2025
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
NIPS 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
NIPS 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
NIPS 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
NIPS 2024
DETRs with Collaborative Hybrid Assignments Training
ICCV 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
NIPS 2023
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes
NIPS 2022
Self-Slimmed Vision Transformer
ECCV 2022
Graph Attention Based Proposal 3D ConvNets for Action Detection
AAAI 2020