conftrace_

Zhuofan Zong

11 papers · 2020–2025 · 5 conferences · across top CS/AI conferences

Achievements


🗺️ Taxonomy Completionist (22) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5) 🌈 Renaissance Researcher (5)

🌍 Conference Polyglot (5) 🏃 Academic Marathon (5) 🤝 Dynamic Duo (10) 💎 Century Club (11)

Conferences

NIPS (6) ICCV (2) AAAI (1) ECCV (1) ICML (1)

Top co-authors

Guanglu Song (10) Yu Liu (10) Hongsheng Li (6) Dongzhi Jiang (4) Bingqi Ma (3) Hao Shao (3) Zeyue Xue (3) Dazhong Shen (3) Ping Luo (2) Mingyuan Zhang (1)

Keywords

text-to-image generation (3) diffusion model (3) mixture of experts (2) object detection (2) autonomous driving (1) 3d vision (1) visual question answering (1) bird's eye view (1) multimodal learning (1) visual reasoning (1) instance segmentation (1) model adaptation (1) graph attention (1) 3d object detection (1) multi-modal large language model (1) video understanding (1) chain-of-thought reasoning (1) vision-language model (1) graph attention network (1) temporal modeling (1)

Papers

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM · ICML 2025

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning · NIPS 2024

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching · NIPS 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context · NIPS 2024

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models · NIPS 2024

DETRs with Collaborative Hybrid Assignments Training · ICCV 2023

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction · ICCV 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths · NIPS 2023

Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes · NIPS 2022

Self-Slimmed Vision Transformer · ECCV 2022

Graph Attention Based Proposal 3D ConvNets for Action Detection · AAAI 2020