Yuchao Gu

15 papers · 2020–2026 · 7 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (7) 🏃 Academic Marathon (6) 🗺️ Taxonomy Completionist (26)

🐣 Hot Topic Early Bird 🤝 Dynamic Duo (11) ⚡ Prolific Year (7) 💎 Century Club (15) 🗃️ Keyword Collector (60) 🔥 Unstoppable (5)

Conferences

CVPR (5) ECCV (3) NIPS (3) AAAI (1) ICCV (1) ICLR (1) WACV (1)

Top co-authors

Mike Zheng Shou (11) Rui Zhao (7) Ying Shan (5) Jay Zhangjie Wu (5) David Junhao Zhang (4) Xintao Wang (4) Weijia Wu (4) Yixiao Ge (3) Jia-Wei Liu (3) Jussi Keppo (2)

Keywords

diffusion model (5) video editing (3) video generation (2) text-to-image generation (2) domain generalization (1) transfer learning (1) semantic segmentation (1) knowledge distillation (1) self-attention mechanism (1) image synthesis (1) frame interpolation (1) instance segmentation (1) task generalization (1) synthetic data generation (1) motion detection (1) vector quantization (1) pyramid structures (1) generative model (1) image generation (1) zero-shot learning (1)

Papers

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models · WACV 2026
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation · ICLR 2025
ROICtrl: Boosting Instance Control for Visual Generation · CVPR 2025
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers · CVPR 2024
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence · CVPR 2024
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing · CVPR 2024
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models · NIPS 2024
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis · CVPR 2024
Drag Anything: Motion Control for Anything using Entity Representation · ECCV 2024
MotionDirector: Motion Customization of Text-to-Video Diffusion Models · ECCV 2024
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models · NIPS 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation · ICCV 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models · NIPS 2023
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder · ECCV 2022
Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection · AAAI 2020