Yuchao Gu
15 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (5) π Conference Polyglot (7) π Academic Marathon (6) πΊοΈ Taxonomy Completionist (26)
π£
Hot Topic Early Bird
π
Conference Polyglot
(7)
π
Academic Marathon
(6)
π€
Dynamic Duo
(11)
β‘
Prolific Year
(7)
π
Century Club
(15)
ποΈ
Keyword Collector
(60)
π₯
Unstoppable
(5)
Conferences
CVPR (5)
ECCV (3)
NIPS (3)
AAAI (1)
ICCV (1)
ICLR (1)
WACV (1)
Top co-authors
Keywords
diffusion model
(5)
video editing
(3)
video generation
(2)
text-to-image generation
(2)
domain generalization
(1)
transfer learning
(1)
semantic segmentation
(1)
knowledge distillation
(1)
self-attention mechanism
(1)
image synthesis
(1)
frame interpolation
(1)
instance segmentation
(1)
task generalization
(1)
synthetic data generation
(1)
motion detection
(1)
vector quantization
(1)
pyramid structures
(1)
generative model
(1)
image generation
(1)
zero-shot learning
(1)
Papers
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
WACV 2026
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
ICLR 2025
ROICtrl: Boosting Instance Control for Visual Generation
CVPR 2025
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
CVPR 2024
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
CVPR 2024
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
CVPR 2024
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
NIPS 2024
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
CVPR 2024
Drag Anything: Motion Control for Anything using Entity Representation
ECCV 2024
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
ECCV 2024
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
NIPS 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
NIPS 2023
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
ECCV 2022
Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection
AAAI 2020