Xiaodong Cun
35 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Cross-Pollinator (11) π§ Keyword Pioneer π Academic Marathon (5) π Conference Polyglot (7) π Renaissance Researcher (6)
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(59)
π
Keyword Champion
π¬
Deep Specialist
(13)
π§¬
Topic Evolution
π€
Dynamic Duo
(20)
ποΈ
Keyword Collector
(168)
β‘
Prolific Year
(12)
π₯
Unstoppable
(6)
π
Century Club
(35)
Conferences
CVPR (16)
ECCV (6)
AAAI (5)
ICCV (4)
NIPS (2)
ICLR (1)
WACV (1)
Top co-authors
Keywords
diffusion model
(12)
video generation
(8)
temporal consistency
(3)
image restoration
(3)
semantic segmentation
(3)
zero-shot learning
(3)
text-to-video generation
(3)
text-to-image generation
(3)
latent space
(3)
high-resolution image
(2)
test-time training
(2)
transformer architecture
(2)
generative adversarial network
(2)
neural rendering
(2)
depth estimation
(2)
video editing
(2)
shadow removal
(2)
face animation
(2)
object detection
(1)
image generation
(1)
Papers
MagicStick: Controllable Video Editing via Control Handle Transformations
WACV 2025
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
AAAI 2025
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
CVPR 2025
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
CVPR 2025
DEIM: DETR with Improved Matching for Fast Convergence
CVPR 2025
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
CVPR 2024
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
NIPS 2024
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024
Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
CVPR 2024
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
CVPR 2024
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
CVPR 2024
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
ECCV 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
ECCV 2024
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
Inserting Anybody in Diffusion Models via Celeb Basis
NIPS 2023
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
ICCV 2023
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
ICCV 2023
DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
CVPR 2023
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior
CVPR 2023
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
CVPR 2023
Explicit Visual Prompting for Low-Level Structure Segmentations
CVPR 2023
ToonTalker: Cross-Domain Face Reenactment
ICCV 2023
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
ICCV 2023
CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying
AAAI 2023
3D GAN Inversion With Facial Symmetry Prior
CVPR 2023
Generating Human Motion From Textual Descriptions With Discrete Representations
CVPR 2023
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization
ECCV 2022
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
ECCV 2022
Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal
AAAI 2021
Towards Ghost-Free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN
AAAI 2020
Defocus Blur Detection via Depth Distillation
ECCV 2020