Dongdong Chen

88 papers · 2017–2026 · 13 conferences · across top CS/AI conferences

Achievements

+20 more ↓

🌍 Conference Polyglot (13) 🏃 Academic Marathon (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (5)

🏃 Academic Marathon (9) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13) 🌟 Keyword Trendsetter Combo (6) 🏠 Conference Loyalist (40) 📛 The Namer 🏆 Grand Slam 🤝 Dynamic Duo (43) 👥 Mega-Team (20) 👑 Triple Crown 🔬 Deep Specialist (15) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🗃️ Keyword Collector (340) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (10) ❓ The Questioner ⚡ Prolific Year (12) 💎 Century Club (86)

Conferences

CVPR (40) ICCV (13) NIPS (9) ECCV (7) AAAI (6) ICML (3) MICCAI (3) ICLR (2) EMNLP (1) IJCAI (1) JMLR (1) NAACL (1) WACV (1)

Top co-authors

Lu Yuan (43) Nenghai Yu (31) Weiming Zhang (20) Jing Liao (19) Xiyang Dai (19) Yinpeng Chen (18) Mengchen Liu (16) Jianmin Bao (15) Xiaoyi Dong (14) Dong Chen (13)

Keywords

contrastive learning (8) image generation (7) self-supervised learning (7) diffusion model (7) multimodal learning (6) adversarial attack (6) transfer learning (6) video understanding (5) semantic segmentation (5) object detection (5) image inpainting (5) unsupervised learning (5) domain adaptation (5) vision transformer (5) convolutional neural network (4) few-shot learning (4) attention mechanism (4) image editing (3) zero-shot learning (3) text-to-image generation (3)

Papers

MageBench: Bridging Large Multimodal Models to Agents WACV 2026 MagicPaint: Operate Anything for Image Inpainting with Diffusion Model AAAI 2026 LLM2CLIP: Powerful Language Model Unlocks Richer Cross-Modality Representation AAAI 2026 UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery CVPR 2025 SmartEraser: Remove Anything from Images using Masked-Region Guidance CVPR 2025 Olympus: A Universal Task Router for Computer Vision Tasks CVPR 2025 Show and Segment: Universal Medical Image Segmentation via In-Context Learning CVPR 2025 I2V3D: Controllable Image-to-video Generation with 3D Guidance ICCV 2025 FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing ICCV 2025 ProLongVid: A Simple but Strong Baseline for Long-context Video Instruction Tuning EMNLP 2025 RSAD: Region-Specific Anomaly Detection in fMRI for Disease Diagnosis MICCAI 2025 Exploring Invariance in Images through One-way Wave Equations ICML 2025 VLM4D: Towards Spatiotemporal Awareness in Vision Language Models ICCV 2025 Equivariant Multi-Modality Image Fusion CVPR 2024 Image Fusion via Vision-Language Model ICML 2024 Self-supervised Learning with Adaptive Graph Structure and Function Representation For Cross-Dataset Brain Disorder Diagnosis MICCAI 2024 Affinity Learning Based Brain Function Representation for Disease Diagnosis MICCAI 2024 Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods IJCAI 2024 i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data NAACL 2024 Towards More Unified In-context Visual Understanding CVPR 2024 OmniViD: A Generative Framework for Universal Video Understanding CVPR 2024 Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation ECCV 2024 Diversity-Aware Meta Visual Prompting CVPR 2023 Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning CVPR 2023 MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining CVPR 2023 Streaming Video Model CVPR 2023 Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles CVPR 2023 Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding CVPR 2023 Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting ICCV 2023 Sensing Theorems for Unsupervised Learning in Linear Inverse Problems JMLR 2023 X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion ICML 2023 Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations ICLR 2023 AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control ICCV 2023 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models NIPS 2023 Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection NIPS 2023 PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers AAAI 2023 HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending ICCV 2023 Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis AAAI 2023 i-Code: An Integrative and Composable Multimodal Learning Framework AAAI 2023 Look Before You Match: Instance Understanding Matters in Video Object Segmentation CVPR 2023 HairCLIP: Design Your Hair by Text and Reference Image CVPR 2022 Unsupervised Learning From Incomplete Measurements for Inverse Problems NIPS 2022 OmniVL: One Foundation Model for Image-Language and Video-Language Tasks NIPS 2022 REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering NIPS 2022 Mobile-Former: Bridging MobileNet and Transformer CVPR 2022 CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields CVPR 2022 CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows CVPR 2022 Reduce Information Loss in Transformers for Pluralistic Image Inpainting CVPR 2022 Large-Scale Pre-Training for Person Re-Identification With Noisy Labels CVPR 2022 BEVT: BERT Pretraining of Video Transformers CVPR 2022 Shape-Invariant 3D Adversarial Point Clouds CVPR 2022 Bringing Old Films Back to Life CVPR 2022 Robust Equivariant Imaging: A Fully Unsupervised Framework for Learning To Image From Noisy and Partial Measurements CVPR 2022 General Facial Representation Learning in a Visual-Linguistic Manner CVPR 2022 Vector Quantized Diffusion Model for Text-to-Image Synthesis CVPR 2022 Protecting Celebrities From DeepFake With Identity Consistency Transformer CVPR 2022 Should All Proposals Be Treated Equally in Object Detection? ECCV 2022 Bootstrapped Masked Autoencoders for Vision BERT Pretraining ECCV 2022 Multi-Attentional Deepfake Detection CVPR 2021 Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation CVPR 2021 Dynamic Head: Unifying Object Detection Heads With Attentions CVPR 2021 Stronger NAS with Weaker Predictors NIPS 2021 Unsupervised Pre-Training for Person Re-Identification CVPR 2021 Learning With Noisy Labels for Robust Point Cloud Segmentation ICCV 2021 High-Fidelity Pluralistic Image Completion With Transformers ICCV 2021 Equivariant Imaging: Learning Beyond the Range Space ICCV 2021 MicroNet: Improving Image Recognition With Extremely Low FLOPs ICCV 2021 Improve Unsupervised Pretraining for Few-Label Transfer ICCV 2021 Revisiting Dynamic Convolution via Matrix Decomposition ICLR 2021 Diverse Semantic Image Synthesis via Probability Distribution Modeling CVPR 2021 Passport-aware Normalization for Deep Model Protection NIPS 2020 DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search ECCV 2020 Dynamic ReLU ECCV 2020 Robust Superpixel-Guided Attentional Adversarial Attack CVPR 2020 LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks CVPR 2020 Bringing Old Photos Back to Life CVPR 2020 Model Watermarking for Image Processing Networks AAAI 2020 Deep Decomposition Learning for Inverse Imaging Problems ECCV 2020 GreedyFool: Distortion-Aware Sparse Adversarial Attack NIPS 2020 Density-Aware Graph for Deep Semi-Supervised Visual Recognition CVPR 2020 Self-Robust 3D Point Recognition via Gather-Vector Guidance CVPR 2020 Dynamic Convolution: Attention Over Convolution Kernels CVPR 2020 Transductive Zero-Shot Learning with Visual Structure Constraint NIPS 2019 Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once ICCV 2019 Decouple Learning for Parameterized Image Operators ECCV 2018 Stereoscopic Neural Style Transfer CVPR 2018 StyleBank: An Explicit Representation for Neural Image Style Transfer CVPR 2017 Coherent Online Video Style Transfer ICCV 2017