Dongdong Chen
88 papers · 2017–2026 · 13 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (13)
🏃 Academic Marathon (9)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🐝 Cross-Pollinator (5)
🌟 Keyword Trendsetter Combo (6)
🏠 Conference Loyalist (40)
📛 The Namer
🏆 Grand Slam
🤝 Dynamic Duo (43)
👥 Mega-Team (20)
👑 Triple Crown
🔬 Deep Specialist (15)
🧬 Topic Evolution
🏆 Keyword Champion (2)
🗃️ Keyword Collector (340)
📈 Trend Setter
🚀 Conference Pioneer
🔥 Unstoppable (10)
❓ The Questioner
⚡ Prolific Year (12)
💎 Century Club (86)
Conferences
CVPR (40)
ICCV (13)
NIPS (9)
ECCV (7)
AAAI (6)
ICML (3)
MICCAI (3)
ICLR (2)
EMNLP (1)
IJCAI (1)
JMLR (1)
NAACL (1)
WACV (1)
Keywords
contrastive learning (8)
image generation (7)
self-supervised learning (7)
diffusion model (7)
multimodal learning (6)
adversarial attack (6)
transfer learning (6)
video understanding (5)
semantic segmentation (5)
object detection (5)
image inpainting (5)
unsupervised learning (5)
domain adaptation (5)
vision transformer (5)
convolutional neural network (4)
few-shot learning (4)
attention mechanism (4)
image editing (3)
zero-shot learning (3)
text-to-image generation (3)
Papers
MageBench: Bridging Large Multimodal Models to Agents
WACV 2026
MagicPaint: Operate Anything for Image Inpainting with Diffusion Model
AAAI 2026
LLM2CLIP: Powerful Language Model Unlocks Richer Cross-Modality Representation
AAAI 2026
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
CVPR 2025
SmartEraser: Remove Anything from Images using Masked-Region Guidance
CVPR 2025
Olympus: A Universal Task Router for Computer Vision Tasks
CVPR 2025
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
CVPR 2025
I2V3D: Controllable Image-to-video Generation with 3D Guidance
ICCV 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
ICCV 2025
ProLongVid: A Simple but Strong Baseline for Long-context Video Instruction Tuning
EMNLP 2025
RSAD: Region-Specific Anomaly Detection in fMRI for Disease Diagnosis
MICCAI 2025
Exploring Invariance in Images through One-way Wave Equations
ICML 2025
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
ICCV 2025
Equivariant Multi-Modality Image Fusion
CVPR 2024
Image Fusion via Vision-Language Model
ICML 2024
Self-supervised Learning with Adaptive Graph Structure and Function Representation For Cross-Dataset Brain Disorder Diagnosis
MICCAI 2024
Affinity Learning Based Brain Function Representation for Disease Diagnosis
MICCAI 2024
Sub-Adjacent Transformer: Improving Time Series Anomaly Detection with Reconstruction Error from Sub-Adjacent Neighborhoods
IJCAI 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
NAACL 2024
Towards More Unified In-context Visual Understanding
CVPR 2024
OmniViD: A Generative Framework for Universal Video Understanding
CVPR 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
ECCV 2024
Diversity-Aware Meta Visual Prompting
CVPR 2023
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning
CVPR 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
CVPR 2023
Streaming Video Model
CVPR 2023
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
CVPR 2023
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
CVPR 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
ICCV 2023
Sensing Theorems for Unsupervised Learning in Linear Inverse Problems
JMLR 2023
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
ICML 2023
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
ICLR 2023
AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control
ICCV 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
NIPS 2023
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
NIPS 2023
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
AAAI 2023
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
ICCV 2023
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
AAAI 2023
i-Code: An Integrative and Composable Multimodal Learning Framework
AAAI 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
CVPR 2023
HairCLIP: Design Your Hair by Text and Reference Image
CVPR 2022
Unsupervised Learning From Incomplete Measurements for Inverse Problems
NIPS 2022
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks
NIPS 2022
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
NIPS 2022
Mobile-Former: Bridging MobileNet and Transformer
CVPR 2022
CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows
CVPR 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022
BEVT: BERT Pretraining of Video Transformers
CVPR 2022
Shape-Invariant 3D Adversarial Point Clouds
CVPR 2022
Bringing Old Films Back to Life
CVPR 2022
Robust Equivariant Imaging: A Fully Unsupervised Framework for Learning To Image From Noisy and Partial Measurements
CVPR 2022
General Facial Representation Learning in a Visual-Linguistic Manner
CVPR 2022
Vector Quantized Diffusion Model for Text-to-Image Synthesis
CVPR 2022
Protecting Celebrities From DeepFake With Identity Consistency Transformer
CVPR 2022
Should All Proposals Be Treated Equally in Object Detection?
ECCV 2022
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
ECCV 2022
Multi-Attentional Deepfake Detection
CVPR 2021
Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation
CVPR 2021
Dynamic Head: Unifying Object Detection Heads With Attentions
CVPR 2021
Stronger NAS with Weaker Predictors
NIPS 2021
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021
Learning With Noisy Labels for Robust Point Cloud Segmentation
ICCV 2021
High-Fidelity Pluralistic Image Completion With Transformers
ICCV 2021
Equivariant Imaging: Learning Beyond the Range Space
ICCV 2021
MicroNet: Improving Image Recognition With Extremely Low FLOPs
ICCV 2021
Improve Unsupervised Pretraining for Few-Label Transfer
ICCV 2021
Revisiting Dynamic Convolution via Matrix Decomposition
ICLR 2021
Diverse Semantic Image Synthesis via Probability Distribution Modeling
CVPR 2021
Passport-aware Normalization for Deep Model Protection
NIPS 2020
DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search
ECCV 2020
Dynamic ReLU
ECCV 2020
Robust Superpixel-Guided Attentional Adversarial Attack
CVPR 2020
LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks
CVPR 2020
Bringing Old Photos Back to Life
CVPR 2020
Model Watermarking for Image Processing Networks
AAAI 2020
Deep Decomposition Learning for Inverse Imaging Problems
ECCV 2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack
NIPS 2020
Density-Aware Graph for Deep Semi-Supervised Visual Recognition
CVPR 2020
Self-Robust 3D Point Recognition via Gather-Vector Guidance
CVPR 2020
Dynamic Convolution: Attention Over Convolution Kernels
CVPR 2020
Transductive Zero-Shot Learning with Visual Structure Constraint
NIPS 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
ICCV 2019
Decouple Learning for Parameterized Image Operators
ECCV 2018
Stereoscopic Neural Style Transfer
CVPR 2018
StyleBank: An Explicit Representation for Neural Image Style Transfer
CVPR 2017
Coherent Online Video Style Transfer
ICCV 2017