Zongxin Yang

32 papers · 2019–2026 · 8 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (15)

🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (54) 🌉 Interdisciplinary Bridge 🏆 Grand Slam 🤝 Dynamic Duo (27) ⚡ Prolific Year (5) 🗃️ Keyword Collector (138) 🔥 Unstoppable (7) 💎 Century Club (31)

Conferences

CVPR (9) ICCV (6) NIPS (4) AAAI (3) ECCV (3) ICLR (3) ICML (2) IJCAI (2)

Top co-authors

Yi Yang (28) Yifan Sun (4) Jianxin Ma (4) Chang Zhou (4) Dewei Zhou (3) Yunchao Wei (3) Yuanyou Xu (3) Xiaolong Shen (2) Yunzhi Zhuge (2) Xiaodi Li (2)

Keywords

video object segmentation (4) semantic segmentation (4) diffusion model (4) instance segmentation (4) attention mechanism (3) image restoration (3) domain adaptation (2) contrastive learning (2) object detection (2) image generation (2) 3d reconstruction (2) transformer architecture (2) few-shot learning (2) vision transformer (2) object tracking (2) multi-object tracking (2) domain generalization (1) online learning (1) embedding learning (1) temporal modeling (1)

Papers

Insert Anything: Image Insertion via In-Context Editing in DiT AAAI 2026 The Devil is in Temporal Token: High Quality Video Reasoning Segmentation CVPR 2025 SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons CVPR 2025 3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation ICLR 2025 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge ICLR 2025 Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation AAAI 2025 Origin Identification for Text-Guided Image-to-Image Diffusion Models ICML 2025 DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models ICCV 2025 SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction CVPR 2024 DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image Matting NIPS 2024 Controllable 3D Face Generation with Conditional Style Code Diffusion AAAI 2024 HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting ECCV 2024 DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent) ICML 2024 FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation CVPR 2023 Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation CVPR 2023 Video Object Segmentation in Panoptic Wild Scenes IJCAI 2023 Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction NIPS 2023 Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation ICCV 2023 Pyramid Diffusion Models for Low-light Image Enhancement IJCAI 2023 JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery ICCV 2023 TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering ICCV 2023 Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation ICCV 2023 ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification CVPR 2023 Decompose to Generalize: Species-Generalized Animal Pose Estimation ICLR 2023 Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation ECCV 2022 Decoupling Features in Hierarchical Propagation for Video Object Segmentation NIPS 2022 H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection CVPR 2022 Associating Objects with Transformers for Video Object Segmentation NIPS 2021 DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency CVPR 2021 Collaborative Video Object Segmentation by Foreground-Background Integration ECCV 2020 Gated Channel Transformation for Visual Recognition CVPR 2020 Very Long Natural Scenery Image Prediction by Outpainting ICCV 2019