Zongxin Yang
32 papers · 2019–2026 · 8 top CS/AI conferences
Achievements
Academic Marathon (6)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (8)
Cross-Pollinator (15)
Renaissance Researcher (6)
Taxonomy Completionist (54)
Grand Slam
Dynamic Duo (27)
Prolific Year (5)
Keyword Collector (138)
Unstoppable (7)
Century Club (31)
Conferences
CVPR (9)
ICCV (6)
NIPS (4)
AAAI (3)
ECCV (3)
ICLR (3)
ICML (2)
IJCAI (2)
Keywords
video object segmentation (4)
semantic segmentation (4)
diffusion model (4)
instance segmentation (4)
attention mechanism (3)
image restoration (3)
domain adaptation (2)
contrastive learning (2)
object detection (2)
image generation (2)
3d reconstruction (2)
transformer architecture (2)
few-shot learning (2)
vision transformer (2)
object tracking (2)
multi-object tracking (2)
domain generalization (1)
online learning (1)
embedding learning (1)
temporal modeling (1)
Papers
Insert Anything: Image Insertion via In-Context Editing in DiT
AAAI 2026
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
CVPR 2025
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
CVPR 2025
3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation
ICLR 2025
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
ICLR 2025
Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation
AAAI 2025
Origin Identification for Text-Guided Image-to-Image Diffusion Models
ICML 2025
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
ICCV 2025
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
CVPR 2024
DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image Matting
NIPS 2024
Controllable 3D Face Generation with Conditional Style Code Diffusion
AAAI 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
ECCV 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
ICML 2024
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation
CVPR 2023
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation
CVPR 2023
Video Object Segmentation in Panoptic Wild Scenes
IJCAI 2023
Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction
NIPS 2023
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
ICCV 2023
Pyramid Diffusion Models for Low-light Image Enhancement
IJCAI 2023
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery
ICCV 2023
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering
ICCV 2023
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
ICCV 2023
ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification
CVPR 2023
Decompose to Generalize: Species-Generalized Animal Pose Estimation
ICLR 2023
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation
ECCV 2022
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
NIPS 2022
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection
CVPR 2022
Associating Objects with Transformers for Video Object Segmentation
NIPS 2021
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency
CVPR 2021
Collaborative Video Object Segmentation by Foreground-Background Integration
ECCV 2020
Gated Channel Transformation for Visual Recognition
CVPR 2020
Very Long Natural Scenery Image Prediction by Outpainting
ICCV 2019