Xiaogang Wang
193 papers · 2007–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π Conference Polyglot (8)
π
Conference Polyglot
(8)
πΊοΈ
Taxonomy Completionist
(13)
π£
Hot Topic Early Bird
π
Keyword Trendsetter Combo
(17)
π
Conference Loyalist
(102)
π
Keyword Champion
π€
Dynamic Duo
(69)
π
Grand Slam
π₯
Mega-Team
(23)
π±
Topic Pioneer
π
Triple Crown
π¬
Deep Specialist
(35)
β‘
Prolific Year
(30)
π
Century Club
(193)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(13)
ποΈ
Keyword Collector
(661)
Conferences
CVPR (102)
ICCV (45)
ECCV (17)
NIPS (13)
AAAI (6)
ICLR (6)
ICML (3)
IJCAI (1)
Top co-authors
Keywords
convolutional neural network
(43)
deep learning
(20)
person re-identification
(17)
object detection
(16)
multi-task learning
(10)
neural network
(10)
face recognition
(10)
feature extraction
(9)
human pose estimation
(9)
pedestrian detection
(9)
feature learning
(9)
semantic segmentation
(9)
metric learning
(8)
representation learning
(8)
feature representation
(8)
depth estimation
(8)
transfer learning
(8)
point cloud
(7)
deep neural network
(7)
self-supervised learning
(7)
Papers
3D Dental Model Segmentation with Geometrical Boundary Preserving
CVPR 2025
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos
ICCV 2025
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
CVPR 2025
ConsistentCity: Semantic Flow-guided Occupancy DiT for Temporally Consistent Driving Scene Synthesis
ICCV 2025
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
ICLR 2024
FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance
CVPR 2024
Digital Life Project: Autonomous 3D Characters with Social Intelligence
CVPR 2024
Cached Transformers: Improving Transformers with Differentiable Memory Cachde
AAAI 2024
Phased Consistency Models
NIPS 2024
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
CVPR 2024
Real-Time Controllable Denoising for Image and Video
CVPR 2023
InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions
CVPR 2023
Siamese Image Modeling for Self-Supervised Vision Representation Learning
CVPR 2023
Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information
CVPR 2023
A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift
CVPR 2023
A Unified Conditional Framework for Diffusion-based Image Restoration
NIPS 2023
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
CVPR 2023
ViTAS: Vision Transformer Architecture Search
ECCV 2022
Pose for Everything: Towards Category-Agnostic Pose Estimation
ECCV 2022
RNNPose: Recurrent 6-DoF Object Pose Refinement With Robust Correspondence Field Estimation and Pose Optimization
CVPR 2022
Point2Seq: Detecting 3D Objects As Sequences
CVPR 2022
GreedyNASv2: Greedier Search With a Greedy Path Filter
CVPR 2022
Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer
CVPR 2022
Dynamic Token Normalization improves Vision Transformers
ICLR 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs
NIPS 2022
IDR: Self-Supervised Image Denoising via Iterative Data Refinement
CVPR 2022
Learning a Structured Latent Space for Unsupervised Point Cloud Completion
CVPR 2022
Frozen CLIP Models Are Efficient Video Learners
ECCV 2022
Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space
ECCV 2022
Learning Degradation Representations for Image Deblurring
ECCV 2022
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search
CVPR 2021
Fast Convergence of DETR With Spatially Modulated Co-Attention
ICCV 2021
Learning With Privileged Tasks
ICCV 2021
STAR: A Structure-Aware Lightweight Transformer for Real-Time Image Enhancement
ICCV 2021
Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation
ICLR 2021
Deformable DETR: Deformable Transformers for End-to-End Object Detection
ICLR 2021
Rethinking Noise Synthesis and Modeling in Raw Denoising
ICCV 2021
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
ICML 2021
Weakly Supervised Contrastive Learning
ICCV 2021
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
ICCV 2021
Voxel-Based Network for Shape Completion by Leveraging Edge Generation
ICCV 2021
LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-Based 3D Detector
ICCV 2021
ReSSL: Relational Self-Supervised Learning with Weak Augmentation
NIPS 2021
Learning Fine-Grained Segmentation of 3D Shapes Without Part Labels
CVPR 2021
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
CVPR 2021
Semantic Scene Completion via Integrating Instances and Scene In-the-Loop
CVPR 2021
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
CVPR 2021
Visually Informed Binaural Audio Generation without Binaural Audios
CVPR 2021
Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation
AAAI 2020
Channel Equilibrium Networks for Learning Deep Representation
ICML 2020
Cascaded Refinement Network for Point Cloud Completion
CVPR 2020
Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images
CVPR 2020
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching
CVPR 2020
Robust Superpixel-Guided Attentional Adversarial Attack
CVPR 2020
Revisiting the Sibling Head in Object Detector
CVPR 2020
Density-Aware Feature Embedding for Face Clustering
CVPR 2020
Search to Distill: Pearls Are Everywhere but Not the Eyes
CVPR 2020
3D Human Mesh Regression With Dense Correspondence
CVPR 2020
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
CVPR 2020
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
ECCV 2020
Adapting Object Detectors with Conditional Domain Normalization
ECCV 2020
Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation
ECCV 2020
PIE-NET: Parametric Inference of Point Cloud Edges
NIPS 2020
KPNet: Towards Minimal Face Detector
AAAI 2020
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations
CVPR 2019
Feature Intertwiner for Object Detection
ICLR 2019
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
AAAI 2019
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
NIPS 2019
Finding Task-Relevant Features for Few-Shot Learning by Category Traversal
CVPR 2019
SSN: Learning Sparse Switchable Normalization via SparsestMax
CVPR 2019
PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud
CVPR 2019
GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving
CVPR 2019
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
CVPR 2019
Semantics Disentangling for Text-To-Image Generation
CVPR 2019
Group-Wise Correlation Stereo Network
CVPR 2019
Video Generation From Single Semantic Label Map
CVPR 2019
DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images
CVPR 2019
Context and Attribute Grounded Dense Captioning
CVPR 2019
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering
CVPR 2019
Conditional Adversarial Generative Flow for Controllable Image Synthesis
CVPR 2019
Shape2Motion: Joint Analysis of Motion Parts and Attributes From 3D Shapes
CVPR 2019
P2SGrad: Refined Gradients for Optimizing Deep Face Models
CVPR 2019
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis
NIPS 2019
Gradient Harmonized Single-Stage Detector
AAAI 2019
Unsupervised Cross-Spectral Stereo Matching by Learning to Synthesize
AAAI 2019
Vision-Infused Deep Audio Inpainting
ICCV 2019
Interpolated Convolutional Networks for 3D Point Cloud Understanding
ICCV 2019
Differentiable Kernel Evolution
ICCV 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
ICCV 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
ICCV 2019
Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM
ICCV 2019
Deep Self-Learning From Noisy Labels
ICCV 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
ICCV 2019
Multi-Modality Latent Interaction Network for Visual Question Answering
ICCV 2019
Visual Question Generation as Dual Task of Visual Question Answering
CVPR 2018
Eliminating Background-Bias for Robust Person Re-Identification
CVPR 2018
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification
NIPS 2018
Neural Network Encapsulation
ECCV 2018
Transductive Centroid Projection for Semi-supervised Large-scale Recognition
ECCV 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
ECCV 2018
Question-Guided Hybrid Convolution for Visual Question Answering
ECCV 2018
Learning Monocular Depth by Distilling Cross-domain Stereo Networks
ECCV 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
ECCV 2018
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
ECCV 2018
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association
ECCV 2018
Person Re-identification with Deep Similarity-Guided Graph Neural Network
ECCV 2018
Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification
CVPR 2018
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing
CVPR 2018
FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis
CVPR 2018
Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding
CVPR 2018
Exploring Disentangled Feature Representation Beyond Face Identification
CVPR 2018
Deep Group-Shuffling Random Walk for Person Re-Identification
CVPR 2018
3D Human Pose Estimation in the Wild by Adversarial Learning
CVPR 2018
Decoupling the Layers in Residual Networks
ICLR 2018
Group Consistent Similarity Learning via Deep CRF for Person Re-Identification
CVPR 2018
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration
CVPR 2018
Context Encoding for Semantic Segmentation
CVPR 2018
End-to-End Deep Kronecker-Product Matching for Person Re-Identification
CVPR 2018
ViP-CNN: Visual Phrase Guided Convolutional Neural Network
CVPR 2017
Learning Object Interactions and Descriptions for Semantic Image Segmentation
CVPR 2017
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification
CVPR 2017
Learning Cross-Modal Deep Representations for Robust Pedestrian Detection
CVPR 2017
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
CVPR 2017
Joint Detection and Identification Feature Learning for Person Search
CVPR 2017
Residual Attention Network for Image Classification
CVPR 2017
Pyramid Scene Parsing Network
CVPR 2017
Person Search With Natural Language Description
CVPR 2017
Multi-Context Attention for Human Pose Estimation
CVPR 2017
Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion
CVPR 2017
Object Detection in Videos With Tubelet Proposal Networks
CVPR 2017
Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction
NIPS 2017
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
ICCV 2017
Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification
ICCV 2017
Recurrent Scale Approximation for Object Detection in CNN
ICCV 2017
Scene Graph Generation From Objects, Phrases and Region Captions
ICCV 2017
Learning Feature Pyramids for Human Pose Estimation
ICCV 2017
Identity-Aware Textual-Visual Matching With Latent Co-Attention
ICCV 2017
Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals
ICCV 2017
Chained Cascade Network for Object Detection
ICCV 2017
Deep Dual Learning for Semantic Image Segmentation
ICCV 2017
Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism
ICCV 2017
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
ICCV 2017
STCT: Sequentially Training Convolutional Networks for Visual Tracking
CVPR 2016
CRF-CNN: Modeling Structured Information in Human Pose Estimation
NIPS 2016
Multi-Bias Non-linear Activation in Deep Neural Networks
ICML 2016
Object Detection From Video Tubelets With Convolutional Neural Networks
CVPR 2016
Factors in Finetuning Deep Model for Object Detection With Long-Tail Distribution
CVPR 2016
End-To-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation
CVPR 2016
Structured Feature Learning for Pose Estimation
CVPR 2016
Sparsifying Neural Network Connections for Face Recognition
CVPR 2016
Slicing Convolutional Neural Network for Crowd Video Understanding
CVPR 2016
DeepFashion: Powering Robust Clothes Recognition and Retrieval With Rich Annotations
CVPR 2016
Learning Deep Feature Representations With Domain Guided Dropout for Person Re-Identification
CVPR 2016
Understanding Pedestrian Behaviors From Stationary Crowd Groups
CVPR 2015
Pedestrian Detection Aided by Deep Learning Semantic Tasks
CVPR 2015
Saliency Detection by Multi-Context Deep Learning
CVPR 2015
Cross-Scene Crowd Counting via Deep Convolutional Neural Networks
CVPR 2015
Video Matting via Sparse and Low-Rank Representation
ICCV 2015
Learning Deep Representation With Large-Scale Attributes
ICCV 2015
Deep Learning Strong Parts for Pedestrian Detection
ICCV 2015
Visual Tracking With Fully Convolutional Networks
ICCV 2015
Pedestrian Travel Time Estimation in Crowded Scenes
ICCV 2015
Deeply Learned Face Representations Are Sparse, Selective, and Robust
CVPR 2015
Deeply Learned Attributes for Crowded Scene Understanding
CVPR 2015
Multi-Task Recurrent Neural Network for Immediacy Prediction
ICCV 2015
Deep Learning Face Attributes in the Wild
ICCV 2015
Learning From Massive Noisy Labeled Data for Image Classification
CVPR 2015
DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
CVPR 2015
Scene-Independent Group Profiling in Crowd
CVPR 2014
L0 Regularized Stationary Time Estimation for Crowd Group Analysis
CVPR 2014
Deep Learning Face Representation by Joint Identification-Verification
NIPS 2014
Multi-source Deep Learning for Human Pose Estimation
CVPR 2014
Deep Learning Face Representation from Predicting 10,000 Classes
CVPR 2014
Switchable Deep Network for Pedestrian Detection
CVPR 2014
Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations
NIPS 2014
DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification
CVPR 2014
Learning Mid-level Filters for Person Re-identification
CVPR 2014
Dimensionality Reduction with Generalized Linear Models
IJCAI 2013
Measuring Crowd Collectiveness
CVPR 2013
Modeling Mutual Visibility Relationship in Pedestrian Detection
CVPR 2013
Single-Pedestrian Detection Aided by Multi-pedestrian Detection
CVPR 2013
Unsupervised Salience Learning for Person Re-identification
CVPR 2013
Locally Aligned Feature Transforms across Views
CVPR 2013
Deep Convolutional Network Cascade for Facial Point Detection
CVPR 2013
Hybrid Deep Learning for Face Verification
ICCV 2013
Multi-stage Contextual Deep Learning for Pedestrian Detection
ICCV 2013
Pedestrian Parsing via Deep Decompositional Network
ICCV 2013
Deep Learning Identity-Preserving Face Space
ICCV 2013
A Deep Sum-Product Architecture for Robust Facial Attributes Analysis
ICCV 2013
Person Re-identification by Salience Matching
ICCV 2013
Joint Deep Learning for Pedestrian Detection
ICCV 2013
Visual Semantic Complex Network for Web Images
ICCV 2013
Spatial Latent Dirichlet Allocation
NIPS 2007