conftrace_

Xiaogang Wang

193 papers · 2007–2025 · 8 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+18 more ↓ 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird πŸ—ΊοΈ Taxonomy Completionist (13) πŸŒ‰ Interdisciplinary Bridge 🌍 Conference Polyglot (8)
🌍 Conference Polyglot (8) πŸ—ΊοΈ Taxonomy Completionist (13) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (17) 🏠 Conference Loyalist (102) πŸ† Keyword Champion 🀝 Dynamic Duo (69) πŸ† Grand Slam πŸ‘₯ Mega-Team (23) 🌱 Topic Pioneer πŸ‘‘ Triple Crown πŸ”¬ Deep Specialist (35) ⚑ Prolific Year (30) πŸ’Ž Century Club (193) πŸ“ˆ Trend Setter πŸš€ Conference Pioneer πŸ”₯ Unstoppable (13) πŸ—ƒοΈ Keyword Collector (661)

Conferences

CVPR (102) ICCV (45) ECCV (17) NIPS (13) AAAI (6) ICLR (6) ICML (3) IJCAI (1)

Papers

3D Dental Model Segmentation with Geometrical Boundary Preserving CVPR 2025 MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos ICCV 2025 SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding CVPR 2025 ConsistentCity: Semantic Flow-guided Occupancy DiT for Temporally Consistent Driving Scene Synthesis ICCV 2025 ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process ICLR 2024 FaceCom: Towards High-fidelity 3D Facial Shape Completion via Optimization and Inpainting Guidance CVPR 2024 Digital Life Project: Autonomous 3D Characters with Social Intelligence CVPR 2024 Cached Transformers: Improving Transformers with Differentiable Memory Cachde AAAI 2024 Phased Consistency Models NIPS 2024 Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft CVPR 2024 Real-Time Controllable Denoising for Image and Video CVPR 2023 InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions CVPR 2023 Siamese Image Modeling for Self-Supervised Vision Representation Learning CVPR 2023 Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information CVPR 2023 A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift CVPR 2023 A Unified Conditional Framework for Diffusion-based Image Restoration NIPS 2023 Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks CVPR 2023 ViTAS: Vision Transformer Architecture Search ECCV 2022 Pose for Everything: Towards Category-Agnostic Pose Estimation ECCV 2022 RNNPose: Recurrent 6-DoF Object Pose Refinement With Robust Correspondence Field Estimation and Pose Optimization CVPR 2022 Point2Seq: Detecting 3D Objects As Sequences CVPR 2022 GreedyNASv2: Greedier Search With a Greedy Path Filter CVPR 2022 Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer CVPR 2022 Dynamic Token Normalization improves Vision Transformers ICLR 2022 Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs NIPS 2022 IDR: Self-Supervised Image Denoising via Iterative Data Refinement CVPR 2022 Learning a Structured Latent Space for Unsupervised Point Cloud Completion CVPR 2022 Frozen CLIP Models Are Efficient Video Learners ECCV 2022 Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space ECCV 2022 Learning Degradation Representations for Image Deblurring ECCV 2022 ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search CVPR 2021 Fast Convergence of DETR With Spatially Modulated Co-Attention ICCV 2021 Learning With Privileged Tasks ICCV 2021 STAR: A Structure-Aware Lightweight Transformer for Real-Time Image Enhancement ICCV 2021 Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation ICLR 2021 Deformable DETR: Deformable Transformers for End-to-End Object Detection ICLR 2021 Rethinking Noise Synthesis and Modeling in Raw Denoising ICCV 2021 Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution ICML 2021 Weakly Supervised Contrastive Learning ICCV 2021 FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting ICCV 2021 Voxel-Based Network for Shape Completion by Leveraging Edge Generation ICCV 2021 LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-Based 3D Detector ICCV 2021 ReSSL: Relational Self-Supervised Learning with Weak Augmentation NIPS 2021 Learning Fine-Grained Segmentation of 3D Shapes Without Part Labels CVPR 2021 Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation CVPR 2021 Semantic Scene Completion via Integrating Instances and Scene In-the-Loop CVPR 2021 DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network CVPR 2021 Visually Informed Binaural Audio Generation without Binaural Audios CVPR 2021 Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation AAAI 2020 Channel Equilibrium Networks for Learning Deep Representation ICML 2020 Cascaded Refinement Network for Point Cloud Completion CVPR 2020 Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images CVPR 2020 StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching CVPR 2020 Robust Superpixel-Guided Attentional Adversarial Attack CVPR 2020 Revisiting the Sibling Head in Object Detector CVPR 2020 Density-Aware Feature Embedding for Face Clustering CVPR 2020 Search to Distill: Pearls Are Everywhere but Not the Eyes CVPR 2020 3D Human Mesh Regression With Dense Correspondence CVPR 2020 PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection CVPR 2020 Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions ECCV 2020 Adapting Object Detectors with Conditional Domain Normalization ECCV 2020 Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation ECCV 2020 PIE-NET: Parametric Inference of Point Cloud Edges NIPS 2020 KPNet: Towards Minimal Face Detector AAAI 2020 AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations CVPR 2019 Feature Intertwiner for Object Detection ICLR 2019 Talking Face Generation by Adversarially Disentangled Audio-Visual Representation AAAI 2019 PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph NIPS 2019 Finding Task-Relevant Features for Few-Shot Learning by Category Traversal CVPR 2019 SSN: Learning Sparse Switchable Normalization via SparsestMax CVPR 2019 PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud CVPR 2019 GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving CVPR 2019 Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing CVPR 2019 Semantics Disentangling for Text-To-Image Generation CVPR 2019 Group-Wise Correlation Stereo Network CVPR 2019 Video Generation From Single Semantic Label Map CVPR 2019 DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images CVPR 2019 Context and Attribute Grounded Dense Captioning CVPR 2019 Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering CVPR 2019 Conditional Adversarial Generative Flow for Controllable Image Synthesis CVPR 2019 Shape2Motion: Joint Analysis of Motion Parts and Attributes From 3D Shapes CVPR 2019 P2SGrad: Refined Gradients for Optimizing Deep Face Models CVPR 2019 Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis NIPS 2019 Gradient Harmonized Single-Stage Detector AAAI 2019 Unsupervised Cross-Spectral Stereo Matching by Learning to Synthesize AAAI 2019 Vision-Infused Deep Audio Inpainting ICCV 2019 Interpolated Convolutional Networks for 3D Point Cloud Understanding ICCV 2019 Differentiable Kernel Evolution ICCV 2019 Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once ICCV 2019 Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks ICCV 2019 Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM ICCV 2019 Deep Self-Learning From Noisy Labels ICCV 2019 CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval ICCV 2019 Multi-Modality Latent Interaction Network for Visual Question Answering ICCV 2019 Visual Question Generation as Dual Task of Visual Question Answering CVPR 2018 Eliminating Background-Bias for Robust Person Re-Identification CVPR 2018 FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification NIPS 2018 Neural Network Encapsulation ECCV 2018 Transductive Centroid Projection for Semi-supervised Large-scale Recognition ECCV 2018 Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data ECCV 2018 Question-Guided Hybrid Convolution for Visual Question Answering ECCV 2018 Learning Monocular Depth by Distilling Cross-domain Stereo Networks ECCV 2018 Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition ECCV 2018 Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation ECCV 2018 Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association ECCV 2018 Person Re-identification with Deep Similarity-Guided Graph Neural Network ECCV 2018 Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification CVPR 2018 PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing CVPR 2018 FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis CVPR 2018 Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding CVPR 2018 Exploring Disentangled Feature Representation Beyond Face Identification CVPR 2018 Deep Group-Shuffling Random Walk for Person Re-Identification CVPR 2018 3D Human Pose Estimation in the Wild by Adversarial Learning CVPR 2018 Decoupling the Layers in Residual Networks ICLR 2018 Group Consistent Similarity Learning via Deep CRF for Person Re-Identification CVPR 2018 Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration CVPR 2018 Context Encoding for Semantic Segmentation CVPR 2018 End-to-End Deep Kronecker-Product Matching for Person Re-Identification CVPR 2018 ViP-CNN: Visual Phrase Guided Convolutional Neural Network CVPR 2017 Learning Object Interactions and Descriptions for Semantic Image Segmentation CVPR 2017 Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification CVPR 2017 Learning Cross-Modal Deep Representations for Robust Pedestrian Detection CVPR 2017 Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation CVPR 2017 Joint Detection and Identification Feature Learning for Person Search CVPR 2017 Residual Attention Network for Image Classification CVPR 2017 Pyramid Scene Parsing Network CVPR 2017 Person Search With Natural Language Description CVPR 2017 Multi-Context Attention for Human Pose Estimation CVPR 2017 Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion CVPR 2017 Object Detection in Videos With Tubelet Proposal Networks CVPR 2017 Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction NIPS 2017 HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis ICCV 2017 Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification ICCV 2017 Recurrent Scale Approximation for Object Detection in CNN ICCV 2017 Scene Graph Generation From Objects, Phrases and Region Captions ICCV 2017 Learning Feature Pyramids for Human Pose Estimation ICCV 2017 Identity-Aware Textual-Visual Matching With Latent Co-Attention ICCV 2017 Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals ICCV 2017 Chained Cascade Network for Object Detection ICCV 2017 Deep Dual Learning for Semantic Image Segmentation ICCV 2017 Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism ICCV 2017 StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks ICCV 2017 STCT: Sequentially Training Convolutional Networks for Visual Tracking CVPR 2016 CRF-CNN: Modeling Structured Information in Human Pose Estimation NIPS 2016 Multi-Bias Non-linear Activation in Deep Neural Networks ICML 2016 Object Detection From Video Tubelets With Convolutional Neural Networks CVPR 2016 Factors in Finetuning Deep Model for Object Detection With Long-Tail Distribution CVPR 2016 End-To-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation CVPR 2016 Structured Feature Learning for Pose Estimation CVPR 2016 Sparsifying Neural Network Connections for Face Recognition CVPR 2016 Slicing Convolutional Neural Network for Crowd Video Understanding CVPR 2016 DeepFashion: Powering Robust Clothes Recognition and Retrieval With Rich Annotations CVPR 2016 Learning Deep Feature Representations With Domain Guided Dropout for Person Re-Identification CVPR 2016 Understanding Pedestrian Behaviors From Stationary Crowd Groups CVPR 2015 Pedestrian Detection Aided by Deep Learning Semantic Tasks CVPR 2015 Saliency Detection by Multi-Context Deep Learning CVPR 2015 Cross-Scene Crowd Counting via Deep Convolutional Neural Networks CVPR 2015 Video Matting via Sparse and Low-Rank Representation ICCV 2015 Learning Deep Representation With Large-Scale Attributes ICCV 2015 Deep Learning Strong Parts for Pedestrian Detection ICCV 2015 Visual Tracking With Fully Convolutional Networks ICCV 2015 Pedestrian Travel Time Estimation in Crowded Scenes ICCV 2015 Deeply Learned Face Representations Are Sparse, Selective, and Robust CVPR 2015 Deeply Learned Attributes for Crowded Scene Understanding CVPR 2015 Multi-Task Recurrent Neural Network for Immediacy Prediction ICCV 2015 Deep Learning Face Attributes in the Wild ICCV 2015 Learning From Massive Noisy Labeled Data for Image Classification CVPR 2015 DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection CVPR 2015 Scene-Independent Group Profiling in Crowd CVPR 2014 L0 Regularized Stationary Time Estimation for Crowd Group Analysis CVPR 2014 Deep Learning Face Representation by Joint Identification-Verification NIPS 2014 Multi-source Deep Learning for Human Pose Estimation CVPR 2014 Deep Learning Face Representation from Predicting 10,000 Classes CVPR 2014 Switchable Deep Network for Pedestrian Detection CVPR 2014 Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations NIPS 2014 DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification CVPR 2014 Learning Mid-level Filters for Person Re-identification CVPR 2014 Dimensionality Reduction with Generalized Linear Models IJCAI 2013 Measuring Crowd Collectiveness CVPR 2013 Modeling Mutual Visibility Relationship in Pedestrian Detection CVPR 2013 Single-Pedestrian Detection Aided by Multi-pedestrian Detection CVPR 2013 Unsupervised Salience Learning for Person Re-identification CVPR 2013 Locally Aligned Feature Transforms across Views CVPR 2013 Deep Convolutional Network Cascade for Facial Point Detection CVPR 2013 Hybrid Deep Learning for Face Verification ICCV 2013 Multi-stage Contextual Deep Learning for Pedestrian Detection ICCV 2013 Pedestrian Parsing via Deep Decompositional Network ICCV 2013 Deep Learning Identity-Preserving Face Space ICCV 2013 A Deep Sum-Product Architecture for Robust Facial Attributes Analysis ICCV 2013 Person Re-identification by Salience Matching ICCV 2013 Joint Deep Learning for Pedestrian Detection ICCV 2013 Visual Semantic Complex Network for Web Images ICCV 2013 Spatial Latent Dirichlet Allocation NIPS 2007