Xiaohui Shen

67 papers · 2013–2026 · 6 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (13) 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (81)

🌍 Conference Polyglot (6) 🏃 Academic Marathon (13) 🗺️ Taxonomy Completionist (81) 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (30) 🔬 Deep Specialist (10) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🤝 Dynamic Duo (32) ⚡ Prolific Year (8) 💎 Century Club (67) 🗃️ Keyword Collector (268) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (14)

Conferences

CVPR (30) ICCV (18) ECCV (8) NIPS (6) WACV (4) ICML (1)

Top co-authors

Zhe Lin (32) Xiaodan Liang (11) Radomir Mech (10) Liang-Chieh Chen (10) Qihang Yu (10) Shuicheng Yan (9) Xin Lu (8) Jimei Yang (7) Brian Price (7) Jiashi Feng (7)

Keywords

convolutional neural network (15) semantic segmentation (9) generative model (6) human parsing (6) salient object detection (4) object detection (4) generative adversarial network (4) image editing (3) representation learning (3) image generation (3) neural network (3) face detection (3) image inpainting (3) transformer architecture (3) instance segmentation (2) visual recognition (2) transfer learning (2) visual attention (2) contrastive learning (2) face recognition (2)

Papers

LVM-Lite: Training Large Vision Models with Efficient Sequential Modeling WACV 2026 FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching ICML 2025 Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation ICCV 2025 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens ICCV 2025 D-Attn: Decomposed Attention for Large Vision-and-Language Model ICCV 2025 Randomized Autoregressive Visual Generation ICCV 2025 COCONut: Modernizing COCO Segmentation CVPR 2024 Towards Open-Ended Visual Recognition with Large Language Models ECCV 2024 ViTamin: Designing Scalable Vision Models in the Vision-Language Era CVPR 2024 An Image is Worth 32 Tokens for Reconstruction and Generation NIPS 2024 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization NIPS 2024 MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval CVPR 2024 R2Former: Unified Retrieval and Reranking Transformer for Place Recognition CVPR 2023 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP NIPS 2023 Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis WACV 2022 Video Salient Object Detection via Contrastive Features and Attention Modules WACV 2022 SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing CVPR 2022 A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder ICCV 2021 Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses ECCV 2020 Fashion Editing With Adversarial Parsing Learning CVPR 2020 Learning Progressive Joint Propagation for Human Motion Prediction ECCV 2020 Video Object Detection via Object-level Temporal Aggregation ECCV 2020 Best Frame Selection in a Short Video WACV 2020 Towards Multi-Pose Guided Virtual Try-On Network ICCV 2019 Free-Form Image Inpainting With Gated Convolution ICCV 2019 FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On ICCV 2019 Semantic Component Decomposition for Face Attribute Manipulation CVPR 2019 Graphonomy: Universal Human Parsing via Graph Transfer Learning CVPR 2019 Towards Interpretable Face Recognition ICCV 2019 Sequence-to-Segment Networks for Segment Detection NIPS 2018 Learning to Blend Photos ECCV 2018 Concept Mask: Large-Scale Segmentation from Semantic Concepts ECCV 2018 Compositing-aware Image Search ECCV 2018 A Modulation Module for Multi-task Learning with Applications in Image Retrieval ECCV 2018 Good View Hunting: Learning Photo Composition From Dense View Pairs CVPR 2018 Generative Image Inpainting With Contextual Attention CVPR 2018 Learning to Understand Image Blur CVPR 2018 MAttNet: Modular Attention Network for Referring Expression Comprehension CVPR 2018 Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition CVPR 2017 Predicting Scene Parsing and Motion Dynamics in the Future NIPS 2017 Personalized Image Aesthetics ICCV 2017 FoveaNet: Perspective-Aware Urban Scene Parsing ICCV 2017 Recurrent Multimodal Interaction for Referring Image Segmentation ICCV 2017 Scene Parsing With Global Context Embedding ICCV 2017 Video Scene Parsing With Predictive Feature Learning ICCV 2017 Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing CVPR 2017 Interpretable Structure-Evolving LSTM CVPR 2017 Deep Image Harmonization CVPR 2017 Event-Specific Image Importance CVPR 2016 Reversible Recursive Instance-Level Object Segmentation CVPR 2016 A Multi-Level Contextual Model For Person Recognition in Photo Albums CVPR 2016 SURGE: Surface Regularized Geometry Estimation from a Single Image NIPS 2016 Unconstrained Salient Object Detection via Proposal Subset Optimization CVPR 2016 Shortlist Selection With Residual-Aware Distance Estimator for K-Nearest Neighbor Search CVPR 2016 Automatic Content-Aware Color and Tone Stylization CVPR 2016 Semantic Object Parsing With Local-Global Long Short-Term Memory CVPR 2016 Joint Object and Part Segmentation Using Deep Learned Potentials ICCV 2015 Matching-CNN Meets KNN: Quasi-Parametric Human Parsing CVPR 2015 A Convolutional Neural Network Cascade for Face Detection CVPR 2015 Salient Object Subitizing CVPR 2015 Human Parsing With Contextualized Convolutional Neural Network ICCV 2015 Minimum Barrier Salient Object Detection at 80 FPS ICCV 2015 Towards Unified Depth and Semantic Prediction From a Single Image CVPR 2015 Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation ICCV 2015 Towards Unified Human Parsing and Pose Estimation CVPR 2014 Efficient Boosted Exemplar-based Face Detection CVPR 2014 Detecting and Aligning Faces by Image Retrieval CVPR 2013