Xiaohui Shen
67 papers · 2013–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (13) π Conference Polyglot (6) π Renaissance Researcher (8) πΊοΈ Taxonomy Completionist (81)
π
Conference Polyglot
(6)
π
Academic Marathon
(13)
πΊοΈ
Taxonomy Completionist
(81)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(30)
π¬
Deep Specialist
(10)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π€
Dynamic Duo
(32)
β‘
Prolific Year
(8)
π
Century Club
(67)
ποΈ
Keyword Collector
(268)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(14)
Conferences
CVPR (30)
ICCV (18)
ECCV (8)
NIPS (6)
WACV (4)
ICML (1)
Top co-authors
Keywords
convolutional neural network
(15)
semantic segmentation
(9)
generative model
(6)
human parsing
(6)
salient object detection
(4)
object detection
(4)
generative adversarial network
(4)
image editing
(3)
representation learning
(3)
image generation
(3)
neural network
(3)
face detection
(3)
image inpainting
(3)
transformer architecture
(3)
instance segmentation
(2)
visual recognition
(2)
transfer learning
(2)
visual attention
(2)
contrastive learning
(2)
face recognition
(2)
Papers
LVM-Lite: Training Large Vision Models with Efficient Sequential Modeling
WACV 2026
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching
ICML 2025
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation
ICCV 2025
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
ICCV 2025
D-Attn: Decomposed Attention for Large Vision-and-Language Model
ICCV 2025
Randomized Autoregressive Visual Generation
ICCV 2025
COCONut: Modernizing COCO Segmentation
CVPR 2024
Towards Open-Ended Visual Recognition with Large Language Models
ECCV 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
CVPR 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
NIPS 2024
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization
NIPS 2024
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval
CVPR 2024
R2Former: Unified Retrieval and Reranking Transformer for Place Recognition
CVPR 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
NIPS 2023
Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis
WACV 2022
Video Salient Object Detection via Contrastive Features and Attention Modules
WACV 2022
SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
CVPR 2022
A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder
ICCV 2021
Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses
ECCV 2020
Fashion Editing With Adversarial Parsing Learning
CVPR 2020
Learning Progressive Joint Propagation for Human Motion Prediction
ECCV 2020
Video Object Detection via Object-level Temporal Aggregation
ECCV 2020
Best Frame Selection in a Short Video
WACV 2020
Towards Multi-Pose Guided Virtual Try-On Network
ICCV 2019
Free-Form Image Inpainting With Gated Convolution
ICCV 2019
FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On
ICCV 2019
Semantic Component Decomposition for Face Attribute Manipulation
CVPR 2019
Graphonomy: Universal Human Parsing via Graph Transfer Learning
CVPR 2019
Towards Interpretable Face Recognition
ICCV 2019
Sequence-to-Segment Networks for Segment Detection
NIPS 2018
Learning to Blend Photos
ECCV 2018
Concept Mask: Large-Scale Segmentation from Semantic Concepts
ECCV 2018
Compositing-aware Image Search
ECCV 2018
A Modulation Module for Multi-task Learning with Applications in Image Retrieval
ECCV 2018
Good View Hunting: Learning Photo Composition From Dense View Pairs
CVPR 2018
Generative Image Inpainting With Contextual Attention
CVPR 2018
Learning to Understand Image Blur
CVPR 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
CVPR 2018
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition
CVPR 2017
Predicting Scene Parsing and Motion Dynamics in the Future
NIPS 2017
Personalized Image Aesthetics
ICCV 2017
FoveaNet: Perspective-Aware Urban Scene Parsing
ICCV 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
ICCV 2017
Scene Parsing With Global Context Embedding
ICCV 2017
Video Scene Parsing With Predictive Feature Learning
ICCV 2017
Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing
CVPR 2017
Interpretable Structure-Evolving LSTM
CVPR 2017
Deep Image Harmonization
CVPR 2017
Event-Specific Image Importance
CVPR 2016
Reversible Recursive Instance-Level Object Segmentation
CVPR 2016
A Multi-Level Contextual Model For Person Recognition in Photo Albums
CVPR 2016
SURGE: Surface Regularized Geometry Estimation from a Single Image
NIPS 2016
Unconstrained Salient Object Detection via Proposal Subset Optimization
CVPR 2016
Shortlist Selection With Residual-Aware Distance Estimator for K-Nearest Neighbor Search
CVPR 2016
Automatic Content-Aware Color and Tone Stylization
CVPR 2016
Semantic Object Parsing With Local-Global Long Short-Term Memory
CVPR 2016
Joint Object and Part Segmentation Using Deep Learned Potentials
ICCV 2015
Matching-CNN Meets KNN: Quasi-Parametric Human Parsing
CVPR 2015
A Convolutional Neural Network Cascade for Face Detection
CVPR 2015
Salient Object Subitizing
CVPR 2015
Human Parsing With Contextualized Convolutional Neural Network
ICCV 2015
Minimum Barrier Salient Object Detection at 80 FPS
ICCV 2015
Towards Unified Depth and Semantic Prediction From a Single Image
CVPR 2015
Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
ICCV 2015
Towards Unified Human Parsing and Pose Estimation
CVPR 2014
Efficient Boosted Exemplar-based Face Detection
CVPR 2014
Detecting and Aligning Faces by Image Retrieval
CVPR 2013