Xinggang Wang
81 papers · 2011–2026 · 9 conferences · across top CS/AI conferences
Achievements
Academic Marathon (14)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (9)
Cross-Pollinator (10)
Renaissance Researcher (6)
Taxonomy Completionist (88)
Conference Loyalist (29)
Topic Evolution
Grand Slam
Dynamic Duo (50)
Keyword Champion
Deep Specialist (20)
Conference Pioneer
Keyword Collector (289)
Century Club (75)
Unstoppable (11)
Trend Setter
Prolific Year (13)
Conferences
CVPR (29)
ECCV (13)
ICCV (13)
AAAI (11)
ICLR (7)
NIPS (4)
ICML (2)
ACML (1)
CORL (1)
Research topics
Keywords
object detection (12)
vision transformer (9)
instance segmentation (7)
image classification (6)
semantic segmentation (5)
vision-language model (4)
knowledge distillation (4)
diffusion model (4)
convolutional neural network (4)
generative model (4)
model compression (4)
query-based detection (4)
transformer architecture (4)
attention mechanism (4)
video instance segmentation (4)
weakly supervised learning (3)
transfer learning (3)
image generation (3)
point cloud (3)
deep neural network (3)
Papers
LENS: Learning to Segment Anything with Unified Reinforced Reasoning
AAAI 2026
Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
AAAI 2026
MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning
AAAI 2026
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
AAAI 2026
Gait Recognition via Collaborating Discriminative and Generative Diffusion Models
AAAI 2026
Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids
AAAI 2026
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
ICCV 2025
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation
CVPR 2025
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
CVPR 2025
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
CVPR 2025
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
AAAI 2025
GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
AAAI 2025
ControlAR: Controllable Image Generation with Autoregressive Models
ICLR 2025
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
ICLR 2025
MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling
ICCV 2025
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
CVPR 2025
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
CVPR 2025
Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice
ECCV 2024
Occupancy as Set of Points
ECCV 2024
Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition
ECCV 2024
Visual Text Generation in the Wild
ECCV 2024
Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection
ECCV 2024
Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction
ECCV 2024
Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views
ECCV 2024
YOLO-World: Real-Time Open-Vocabulary Object Detection
CVPR 2024
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
CVPR 2024
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
ICML 2024
MobileInst: Video Instance Segmentation on the Mobile
AAAI 2024
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
NIPS 2024
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
CVPR 2024
Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
CVPR 2024
MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
ICLR 2023
Circuit as Set of Points
NIPS 2023
PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
CVPR 2023
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
CVPR 2023
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
CVPR 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
CVPR 2023
Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt
CVPR 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
ICCV 2023
Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation
ICCV 2023
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
ICCV 2023
VAD: Vectorized Scene Representation for Efficient Autonomous Driving
ICCV 2023
Graph Contrastive Learning for Skeleton-based Action Recognition
ICLR 2023
Corrupted Image Modeling for Self-Supervised Visual Pre-Training
ICLR 2023
Robust Multi-Object Tracking by Marginal Inference
ECCV 2022
AiATrack: Attention in Attention for Transformer Visual Tracking
ECCV 2022
Sparse Instance Activation for Real-Time Instance Segmentation
CVPR 2022
AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
CVPR 2022
MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens
CVPR 2022
Bag of Instances Aggregation Boosts Self-supervised Distillation
ICLR 2022
Vision-based Uneven BEV Representation Learning with Polar Rasterization and Surface Estimation
CORL 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
CVPR 2022
Temporally Efficient Vision Transformer for Video Instance Segmentation
CVPR 2022
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
ECCV 2022
Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images
CVPR 2021
Instances As Queries
ICCV 2021
Context-Sensitive Temporal Feature Learning for Gait Recognition
ICCV 2021
Crossover Learning for Fast Online Video Instance Segmentation
ICCV 2021
Hierarchical Aggregation for 3D Instance Segmentation
ICCV 2021
You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection
NIPS 2021
Human De-Occlusion: Invisible Perception and Recovery for Humans
CVPR 2021
Diversity Transfer Network for Few-Shot Learning
AAAI 2020
Fast Neural Network Adaptation via Parameter Remapping and Architecture Search
ICLR 2020
Boundary-preserving Mask R-CNN
ECCV 2020
Densely Connected Search Space for More Flexible Neural Architecture Search
CVPR 2020
Direct Object Recognition Without Line-Of-Sight Using Optical Coherence
CVPR 2019
Detect or Track: Towards Cost-Effective Video Object Detection/Tracking
AAAI 2019
Mask Scoring R-CNN
CVPR 2019
RENAS: Reinforced Evolutionary Neural Architecture Search
CVPR 2019
CCNet: Criss-Cross Attention for Semantic Segmentation
ICCV 2019
Deep Multi-instance Learning with Dynamic Pooling
ACML 2018
Weakly Supervised Region Proposal Network and Object Detection
ECCV 2018
Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing
CVPR 2018
Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identification
ECCV 2018
Object-Level Proposals
ICCV 2017
Multiple Instance Detection Network With Online Instance Classifier Refinement
CVPR 2017
Robust Scene Text Recognition With Automatic Rectification
CVPR 2016
DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection
CVPR 2015
Relaxed Multiple-Instance SVM With Application to Object Discovery
ICCV 2015
Max-Margin Multiple-Instance Dictionary Learning
ICML 2013
Maximal Cliques that Satisfy Hard Constraints with Application to Deformable Object Model Learning
NIPS 2011