Xiaoming Wei
21 papers · 2021–2026 · 6 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Renaissance Researcher
(5)
Academic Marathon
(5)
Taxonomy Completionist
(61)
Keyword Pioneer
Hot Topic Early Bird
Conference Polyglot
(6)
Dynamic Duo
(10)
Topic Evolution
Prolific Year
(6)
Century Club
(20)
Unstoppable
(5)
Keyword Collector
(110)
Conferences
CVPR (10)
AAAI (5)
ICLR (2)
IJCAI (2)
ECCV (1)
ICCV (1)
Keywords
multimodal learning
(3)
video object segmentation
(2)
semantic segmentation
(2)
attention mechanism
(2)
vision transformer
(2)
natural language generation
(1)
weakly supervised learning
(1)
network architecture
(1)
visual question answering
(1)
video prediction
(1)
semi-supervised learning
(1)
uncertainty quantification
(1)
image captioning
(1)
image retrieval
(1)
metric learning
(1)
lane detection
(1)
autonomous driving
(1)
video understanding
(1)
referring expression
(1)
representation learning
(1)
Papers
ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal Diffusion
AAAI 2026
ARIG: Autoregressive Interactive Head Generation for Real-time Conversations
ICCV 2025
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
CVPR 2025
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
AAAI 2025
Denoising with a Joint-Embedding Predictive Architecture
ICLR 2025
BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning
CVPR 2024
Real3D: The Curious Case of Neural Scene Degeneration
AAAI 2024
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
CVPR 2024
Animating General Image with Large Visual Motion Model
CVPR 2024
Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond
CVPR 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
IJCAI 2023
Uncertainty-Aware Image Captioning
AAAI 2023
Rethinking skip connection model as a learnable Markov chain
ICLR 2023
Bridging Search Region Interaction With Template for RGB-T Tracking
CVPR 2023
Elastic Aggregation for Federated Optimization
CVPR 2023
Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough
AAAI 2022
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
CVPR 2022
Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation
ECCV 2022
Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation
CVPR 2021
Structure Guided Lane Detection
IJCAI 2021
Rethinking BiSeNet for Real-Time Semantic Segmentation
CVPR 2021