Xiaoming Wei

21 papers · 2021–2026 · 6 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🗺️ Taxonomy Completionist (61)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (6) 🤝 Dynamic Duo (10) 🧬 Topic Evolution ⚡ Prolific Year (6) 💎 Century Club (20) 🔥 Unstoppable (5) 🗃️ Keyword Collector (110)

Conferences

CVPR (10) AAAI (5) ICLR (2) IJCAI (2) ECCV (1) ICCV (1)

Top co-authors

Xiaolin Wei (10) Junshi Huang (8) Si Liu (5) Dengsheng Chen (5) Tianrui Hui (5) Jie Hu (5) Enhua Wu (4) Jizhong Han (4) Mingyuan Fan (3) Jiao Dai (2)

Keywords

multimodal learning (3) video object segmentation (2) semantic segmentation (2) attention mechanism (2) vision transformer (2) natural language generation (1) weakly supervised learning (1) network architecture (1) visual question answering (1) video prediction (1) semi-supervised learning (1) uncertainty quantification (1) image captioning (1) image retrieval (1) metric learning (1) lane detection (1) autonomous driving (1) video understanding (1) referring expression (1) representation learning (1)

Papers

ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal Diffusion AAAI 2026 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations ICCV 2025 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding CVPR 2025 Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation AAAI 2025 Denoising with a Joint-Embedding Predictive Architecture ICLR 2025 BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning CVPR 2024 Real3D: The Curious Case of Neural Scene Degeneration AAAI 2024 ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting CVPR 2024 Animating General Image with Large Visual Motion Model CVPR 2024 Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond CVPR 2023 Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding IJCAI 2023 Uncertainty-Aware Image Captioning AAAI 2023 Rethinking skip connection model as a learnable Markov chain ICLR 2023 Bridging Search Region Interaction With Template for RGB-T Tracking CVPR 2023 Elastic Aggregation for Federated Optimization CVPR 2023 Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough AAAI 2022 Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation CVPR 2022 Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation ECCV 2022 Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation CVPR 2021 Structure Guided Lane Detection IJCAI 2021 Rethinking BiSeNet for Real-Time Semantic Segmentation CVPR 2021