Shaofei Huang

12 papers · 2020–2025 · 6 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🗺️ Taxonomy Completionist (31)

🗺️ Taxonomy Completionist (31) 🧭 Keyword Pioneer 🤝 Dynamic Duo (11) 🌱 Topic Pioneer 💎 Century Club (12) 🗃️ Keyword Collector (64) 📈 Trend Setter 🔥 Unstoppable (6) 🚀 Conference Pioneer

Conferences

CVPR (7) AAAI (1) ECCV (1) ICCV (1) IJCAI (1) MICCAI (1)

Top co-authors

Si Liu (11) Tianrui Hui (8) Jizhong Han (7) Guanbin Li (5) Hongyu Li (3) Jiao Dai (3) Rui Ling (2) Luoqi Liu (2) Fei Wang (2) Xiaoming Wei (2)

Keywords

semantic segmentation (3) multimodal learning (3) bird's eye view (2) video understanding (2) object detection (1) self-supervised learning (1) feature extraction (1) lane detection (1) sound source localization (1) autonomous driving (1) audio-visual learning (1) depth estimation (1) natural language understanding (1) instance segmentation (1) gaussian splatting (1) knowledge distillation (1) video segmentation (1) sound localization (1) natural language (1) image generation (1)

Papers

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation AAAI 2025 Revisiting Audio-Visual Segmentation with Vision-Centric Transformer CVPR 2025 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding CVPR 2025 Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization ICCV 2025 Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training CVPR 2024 Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation MICCAI 2024 Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection CVPR 2023 Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation IJCAI 2023 A Keypoint-Based Global Association Network for Lane Detection CVPR 2022 Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation CVPR 2021 Referring Image Segmentation via Cross-Modal Progressive Comprehension CVPR 2020 Linguistic Structure Guided Context Modeling for Referring Image Segmentation ECCV 2020