Shaofei Huang
12 papers · 2020–2025 · 6 top CS/AI conferences
Achievements
Renaissance Researcher (5)
Interdisciplinary Bridge
Conference Polyglot (6)
Academic Marathon (5)
Taxonomy Completionist (31)
Keyword Pioneer
Dynamic Duo (11)
Topic Pioneer
Century Club (12)
Keyword Collector (64)
Trend Setter
Unstoppable (6)
Conference Pioneer
Conferences
CVPR (7)
AAAI (1)
ECCV (1)
ICCV (1)
IJCAI (1)
MICCAI (1)
Keywords
semantic segmentation (3)
multimodal learning (3)
bird's eye view (2)
video understanding (2)
object detection (1)
self-supervised learning (1)
feature extraction (1)
lane detection (1)
sound source localization (1)
autonomous driving (1)
audio-visual learning (1)
depth estimation (1)
natural language understanding (1)
instance segmentation (1)
gaussian splatting (1)
knowledge distillation (1)
video segmentation (1)
sound localization (1)
natural language (1)
image generation (1)
Papers
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
AAAI 2025
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
CVPR 2025
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
ICCV 2025
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
CVPR 2024
Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation
MICCAI 2024
Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection
CVPR 2023
Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
IJCAI 2023
A Keypoint-Based Global Association Network for Lane Detection
CVPR 2022
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
CVPR 2021
Referring Image Segmentation via Cross-Modal Progressive Comprehension
CVPR 2020
Linguistic Structure Guided Context Modeling for Referring Image Segmentation
ECCV 2020