Shiwei Wu
4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (3) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
CVPR (2)
ACL (1)
NIPS (1)
Top co-authors
Keywords
multimodal learning
(2)
video understanding
(2)
vision-language model
(2)
text summarization
(1)
efficient computing
(1)
real-time processing
(1)
multimodal model
(1)
video-language model
(1)
graphical user interface
(1)
entity recognition
(1)
multimodal summarization
(1)
cross-modal correlation
(1)
vision token
(1)
streaming video
(1)
visual agent
(1)
large language model
(1)
image selection
(1)
mixture of depth
(1)
entity-guided learning
(1)
visual text integration
(1)
Papers
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
CVPR 2025
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
NIPS 2024
Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization
ACL 2024
VideoLLM-online: Online Video Large Language Model for Streaming Video
CVPR 2024