Shoubin Yu
11 papers · 2023–2025 · 5 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Interdisciplinary Bridge
Taxonomy Completionist (19)
Keyword Pioneer
Conference Polyglot (5)
Renaissance Researcher (5)
Dynamic Duo (11)
Prolific Year (9)
Century Club (11)
Conferences
EMNLP (4)
ICLR (3)
CVPR (2)
ICCV (1)
NeurIPS (1)
Keywords
video understanding (3)
large language model (3)
multi-modal learning (2)
multimodal reasoning (2)
diffusion model (2)
video editing (2)
video generation (2)
video question answering (2)
video diffusion (2)
video reasoning (2)
multimodal learning (1)
hierarchical representation (1)
instruction following (1)
language model (1)
chain-of-thought reasoning (1)
video segmentation (1)
visual grounding (1)
visual reasoning (1)
visual question answering (1)
medical diagnosis (1)
Papers
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
CVPR 2025
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
CVPR 2025
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
ICLR 2025
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
ICLR 2025
RACCooN: Versatile Instructional Video Editing with Auto-Generated Narratives
EMNLP 2025
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
EMNLP 2025
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
EMNLP 2025
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
ICCV 2025
A Simple LLM Framework for Long-Range Video Question-Answering
EMNLP 2024
Self-Chained Image-Language Model for Video Localization and Question Answering
NeurIPS 2023