conftrace_

Shoubin Yu

11 papers · 2023–2025 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+4 more ↓

🐝 Cross-Pollinator (15) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (19) 🧭 Keyword Pioneer 🌍 Conference Polyglot (5)

🌈 Renaissance Researcher (5) 🤝 Dynamic Duo (11) ⚡ Prolific Year (9) 💎 Century Club (11)

Conferences

EMNLP (4) ICLR (3) CVPR (2) ICCV (1) NIPS (1)

Top co-authors

Mohit Bansal (11) Jaehong Yoon (6) Ziyang Wang (4) Gedas Bertasius (3) Md Mohaiminul Islam (2) Yicong Hong (2) Chen Chen (1) Jaemin Cho (1) Limin Wang (1) Yu Qiao (1)

Keywords

video understanding (3) large language model (3) multi-modal learning (2) multimodal reasoning (2) diffusion model (2) video editing (2) video generation (2) video question answering (2) video diffusion (2) video reasoning (2) multimodal learning (1) hierarchical representation (1) instruction following (1) language model (1) chain-of-thought reasoning (1) video segmentation (1) visual grounding (1) visual reasoning (1) visual question answering (1) medical diagnosis (1)

Papers

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel ICLR 2025 Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level CVPR 2025 VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos CVPR 2025 CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion ICLR 2025 SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation ICLR 2025 RACCooN: Versatile Instructional Video Editing with Auto-Generated Narratives EMNLP 2025 Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning EMNLP 2025 MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation EMNLP 2025 VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ICCV 2025 A Simple LLM Framework for Long-Range Video Question-Answering EMNLP 2024 Self-Chained Image-Language Model for Video Localization and Question Answering NIPS 2023