Junbin Xiao

18 papers · 2020–2025 · 7 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🏃 Academic Marathon (5) 🌍 Conference Polyglot (7) 🗺️ Taxonomy Completionist (31)

🏃 Academic Marathon (5) 🗺️ Taxonomy Completionist (31) 🌈 Renaissance Researcher (7) 🔬 Deep Specialist (10) 🤝 Dynamic Duo (13) 🏆 Keyword Champion (3) ❓ The Questioner 💎 Century Club (18) 🗃️ Keyword Collector (75) ⚡ Prolific Year (6) 🔥 Unstoppable (6)

Conferences

CVPR (7) ICCV (4) AAAI (2) ECCV (2) ACL (1) EMNLP (1) MICCAI (1)

Top co-authors

Tat-Seng Chua (13) Yicong Li (10) Angela Yao (8) Xiang Wang (4) Wei Ji (4) Jianru Xue (2) Chen Lv (2) Lei-lei Li (2) Hongkai Yu (2) Jianwu Fang (2)

Keywords

multimodal learning (7) video question answering (7) video understanding (6) temporal grounding (3) egocentric vision (3) temporal reasoning (2) visual question answering (2) video diffusion (2) cross-modal interaction (2) visual grounding (2) causal inference (2) vision-language model (2) affordance segmentation (2) causal reasoning (2) social media analysis (1) self-supervised learning (1) video classification (1) domain generalization (1) contrastive learning (1) video synthesis (1)

Papers

Unleashing the Power of LLMs for Medical Video Answer Localization MICCAI 2025 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering CVPR 2025 On the Consistency of Video Large Language Models in Temporal Comprehension CVPR 2025 Visual Intention Grounding for Egocentric Assistants ICCV 2025 Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis ICCV 2025 Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories ICCV 2025 LASO: Language-guided Affordance Segmentation on 3D Object CVPR 2024 Abductive Ego-View Accident Video Understanding for Safe Driving Perception CVPR 2024 Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives ACL 2024 Can I Trust Your Answer? Visually Grounded Video Question Answering CVPR 2024 FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms AAAI 2023 Discovering Spatio-Temporal Rationales for Video Question Answering ICCV 2023 Video Question Answering: Datasets, Algorithms and Challenges EMNLP 2022 Invariant Grounding for Video Question Answering CVPR 2022 Video as Conditional Graph Hierarchy for Multi-Granular Question Answering AAAI 2022 Video Graph Transformer for Video Question Answering ECCV 2022 NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions CVPR 2021 Visual Relation Grounding in Videos ECCV 2020