Junbin Xiao
18 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (7) π Academic Marathon (5) π Conference Polyglot (7) πΊοΈ Taxonomy Completionist (31)
π
Academic Marathon
(5)
πΊοΈ
Taxonomy Completionist
(31)
π
Renaissance Researcher
(7)
π¬
Deep Specialist
(10)
π€
Dynamic Duo
(13)
π
Keyword Champion
(3)
β
The Questioner
π
Century Club
(18)
ποΈ
Keyword Collector
(75)
β‘
Prolific Year
(6)
π₯
Unstoppable
(6)
Conferences
CVPR (7)
ICCV (4)
AAAI (2)
ECCV (2)
ACL (1)
EMNLP (1)
MICCAI (1)
Top co-authors
Keywords
multimodal learning
(7)
video question answering
(7)
video understanding
(6)
temporal grounding
(3)
egocentric vision
(3)
temporal reasoning
(2)
visual question answering
(2)
video diffusion
(2)
cross-modal interaction
(2)
visual grounding
(2)
causal inference
(2)
vision-language model
(2)
affordance segmentation
(2)
causal reasoning
(2)
social media analysis
(1)
self-supervised learning
(1)
video classification
(1)
domain generalization
(1)
contrastive learning
(1)
video synthesis
(1)
Papers
Unleashing the Power of LLMs for Medical Video Answer Localization
MICCAI 2025
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
On the Consistency of Video Large Language Models in Temporal Comprehension
CVPR 2025
Visual Intention Grounding for Egocentric Assistants
ICCV 2025
Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis
ICCV 2025
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
ICCV 2025
LASO: Language-guided Affordance Segmentation on 3D Object
CVPR 2024
Abductive Ego-View Accident Video Understanding for Safe Driving Perception
CVPR 2024
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
ACL 2024
Can I Trust Your Answer? Visually Grounded Video Question Answering
CVPR 2024
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms
AAAI 2023
Discovering Spatio-Temporal Rationales for Video Question Answering
ICCV 2023
Video Question Answering: Datasets, Algorithms and Challenges
EMNLP 2022
Invariant Grounding for Video Question Answering
CVPR 2022
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
AAAI 2022
Video Graph Transformer for Video Question Answering
ECCV 2022
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions
CVPR 2021
Visual Relation Grounding in Videos
ECCV 2020