Shangzhe Di
5 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (12)
Cross-Pollinator (15)
Conferences
CVPR (2)
AAAI (1)
ICCV (1)
ICLR (1)
Top co-authors
Keywords
video question answering (3)
video understanding (3)
instruction tuning (2)
question answering (2)
egocentric video (2)
vision-language model (1)
multimodal large language model (1)
temporal attention (1)
multimodal reasoning (1)
multi-hop reasoning (1)
video representation (1)
multitask training (1)
action detection (1)
online action detection (1)
streaming video (1)
large language model (1)
spatial-temporal dynamics (1)
agent-based system (1)
vision transformer (1)
query grounding (1)
Papers
Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
AAAI 2025
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation
CVPR 2025
Learning Streaming Video Representation via Multitask Training
ICCV 2025
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
ICLR 2025
Grounded Question-Answering in Long Egocentric Videos
CVPR 2024