Zijia Zhao
7 papers · 2023–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (4) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (15) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
ACL (2)
CVPR (2)
ICLR (2)
NIPS (1)
Top co-authors
Keywords
visual question answering
(1)
motion estimation
(1)
video captioning
(1)
question answering
(1)
multimodal learning
(1)
egocentric vision
(1)
multi-modal learning
(1)
video understanding
(1)
audio-text retrieval
(1)
foundation model
(1)
vision-language model
(1)
multimodal large language model
(1)
video language model
(1)
multi-hop reasoning
(1)
video-text retrieval
(1)
motion vector
(1)
token reduction
(1)
large vision language model
(1)
frame sampling
(1)
motion understanding
(1)
Papers
M3-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering
ACL 2026
Efficient Motion-Aware Video MLLM
CVPR 2025
Exploring the Design Space of Visual Context Representation in Video MLLMs
ICLR 2025
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
ICLR 2025
Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions
ACL 2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
CVPR 2024
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
NIPS 2023