Zhihang Liu
4 papers · 2024–2026 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🌈
Renaissance Researcher
(5)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
AAAI (3)
CVPR (1)
Top co-authors
Keywords
multimodal large language model
(2)
large language model
(2)
visual token
(2)
video understanding
(2)
instruction tuning
(1)
spatio-temporal localization
(1)
cross-modal alignment
(1)
semantic modeling
(1)
token compression
(1)
video grounding
(1)
video moment retrieval
(1)
visual document
(1)
video token
(1)
video token compression
(1)
instruction injection
(1)
conditional pre-training
(1)
region level
(1)
retrieval augmented generation
(1)
query-guided learning
(1)
question answering
(1)
Papers
RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding
AAAI 2026
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability
AAAI 2026
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models
CVPR 2025
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval
AAAI 2024