Handong Li
6 papers · 2023–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (4) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (13) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
ICCV (3)
EMNLP (1)
ICLR (1)
NIPS (1)
Top co-authors
Keywords
foundation model
(3)
multimodal learning
(3)
vision-language model
(2)
video understanding
(2)
large language model
(2)
audio-text retrieval
(1)
vision language model
(1)
low-rank adaptation
(1)
long video understanding
(1)
video-text retrieval
(1)
visual token compression
(1)
video-language understanding
(1)
parameter-efficient adaptation
(1)
universal representation
(1)
video-language model
(1)
token merging
(1)
spatiotemporal representation
(1)
video pretraining
(1)
causal aggregation
(1)
omni-modal intelligence
(1)
Papers
ViPE: Visual Perception in Parameter Space for Efficient Video-Language Understanding
EMNLP 2025
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
ICCV 2025
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
ICLR 2024
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
NIPS 2023