Enxin Song
4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (3) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
ICLR (2)
CVPR (1)
ICCV (1)
Top co-authors
Keywords
video understanding
(2)
large language model
(2)
multimodal large language model
(1)
memory mechanism
(1)
linear rnn
(1)
video-language model
(1)
model efficiency
(1)
video foundation model
(1)
long video
(1)
long video processing
(1)
visual token merge
(1)
video captioning
(1)
sparse memory
(1)
multimodal learning
(1)
Papers
Bringing RNNs Back to Efficient Open-Ended Video Understanding
ICCV 2025
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
ICLR 2025
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
ICLR 2025
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
CVPR 2024