Yunlong Tang
6 papers · 2025 · 3 top CS/AI conferences
Achievements
Cross-Pollinator (15)
Interdisciplinary Bridge
Taxonomy Completionist (16)
Keyword Pioneer
Conference Polyglot (3)
Renaissance Researcher (5)
Rising Star (6)
Prolific Year (6)
The Questioner (2)
Conferences
AAAI (3)
CVPR (2)
ICCV (1)
Keywords
multimodal large language model (3)
large language model (2)
chain-of-thought reasoning (1)
multimodal learning (1)
audio-visual learning (1)
cross-modal learning (1)
video understanding (1)
flow matching (1)
instruction tuning (1)
diffusion model (1)
salient object detection (1)
video summarization (1)
vision language model (1)
attention head (1)
multimodal language model (1)
visual token (1)
video question answering (1)
attention weight (1)
visual understanding (1)
acoustic synthesis (1)
Papers
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
AAAI 2025
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
AAAI 2025
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion
AAAI 2025
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CVPR 2025
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
CVPR 2025
π-AVAS: Can Physics-Integrated Audio-Visual Modeling Boost Neural Acoustic Synthesis?
ICCV 2025