conftrace_

Yunlong Tang

6 papers · 2025–2025 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+4 more ↓

🐝 Cross-Pollinator (15) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🌍 Conference Polyglot (3)

🌈 Renaissance Researcher (5) ⭐ Rising Star (6) ⚡ Prolific Year (6) ❓ The Questioner (2)

Conferences

AAAI (3) CVPR (2) ICCV (1)

Top co-authors

Chenliang Xu (6) Jing Bi (3) Hang Hua (3) Susan Liang (2) Chao Huang (2) Mingqian Feng (2) Zeliang Zhang (2) Junjia Guo (2) Yiting Liao (1) Daiki Shimada (1)

Keywords

multimodal large language model (3) large language model (2) chain-of-thought reasoning (1) multimodal learning (1) audio-visual learning (1) cross-modal learning (1) video understanding (1) flow matching (1) instruction tuning (1) diffusion model (1) salient object detection (1) video summarization (1) vision language model (1) attention head (1) multimodal language model (1) visual token (1) video question answering (1) attention weight (1) visual understanding (1) acoustic synthesis (1)

Papers

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning AAAI 2025 Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding AAAI 2025 CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion AAAI 2025 VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? CVPR 2025 Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach CVPR 2025 p-AVAS: Can Physics-Integrated Audio-Visual Modeling Boost Neural Acoustic Synthesis? ICCV 2025