Yudong Yang
4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(3)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
ICML (2)
EMNLP (1)
INTERSPEECH (1)
Top co-authors
Keywords
benchmark evaluation
(1)
visual question answering
(1)
audio-visual learning
(1)
video understanding
(1)
optical flow
(1)
diffusion model
(1)
multimodal large language model
(1)
visual language model
(1)
ultrasound tongue imaging
(1)
audio-visual interaction
(1)
acoustic to articulatory inversion
(1)
video comprehension
(1)
tongue trajectory
(1)
audio-centric video understanding
(1)
Papers
Audio-centric Video Understanding Benchmark without Text Shortcut
EMNLP 2025
Improving LLM Video Understanding with 16 Frames Per Second
ICML 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
ICML 2025
Optical Flow Guided Tongue Trajectory Generation for Diffusion-based Acoustic to Articulatory Inversion
INTERSPEECH 2024