Changli Tang
7 papers · 2023–2025 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (4)
Interdisciplinary Bridge
Keyword Pioneer
Taxonomy Completionist (10)
Cross-Pollinator (12)
Renaissance Researcher (6)
The Questioner
Conferences
ICML (3)
INTERSPEECH (2)
EMNLP (1)
ICLR (1)
Keywords
benchmark evaluation (1)
multi-task learning (1)
k-means clustering (1)
sound source localization (1)
visual question answering (1)
self-supervised learning (1)
audio-visual learning (1)
video understanding (1)
multimodal large language model (1)
speech representation (1)
visual language model (1)
far-field speech recognition (1)
multi-channel audio (1)
spatial audio (1)
speech extraction (1)
audio-visual interaction (1)
teacher network (1)
large language model (1)
speech pre-training (1)
video comprehension (1)
Papers
Improving LLM Video Understanding with 16 Frames Per Second
ICML 2025
Audio-centric Video Understanding Benchmark without Text Shortcut
EMNLP 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
ICML 2025
Can Large Language Models Understand Spatial Audio?
INTERSPEECH 2024
SALMONN: Towards Generic Hearing Abilities for Large Language Models
ICLR 2024
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
ICML 2024
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
INTERSPEECH 2023