Zhesong Yu
3 papers · 2019–2024 · 2 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (2)
🏃 Academic Marathon (5)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (15)
Conferences
INTERSPEECH (2)
IJCAI (1)
Keywords
representation learning (1)
zero-shot learning (1)
attention mechanism (1)
multimodal learning (1)
instruction tuning (1)
audio-visual fusion (1)
convolutional neural network (1)
cross-modal alignment (1)
cross-modal fusion (1)
music information retrieval (1)
voice activity detection (1)
audio-language model (1)
large language model (1)
cover song identification (1)
temporal pyramid pooling (1)
multi-target pretraining (1)
musical video (1)
Papers
MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning
INTERSPEECH 2024
Attention-Based Cross-Modal Fusion for Audio-Visual Voice Activity Detection in Musical Video Streams
INTERSPEECH 2021
Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification
IJCAI 2019