Jianyuan Sun
4 papers · 2022–2024 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
INTERSPEECH (4)
Top co-authors
Keywords
feature fusion
(2)
audio captioning
(2)
machine translation
(1)
multimodal learning
(1)
image captioning
(1)
cross-modal retrieval
(1)
audio-text retrieval
(1)
multi-scale feature
(1)
visual feature
(1)
transformer decoder
(1)
triplet loss
(1)
encoder-decoder model
(1)
audio representation
(1)
pyramid network
(1)
automated audio captioning
(1)
audio-visual attention
(1)
pyramid feature fusion
(1)
metric learning
(1)
cross-content attention
(1)
embedding learning
(1)
Papers
PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning
INTERSPEECH 2024
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
INTERSPEECH 2023
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
INTERSPEECH 2023
On Metric Learning for Audio-Text Cross-Modal Retrieval
INTERSPEECH 2022