Zhiyong Yan
8 papers · 2021–2024 · 1 conference · across top CS/AI conferences
Achievements
Cross-Pollinator (4)
Keyword Pioneer
Hot Topic Early Bird
Interdisciplinary Bridge
Renaissance Researcher (6)
Taxonomy Completionist (23)
Mega-Team (20)
Conferences
INTERSPEECH (8)
Keywords
audio encoder (2)
audio tagging (2)
speech corpus (2)
transfer learning (2)
non-native speech (1)
multimodal learning (1)
self-supervised learning (1)
automatic speech recognition (1)
speech recognition (1)
cross-modal retrieval (1)
keyword spotting (1)
speaker embedding (1)
audio-text retrieval (1)
semi-supervised training (1)
convolutional neural network (1)
model architecture (1)
noise robustness (1)
language model (1)
low-rank adaptation (1)
audio classification (1)
Papers
Bridging Language Gaps in Audio-Text Retrieval
INTERSPEECH 2024
Scaling up masked audio encoder learning for general audio classification
INTERSPEECH 2024
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
INTERSPEECH 2024
Streaming Audio Transformers for Online Audio Tagging
INTERSPEECH 2024
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
INTERSPEECH 2023
UniKW-AT: Unified Keyword Spotting and Audio Tagging
INTERSPEECH 2022
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio
INTERSPEECH 2021
speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment
INTERSPEECH 2021