Wangyou Zhang
13 papers · 2019–2025 · 3 top CS/AI conferences
Achievements
🏃 Academic Marathon (6)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🌍 Conference Polyglot (3)
🐝 Cross-Pollinator (11)
🤝 Dynamic Duo (10)
🧬 Topic Evolution
🗃️ Keyword Collector (63)
💎 Century Club (13)
📈 Trend Setter
Conferences
INTERSPEECH (11)
EMNLP (1)
NAACL (1)
Top co-authors
Keywords
speech enhancement (5)
speaker recognition (4)
self-supervised learning (3)
speech separation (3)
speech recognition (2)
end-to-end speech recognition (2)
permutation invariant training (2)
weakly supervised learning (1)
automatic speech recognition (1)
domain generalization (1)
model architecture (1)
spoken language understanding (1)
deep learning (1)
continuous speech (1)
curriculum learning (1)
signal-to-noise ratio (1)
speech synthesis (1)
speaker verification (1)
speaker embedding (1)
knowledge distillation (1)
Papers
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music
NAACL 2025
Towards Robust Speech Representation Learning for Thousands of Languages
EMNLP 2024
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement
INTERSPEECH 2024
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
INTERSPEECH 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
INTERSPEECH 2024
Overlap Aware Continuous Speech Separation without Permutation Invariant Training
INTERSPEECH 2023
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition
INTERSPEECH 2023
Separating Long-Form Speech with Group-wise Permutation Invariant Training
INTERSPEECH 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
INTERSPEECH 2022
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
INTERSPEECH 2020
Learning Contextual Language Embeddings for Monaural Multi-Talker Speech Recognition
INTERSPEECH 2020
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System
INTERSPEECH 2019
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking
INTERSPEECH 2019