conftrace_

Wangyou Zhang

13 papers · 2019–2025 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (11)

🏃 Academic Marathon (6) 🐝 Cross-Pollinator (11) 🤝 Dynamic Duo (10) 🧬 Topic Evolution 🗃️ Keyword Collector (63) 💎 Century Club (13) 📈 Trend Setter

Conferences

INTERSPEECH (11) EMNLP (1) NAACL (1)

Top co-authors

Yanmin Qian (10) Shinji Watanabe (7) Chenda Li (4) Xuankai Chang (4) Jiatong Shi (3) Jee-weon Jung (2) Kohei Saijo (2) Zhaoheng Ni (2) Samuele Cornell (2) Robin Scheibler (2)

Keywords

speech enhancement (5) speaker recognition (4) self-supervised learning (3) speech separation (3) speech recognition (2) end-to-end speech recognition (2) permutation invariant training (2) weakly supervised learning (1) automatic speech recognition (1) domain generalization (1) model architecture (1) spoken language understanding (1) deep learning (1) continuous speech (1) curriculum learning (1) signal-to-noise ratio (1) speech synthesis (1) speaker verification (1) speaker embedding (1) knowledge distillation (1)

Papers

VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music NAACL 2025 Towards Robust Speech Representation Learning for Thousands of Languages EMNLP 2024 Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement INTERSPEECH 2024 ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models INTERSPEECH 2024 URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement INTERSPEECH 2024 Overlap Aware Continuous Speech Separation without Permutation Invariant Training INTERSPEECH 2023 Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition INTERSPEECH 2023 Separating Long-Form Speech with Group-wise Permutation Invariant Training INTERSPEECH 2022 ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding INTERSPEECH 2022 End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming INTERSPEECH 2020 Learning Contextual Language Embeddings for Monaural Multi-Talker Speech Recognition INTERSPEECH 2020 Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System INTERSPEECH 2019 Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking INTERSPEECH 2019