Zhihao Du

12 papers · 2019–2025 · 4 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🌍 Conference Polyglot (4)

🌍 Conference Polyglot (4) 🏃 Academic Marathon (6) 🐝 Cross-Pollinator (13) 🤝 Dynamic Duo (10) 🧬 Topic Evolution 📈 Trend Setter 💎 Century Club (12) 🗃️ Keyword Collector (65)

Conferences

INTERSPEECH (8) EMNLP (2) AAAI (1) ACL (1)

Top co-authors

Shiliang Zhang (10) Jiqing Han (5) Qian Chen (4) Siqi Zheng (3) Fan Yu (3) Jiaming Wang (2) Yue Gu (2) Zhifu Gao (2) Guanrou Yang (1) Yuxiao Lin (1)

Keywords

automatic speech recognition (4) end-to-end model (3) character error rate (2) voice activity detection (2) neural network (2) speaker diarization (2) speaker adaptation (2) large language model (2) speech recognition (1) modality alignment (1) in-context learning (1) speaker embedding (1) speaker verification (1) mandarin speech (1) adversarial learning (1) speech enhancement (1) language model (1) self-supervised learning (1) denoising autoencoder (1) noise robustness (1)

Papers

UniSpeaker: A Unified Approach for Multimodality-driven Speaker Generation EMNLP 2025 OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation ACL 2025 Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration AAAI 2025 Personality-memory Gated Adaptation: An Efficient Speaker Adaptation for Personalized End-to-end Automatic Speech Recognition INTERSPEECH 2024 FunASR: A Fundamental End-to-End Speech Recognition Toolkit INTERSPEECH 2023 CASA-ASR: Context-Aware Speaker-Attributed ASR INTERSPEECH 2023 Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition INTERSPEECH 2023 Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis EMNLP 2022 A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings INTERSPEECH 2022 Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement INTERSPEECH 2020 Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition INTERSPEECH 2020 Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events INTERSPEECH 2019