Zhihao Du
12 papers · 2019–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π§ Keyword Pioneer π Conference Polyglot (4)
π
Conference Polyglot
(4)
π
Academic Marathon
(6)
π
Cross-Pollinator
(13)
π€
Dynamic Duo
(10)
π§¬
Topic Evolution
π
Trend Setter
π
Century Club
(12)
ποΈ
Keyword Collector
(65)
Conferences
INTERSPEECH (8)
EMNLP (2)
AAAI (1)
ACL (1)
Top co-authors
Keywords
automatic speech recognition
(4)
end-to-end model
(3)
character error rate
(2)
voice activity detection
(2)
neural network
(2)
speaker diarization
(2)
speaker adaptation
(2)
large language model
(2)
speech recognition
(1)
modality alignment
(1)
in-context learning
(1)
speaker embedding
(1)
speaker verification
(1)
mandarin speech
(1)
adversarial learning
(1)
speech enhancement
(1)
language model
(1)
self-supervised learning
(1)
denoising autoencoder
(1)
noise robustness
(1)
Papers
UniSpeaker: A Unified Approach for Multimodality-driven Speaker Generation
EMNLP 2025
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
ACL 2025
Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration
AAAI 2025
Personality-memory Gated Adaptation: An Efficient Speaker Adaptation for Personalized End-to-end Automatic Speech Recognition
INTERSPEECH 2024
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
INTERSPEECH 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
INTERSPEECH 2023
Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition
INTERSPEECH 2023
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
EMNLP 2022
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings
INTERSPEECH 2022
Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement
INTERSPEECH 2020
Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition
INTERSPEECH 2020
Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events
INTERSPEECH 2019