Zhong Meng

29 papers · 2016–2024 · 2 conferences · across top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (2)

🐝 Cross-Pollinator (12) 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (27) 🤝 Dynamic Duo (13) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🔬 Deep Specialist (13) 🗃️ Keyword Collector (56) 🔥 Unstoppable (9) 🚀 Conference Pioneer ⚡ Prolific Year (6) 💎 Century Club (29)

Conferences

INTERSPEECH (27) NAACL (2)

Top co-authors

Jinyu Li (13) Yifan Gong (10) Naoyuki Kanda (10) Yashesh Gaur (9) Takuya Yoshioka (7) Rohit Prabhavalkar (6) Xiaofei Wang (6) Yu Wu (6) Zhuo Chen (6) Weiran Wang (5)

Keywords

automatic speech recognition (13) speech recognition (7) end-to-end speech recognition (5) word error rate (4) transformer transducer (4) recurrent neural network transducer (4) speaker identification (3) adversarial learning (3) deep neural network (3) speaker counting (3) connectionist temporal classification (3) end-to-end model (3) keyword spotting (2) speech enhancement (2) domain adaptation (2) language model (2) beam search (2) acoustic modeling (2) feature mapping (2) long short-term memory (2)

Papers

Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR · NAACL 2024
Text Injection for Neural Contextual Biasing · INTERSPEECH 2024
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions · INTERSPEECH 2024
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping · INTERSPEECH 2024
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm · INTERSPEECH 2024
Massive End-to-end Speech Recognition Models with Time Reduction · NAACL 2024
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models · INTERSPEECH 2023
Improving Joint Speech-Text Representations Without Alignment · INTERSPEECH 2023
Streaming Multi-Talker ASR with Token-Level Serialized Output Training · INTERSPEECH 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition · INTERSPEECH 2022
Separating Long-Form Speech with Group-wise Permutation Invariant Training · INTERSPEECH 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings · INTERSPEECH 2022
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer · INTERSPEECH 2021
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS · INTERSPEECH 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition · INTERSPEECH 2021
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone · INTERSPEECH 2021
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions · INTERSPEECH 2021
End-to-End Speaker-Attributed ASR with Transformer · INTERSPEECH 2021
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers · INTERSPEECH 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability · INTERSPEECH 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition · INTERSPEECH 2020
Acoustic-to-Phrase Models for Speech Recognition · INTERSPEECH 2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition · INTERSPEECH 2019
Cycle-Consistent Speech Enhancement · INTERSPEECH 2018
Adversarial Feature-Mapping for Speech Enhancement · INTERSPEECH 2018
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting · INTERSPEECH 2017
Minimum Semantic Error Cost Training of Deep Long Short-Term Memory Networks for Topic Spotting on Conversational Speech · INTERSPEECH 2017
Statistical Modeling of Speaker’s Voice with Temporal Co-Location for Active Voice Authentication · INTERSPEECH 2016
Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting · INTERSPEECH 2016