Zhong Meng
29 papers · 2016–2024 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (16) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π Conference Polyglot (2)
π
Cross-Pollinator
(12)
πΊοΈ
Taxonomy Completionist
(16)
π£
Hot Topic Early Bird
π
Conference Loyalist
(27)
π€
Dynamic Duo
(13)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π¬
Deep Specialist
(13)
ποΈ
Keyword Collector
(56)
π₯
Unstoppable
(9)
π
Conference Pioneer
β‘
Prolific Year
(6)
π
Century Club
(29)
Conferences
INTERSPEECH (27)
NAACL (2)
Top co-authors
Keywords
automatic speech recognition
(13)
speech recognition
(7)
end-to-end speech recognition
(5)
word error rate
(4)
transformer transducer
(4)
recurrent neural network transducer
(4)
speaker identification
(3)
adversarial learning
(3)
deep neural network
(3)
speaker counting
(3)
connectionist temporal classification
(3)
end-to-end model
(3)
keyword spotting
(2)
speech enhancement
(2)
domain adaptation
(2)
language model
(2)
beam search
(2)
acoustic modeling
(2)
feature mapping
(2)
long short-term memory
(2)
Papers
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
NAACL 2024
Text Injection for Neural Contextual Biasing
INTERSPEECH 2024
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
INTERSPEECH 2024
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping
INTERSPEECH 2024
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
INTERSPEECH 2024
Massive End-to-end Speech Recognition Models with Time Reduction
NAACL 2024
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
INTERSPEECH 2023
Improving Joint Speech-Text Representations Without Alignment
INTERSPEECH 2023
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
INTERSPEECH 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
INTERSPEECH 2022
Separating Long-Form Speech with Group-wise Permutation Invariant Training
INTERSPEECH 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
INTERSPEECH 2022
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
INTERSPEECH 2021
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS
INTERSPEECH 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
INTERSPEECH 2021
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
INTERSPEECH 2021
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions
INTERSPEECH 2021
End-to-End Speaker-Attributed ASR with Transformer
INTERSPEECH 2021
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers
INTERSPEECH 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
INTERSPEECH 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
INTERSPEECH 2020
Acoustic-to-Phrase Models for Speech Recognition
INTERSPEECH 2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition
INTERSPEECH 2019
Cycle-Consistent Speech Enhancement
INTERSPEECH 2018
Adversarial Feature-Mapping for Speech Enhancement
INTERSPEECH 2018
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting
INTERSPEECH 2017
Minimum Semantic Error Cost Training of Deep Long Short-Term Memory Networks for Topic Spotting on Conversational Speech
INTERSPEECH 2017
Statistical Modeling of Speakerβs Voice with Temporal Co-Location for Active Voice Authentication
INTERSPEECH 2016
Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting
INTERSPEECH 2016