conftrace_

George Saon

26 papers · 2016–2024 · 2 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🌍 Conference Polyglot (2) 🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (12) 🗺️ Taxonomy Completionist (34) 🌍 Conference Polyglot (2) 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (25) 🤝 Dynamic Duo (15) 🔬 Deep Specialist (16) 🧬 Topic Evolution 🏆 Keyword Champion (3) 📈 Trend Setter 🗃️ Keyword Collector (94) ⚡ Prolific Year (5) 💎 Century Club (26) 🚀 Conference Pioneer 🔥 Unstoppable (6)

Conferences

INTERSPEECH (25) EMNLP (1)

Top co-authors

Brian Kingsbury (15) Xiaodong Cui (9) Zoltán Tüske (9) Gakuto Kurata (8) Kartik Audhkhasi (6) Samuel Thomas (5) Michael Picheny (5) Bhuvana Ramabhadran (4) Masayuki Suzuki (4) David Haws (3)

Keywords

speech recognition (13) language model (8) word error rate (8) automatic speech recognition (7) long short-term memory (6) recurrent neural network transducer (6) acoustic model (5) recurrent neural network (4) data augmentation (3) end-to-end speech recognition (3) convolutional neural network (3) acoustic modeling (3) rnn transducer (3) multi-task learning (2) beam search (2) conversational speech (2) label smoothing (2) model quantization (2) domain adaptation (2) connectionist temporal classification (2)

Papers

Exploring the limits of decoder-only models trained on public speech recognition corpora INTERSPEECH 2024 Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries EMNLP 2023 Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition INTERSPEECH 2023 Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization INTERSPEECH 2022 Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing INTERSPEECH 2022 Global RNN Transducer Models For Multi-dialect Speech Recognition INTERSPEECH 2022 Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems INTERSPEECH 2022 Extending RNN-T-based speech recognition systems with emotion and language classification INTERSPEECH 2022 VQ-T: RNN Transducers using Vector-Quantized Prediction Network States INTERSPEECH 2022 Reducing Exposure Bias in Training Recurrent Neural Network Transducers INTERSPEECH 2021 Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio INTERSPEECH 2021 4-Bit Quantization of LSTM-Based Speech Recognition Models INTERSPEECH 2021 On the Limit of English Conversational Speech Recognition INTERSPEECH 2021 Integrating Dialog History into End-to-End Spoken Language Understanding Systems INTERSPEECH 2021 Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition INTERSPEECH 2020 Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard INTERSPEECH 2020 Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition INTERSPEECH 2019 Advancing Sequence-to-Sequence Based Speech Recognition INTERSPEECH 2019 Challenging the Boundaries of Speech Recognition: The MALACH Corpus INTERSPEECH 2019 A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition INTERSPEECH 2019 Embedding-Based Speaker Adaptive Training of Deep Neural Networks INTERSPEECH 2017 English Conversational Telephone Speech Recognition by Humans and Machines INTERSPEECH 2017 Empirical Exploration of Novel Architectures and Objectives for Language Models INTERSPEECH 2017 Direct Acoustics-to-Word Models for English Conversational Speech Recognition INTERSPEECH 2017 The IBM 2016 English Conversational Telephone Speech Recognition System INTERSPEECH 2016 Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings INTERSPEECH 2016