George Saon
26 papers · 2016–2024 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Conference Polyglot (2) π Academic Marathon (8) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Cross-Pollinator
(12)
πΊοΈ
Taxonomy Completionist
(34)
π
Conference Polyglot
(2)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(25)
π€
Dynamic Duo
(15)
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
π
Keyword Champion
(3)
π
Trend Setter
ποΈ
Keyword Collector
(94)
β‘
Prolific Year
(5)
π
Century Club
(26)
π
Conference Pioneer
π₯
Unstoppable
(6)
Conferences
INTERSPEECH (25)
EMNLP (1)
Top co-authors
Keywords
speech recognition
(13)
language model
(8)
word error rate
(8)
automatic speech recognition
(7)
long short-term memory
(6)
recurrent neural network transducer
(6)
acoustic model
(5)
recurrent neural network
(4)
data augmentation
(3)
end-to-end speech recognition
(3)
convolutional neural network
(3)
acoustic modeling
(3)
rnn transducer
(3)
multi-task learning
(2)
beam search
(2)
conversational speech
(2)
label smoothing
(2)
model quantization
(2)
domain adaptation
(2)
connectionist temporal classification
(2)
Papers
Exploring the limits of decoder-only models trained on public speech recognition corpora
INTERSPEECH 2024
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
EMNLP 2023
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition
INTERSPEECH 2023
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
INTERSPEECH 2022
Global RNN Transducer Models For Multi-dialect Speech Recognition
INTERSPEECH 2022
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
INTERSPEECH 2022
Extending RNN-T-based speech recognition systems with emotion and language classification
INTERSPEECH 2022
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
INTERSPEECH 2022
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
INTERSPEECH 2021
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio
INTERSPEECH 2021
4-Bit Quantization of LSTM-Based Speech Recognition Models
INTERSPEECH 2021
On the Limit of English Conversational Speech Recognition
INTERSPEECH 2021
Integrating Dialog History into End-to-End Spoken Language Understanding Systems
INTERSPEECH 2021
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition
INTERSPEECH 2020
Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard
INTERSPEECH 2020
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition
INTERSPEECH 2019
Advancing Sequence-to-Sequence Based Speech Recognition
INTERSPEECH 2019
Challenging the Boundaries of Speech Recognition: The MALACH Corpus
INTERSPEECH 2019
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition
INTERSPEECH 2019
Embedding-Based Speaker Adaptive Training of Deep Neural Networks
INTERSPEECH 2017
English Conversational Telephone Speech Recognition by Humans and Machines
INTERSPEECH 2017
Empirical Exploration of Novel Architectures and Objectives for Language Models
INTERSPEECH 2017
Direct Acoustics-to-Word Models for English Conversational Speech Recognition
INTERSPEECH 2017
The IBM 2016 English Conversational Telephone Speech Recognition System
INTERSPEECH 2016
Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings
INTERSPEECH 2016