Brian Kingsbury
36 papers · 2009–2025 · 7 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (7)
Keyword Pioneer
Interdisciplinary Bridge
Hot Topic Early Bird
Academic Marathon (16)
Taxonomy Completionist (50)
Conference Loyalist (26)
Dynamic Duo (15)
Topic Evolution
Deep Specialist (16)
Unstoppable (7)
Trend Setter
Conference Pioneer
Keyword Collector (153)
Prolific Year (7)
Century Club (36)
Conferences
INTERSPEECH (26)
NAACL (3)
CVPR (2)
ICML (2)
ICCV (1)
JMLR (1)
NIPS (1)
Top co-authors
Keywords
speech recognition (10)
word error rate (6)
automatic speech recognition (5)
self-supervised learning (5)
recurrent neural network transducer (5)
video retrieval (4)
language model (4)
recurrent neural network (4)
acoustic model (3)
acoustic modeling (3)
long short-term memory (3)
data augmentation (3)
spoken language understanding (3)
deep neural network (3)
zero-shot retrieval (3)
contrastive learning (3)
connectionist temporal classification (2)
stochastic optimization (2)
representation learning (2)
label smoothing (2)
Papers
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
CVPR 2025
M2ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective Optimization
INTERSPEECH 2024
Exploring the limits of decoder-only models trained on public speech recognition corpora
INTERSPEECH 2024
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition
INTERSPEECH 2023
ConvKT: Conversation-Level Knowledge Transfer for Context Aware End-to-End Spoken Language Understanding
INTERSPEECH 2023
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
INTERSPEECH 2023
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
INTERSPEECH 2022
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
INTERSPEECH 2022
A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization
NIPS 2022
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval
CVPR 2022
Global RNN Transducer Models For Multi-dialect Speech Recognition
INTERSPEECH 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
INTERSPEECH 2022
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio
INTERSPEECH 2021
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
ICCV 2021
Integrating Dialog History into End-to-End Spoken Language Understanding Systems
INTERSPEECH 2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
INTERSPEECH 2021
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
INTERSPEECH 2021
On the Limit of English Conversational Speech Recognition
INTERSPEECH 2021
4-Bit Quantization of LSTM-Based Speech Recognition Models
INTERSPEECH 2021
Cascaded Multilingual Audio-Visual Learning from Videos
INTERSPEECH 2021
Representation Based Meta-Learning for Few-Shot Spoken Intent Recognition
INTERSPEECH 2020
End-to-End Spoken Language Understanding Without Full Transcripts
INTERSPEECH 2020
Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard
INTERSPEECH 2020
Transliteration Based Data Augmentation for Training Multilingual ASR Acoustic Models in Low Resource Settings
INTERSPEECH 2020
Kernel Approximation Methods for Speech Recognition
JMLR 2019
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition
INTERSPEECH 2019
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition
INTERSPEECH 2019
Challenging the Boundaries of Speech Recognition: The MALACH Corpus
INTERSPEECH 2019
Estimating Information Flow in Deep Neural Networks
ICML 2019
Beyond Backprop: Online Alternating Minimization with Auxiliary Variables
ICML 2019
Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling
INTERSPEECH 2016
Multilingual Data Selection for Low Resource Speech Recognition
INTERSPEECH 2016
Deep Neural Network Language Models
NAACL 2012
Tied-Mixture Language Modeling in Continuous Space
NAACL 2009
Fast decoding for open vocabulary spoken term detection
NAACL 2009