conftrace_

Brian Kingsbury

36 papers · 2009–2025 · 7 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🏃 Academic Marathon (16)

🗺️ Taxonomy Completionist (50) 🏠 Conference Loyalist (26) 🤝 Dynamic Duo (15) 🧬 Topic Evolution 🔬 Deep Specialist (16) 🔥 Unstoppable (7) 📈 Trend Setter 🚀 Conference Pioneer 🗃️ Keyword Collector (153) ⚡ Prolific Year (7) 💎 Century Club (36)

Conferences

INTERSPEECH (26) NAACL (3) CVPR (2) ICML (2) ICCV (1) JMLR (1) NIPS (1)

Top co-authors

George Saon (15) Samuel Thomas (13) Xiaodong Cui (9) Zoltán Tüske (9) Kartik Audhkhasi (7) Michael Picheny (7) Hilde Kuehne (6) Andrew Rouditchenko (6) Rogerio Feris (5) James Glass (5)

Keywords

speech recognition (10) word error rate (6) automatic speech recognition (5) self-supervised learning (5) recurrent neural network transducer (5) video retrieval (4) language model (4) recurrent neural network (4) acoustic model (3) acoustic modeling (3) long short-term memory (3) data augmentation (3) spoken language understanding (3) deep neural network (3) zero-shot retrieval (3) contrastive learning (3) connectionist temporal classification (2) stochastic optimization (2) representation learning (2) label smoothing (2)

Papers

CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment CVPR 2025
M2ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective Optimization INTERSPEECH 2024
Exploring the limits of decoder-only models trained on public speech recognition corpora INTERSPEECH 2024
Improving RNN Transducer Acoustic Models for English Conversational Speech Recognition INTERSPEECH 2023
ConvKT: Conversation-Level Knowledge Transfer for Context Aware End-to-End Spoken Language Understanding INTERSPEECH 2023
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages INTERSPEECH 2023
VQ-T: RNN Transducers using Vector-Quantized Prediction Network States INTERSPEECH 2022
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization INTERSPEECH 2022
A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization NIPS 2022
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval CVPR 2022
Global RNN Transducer Models For Multi-dialect Speech Recognition INTERSPEECH 2022
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing INTERSPEECH 2022
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio INTERSPEECH 2021
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos ICCV 2021
Integrating Dialog History into End-to-End Spoken Language Understanding Systems INTERSPEECH 2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos INTERSPEECH 2021
Reducing Exposure Bias in Training Recurrent Neural Network Transducers INTERSPEECH 2021
On the Limit of English Conversational Speech Recognition INTERSPEECH 2021
4-Bit Quantization of LSTM-Based Speech Recognition Models INTERSPEECH 2021
Cascaded Multilingual Audio-Visual Learning from Videos INTERSPEECH 2021
Representation Based Meta-Learning for Few-Shot Spoken Intent Recognition INTERSPEECH 2020
End-to-End Spoken Language Understanding Without Full Transcripts INTERSPEECH 2020
Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard INTERSPEECH 2020
Transliteration Based Data Augmentation for Training Multilingual ASR Acoustic Models in Low Resource Settings INTERSPEECH 2020
Kernel Approximation Methods for Speech Recognition JMLR 2019
A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition INTERSPEECH 2019
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition INTERSPEECH 2019
Challenging the Boundaries of Speech Recognition: The MALACH Corpus INTERSPEECH 2019
Estimating Information Flow in Deep Neural Networks ICML 2019
Beyond Backprop: Online Alternating Minimization with Auxiliary Variables ICML 2019
Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling INTERSPEECH 2016
Multilingual Data Selection for Low Resource Speech Recognition INTERSPEECH 2016
Deep Neural Network Language Models NAACL 2012
Tied-Mixture Language Modeling in Continuous Space NAACL 2009
Fast decoding for open vocabulary spoken term detection NAACL 2009