Sanjeev Khudanpur

82 papers · 2003–2026 · 7 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🗺️ Taxonomy Completionist (26) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (8) 🗺️ Taxonomy Completionist (26) 🏠 Conference Loyalist (49) 🏆 Keyword Champion (2) 🧬 Topic Evolution 👥 Mega-Team (20) 🔬 Deep Specialist (24) 🤝 Dynamic Duo (36) ⚡ Prolific Year (7) 🔥 Unstoppable (12) ❓ The Questioner (2) 💎 Century Club (81) 🗃️ Keyword Collector (104) 🚀 Conference Pioneer

Conferences

INTERSPEECH (49) ACL (12) NAACL (10) EMNLP (6) COLING (2) IJCNLP (2) EACL (1)

Top co-authors

Daniel Povey (36) Matthew Wiesner (14) Yiming Wang (10) Vimal Manohar (9) Najim Dehak (8) Jan Trmal (7) Hainan Xu (7) Zhifei Li (7) David Snyder (7) Jason Eisner (6)

Keywords

automatic speech recognition (20) deep neural network (9) speech recognition (8) word error rate (8) machine translation (6) speaker diarization (6) speech translation (6) speaker recognition (5) acoustic model (4) acoustic modeling (4) language identification (4) speaker embedding (4) low-resource language (3) connectionist temporal classification (3) language diarization (3) domain adaptation (3) time delay neural network (3) convolutional neural network (3) probabilistic linear discriminant analysis (3) self-supervised learning (2)

Papers

CSPB: Conversational Speech Processing Benchmark for Self-supervised Speech Models EACL 2026 Benchmarking Language Model Creativity: A Case Study on Code Generation NAACL 2025 Whisper-UT: A Unified Translation Framework for Speech and Text EMNLP 2025 HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation ACL 2025 Enhancing Neural Transducer for Multilingual ASR with Synchronized Language Diarization INTERSPEECH 2024 Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language INTERSPEECH 2024 Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages NAACL 2024 JHU IWSLT 2024 Dialectal and Low-resource System Description ACL 2024 Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation INTERSPEECH 2024 ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition COLING 2024 Multi-Channel Multi-Speaker ASR Using Target Speaker’s Solo Segment INTERSPEECH 2024 GPU-accelerated Guided Source Separation for Meeting Transcription INTERSPEECH 2023 Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts INTERSPEECH 2023 JHU IWSLT 2023 Dialect Speech Translation System Description ACL 2023 JHU IWSLT 2023 Multilingual Speech Translation System Description ACL 2023 Investigating model performance in language identification: beyond simple error statistics INTERSPEECH 2023 MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization INTERSPEECH 2023 HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation INTERSPEECH 2023 Chunking Defense for Adversarial Attacks on ASR INTERSPEECH 2022 PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification INTERSPEECH 2022 JHU IWSLT 2022 Dialect Speech Translation System Description ACL 2022 Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser INTERSPEECH 2022 Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem INTERSPEECH 2021 Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora EMNLP 2021 GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio INTERSPEECH 2021 Speaker Verification-Based Evaluation of Single-Channel Speech Separation INTERSPEECH 2021 Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition INTERSPEECH 2021 End-to-End Language Diarization for Bilingual Code-Switching Speech INTERSPEECH 2021 An Alternative to MFCCs for ASR INTERSPEECH 2020 Efficient MDI Adaptation for n-Gram Language Models INTERSPEECH 2020 Wake Word Detection with Alignment-Free Lattice-Free MMI INTERSPEECH 2020 Neural Language Modeling with Implicit Cache Pointers INTERSPEECH 2020 PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR INTERSPEECH 2020 Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network INTERSPEECH 2019 Multi-PLDA Diarization on Children’s Speech INTERSPEECH 2019 State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18 INTERSPEECH 2019 x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition INTERSPEECH 2019 Speaker Recognition Benchmark Using the CHiME-5 Corpus INTERSPEECH 2019 The JHU Speaker Recognition System for the VOiCES 2019 Challenge INTERSPEECH 2019 The JHU ASR System for VOiCES from a Distance Challenge 2019 INTERSPEECH 2019 Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings INTERSPEECH 2019 End-to-end Speech Recognition Using Lattice-free MMI INTERSPEECH 2018 A GPU-based WFST Decoder with Exact Lattice Generation INTERSPEECH 2018 Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages INTERSPEECH 2018 Output-Gate Projected Gated Recurrent Unit for Speech Recognition INTERSPEECH 2018 Acoustic Modeling from Frequency Domain Representations of Speech INTERSPEECH 2018 Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge INTERSPEECH 2018 Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition INTERSPEECH 2018 Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks INTERSPEECH 2018 End-to-end Deep Neural Network Age Estimation INTERSPEECH 2018 The Kaldi OpenKWS System: Improving Low Resource Keyword Search INTERSPEECH 2017 Backstitch: Counteracting Finite-Sample Bias via Negative Steps INTERSPEECH 2017 An Exploration of Dropout with LSTMs INTERSPEECH 2017 Deep Neural Network Embeddings for Text-Independent Speaker Verification INTERSPEECH 2017 Phone Duration Modeling for LVCSR Using Neural Networks INTERSPEECH 2017 Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework INTERSPEECH 2017 Topic Identification for Speech Without ASR INTERSPEECH 2017 Acoustic Modelling from the Signal Domain Using CNNs INTERSPEECH 2016 Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI INTERSPEECH 2016 Far-Field ASR Without Parallel Data INTERSPEECH 2016 A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation EMNLP 2015 Online Learning in Tensor Space ACL 2014 Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection ACL 2014 Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining ACL 2012 Revisiting the Case for Explicit Syntactic Information in Language Models NAACL 2012 Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT NAACL 2012 Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation EMNLP 2011 Efficient Subsampling for Training Complex Language Models EMNLP 2011 A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval NAACL 2010 Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets COLING 2010 Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation IJCNLP 2009 Efficient Extraction of Oracle-best Translations from Hypergraphs NAACL 2009 Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation ACL 2009 Variational Decoding for Statistical Machine Translation ACL 2009 Variational Decoding for Statistical Machine Translation IJCNLP 2009 Machine Translation System Combination using ITG-based Alignments ACL 2008 Unsupervised Learning of Acoustic Sub-word Units ACL 2008 Cross-Instance Tuning of Unsupervised Document Clustering Algorithms NAACL 2007 A Smorgasbord of Features for Statistical Machine Translation NAACL 2004 Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition NAACL 2003 Desparately Seeking Cebuano NAACL 2003 Cross-Lingual Lexical Triggers in Statistical Language Modeling EMNLP 2003