Sanjeev Khudanpur
82 papers · 2003–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
๐บ๏ธ Taxonomy Completionist (26) ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Renaissance Researcher (8) ๐ฃ Hot Topic Early Bird
๐
Interdisciplinary Bridge
๐
Cross-Pollinator
(8)
๐บ๏ธ
Taxonomy Completionist
(26)
๐
Conference Loyalist
(49)
๐
Keyword Champion
(2)
๐งฌ
Topic Evolution
๐ฅ
Mega-Team
(20)
๐ฌ
Deep Specialist
(24)
๐ค
Dynamic Duo
(36)
โก
Prolific Year
(7)
๐ฅ
Unstoppable
(12)
โ
The Questioner
(2)
๐
Century Club
(81)
๐๏ธ
Keyword Collector
(104)
๐
Conference Pioneer
Conferences
INTERSPEECH (49)
ACL (12)
NAACL (10)
EMNLP (6)
COLING (2)
IJCNLP (2)
EACL (1)
Top co-authors
Keywords
automatic speech recognition
(20)
deep neural network
(9)
speech recognition
(8)
word error rate
(8)
machine translation
(6)
speaker diarization
(6)
speech translation
(6)
speaker recognition
(5)
acoustic model
(4)
acoustic modeling
(4)
language identification
(4)
speaker embedding
(4)
low-resource language
(3)
connectionist temporal classification
(3)
language diarization
(3)
domain adaptation
(3)
time delay neural network
(3)
convolutional neural network
(3)
probabilistic linear discriminant analysis
(3)
self-supervised learning
(2)
Papers
CSPB: Conversational Speech Processing Benchmark for Self-supervised Speech Models
EACL 2026
Benchmarking Language Model Creativity: A Case Study on Code Generation
NAACL 2025
Whisper-UT: A Unified Translation Framework for Speech and Text
EMNLP 2025
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation
ACL 2025
Enhancing Neural Transducer for Multilingual ASR with Synchronized Language Diarization
INTERSPEECH 2024
Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language
INTERSPEECH 2024
Kreyรฒl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
NAACL 2024
JHU IWSLT 2024 Dialectal and Low-resource System Description
ACL 2024
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation
INTERSPEECH 2024
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition
COLING 2024
Multi-Channel Multi-Speaker ASR Using Target Speakerโs Solo Segment
INTERSPEECH 2024
GPU-accelerated Guided Source Separation for Meeting Transcription
INTERSPEECH 2023
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
INTERSPEECH 2023
JHU IWSLT 2023 Dialect Speech Translation System Description
ACL 2023
JHU IWSLT 2023 Multilingual Speech Translation System Description
ACL 2023
Investigating model performance in language identification: beyond simple error statistics
INTERSPEECH 2023
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
INTERSPEECH 2023
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation
INTERSPEECH 2023
Chunking Defense for Adversarial Attacks on ASR
INTERSPEECH 2022
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
INTERSPEECH 2022
JHU IWSLT 2022 Dialect Speech Translation System Description
ACL 2022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser
INTERSPEECH 2022
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem
INTERSPEECH 2021
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora
EMNLP 2021
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio
INTERSPEECH 2021
Speaker Verification-Based Evaluation of Single-Channel Speech Separation
INTERSPEECH 2021
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition
INTERSPEECH 2021
End-to-End Language Diarization for Bilingual Code-Switching Speech
INTERSPEECH 2021
An Alternative to MFCCs for ASR
INTERSPEECH 2020
Efficient MDI Adaptation for n-Gram Language Models
INTERSPEECH 2020
Wake Word Detection with Alignment-Free Lattice-Free MMI
INTERSPEECH 2020
Neural Language Modeling with Implicit Cache Pointers
INTERSPEECH 2020
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR
INTERSPEECH 2020
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network
INTERSPEECH 2019
Multi-PLDA Diarization on Childrenโs Speech
INTERSPEECH 2019
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18
INTERSPEECH 2019
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition
INTERSPEECH 2019
Speaker Recognition Benchmark Using the CHiME-5 Corpus
INTERSPEECH 2019
The JHU Speaker Recognition System for the VOiCES 2019 Challenge
INTERSPEECH 2019
The JHU ASR System for VOiCES from a Distance Challenge 2019
INTERSPEECH 2019
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings
INTERSPEECH 2019
End-to-end Speech Recognition Using Lattice-free MMI
INTERSPEECH 2018
A GPU-based WFST Decoder with Exact Lattice Generation
INTERSPEECH 2018
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages
INTERSPEECH 2018
Output-Gate Projected Gated Recurrent Unit for Speech Recognition
INTERSPEECH 2018
Acoustic Modeling from Frequency Domain Representations of Speech
INTERSPEECH 2018
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge
INTERSPEECH 2018
Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition
INTERSPEECH 2018
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks
INTERSPEECH 2018
End-to-end Deep Neural Network Age Estimation
INTERSPEECH 2018
The Kaldi OpenKWS System: Improving Low Resource Keyword Search
INTERSPEECH 2017
Backstitch: Counteracting Finite-Sample Bias via Negative Steps
INTERSPEECH 2017
An Exploration of Dropout with LSTMs
INTERSPEECH 2017
Deep Neural Network Embeddings for Text-Independent Speaker Verification
INTERSPEECH 2017
Phone Duration Modeling for LVCSR Using Neural Networks
INTERSPEECH 2017
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework
INTERSPEECH 2017
Topic Identification for Speech Without ASR
INTERSPEECH 2017
Acoustic Modelling from the Signal Domain Using CNNs
INTERSPEECH 2016
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI
INTERSPEECH 2016
Far-Field ASR Without Parallel Data
INTERSPEECH 2016
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation
EMNLP 2015
Online Learning in Tensor Space
ACL 2014
Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection
ACL 2014
Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining
ACL 2012
Revisiting the Case for Explicit Syntactic Information in Language Models
NAACL 2012
Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT
NAACL 2012
Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation
EMNLP 2011
Efficient Subsampling for Training Complex Language Models
EMNLP 2011
A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval
NAACL 2010
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
COLING 2010
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
IJCNLP 2009
Efficient Extraction of Oracle-best Translations from Hypergraphs
NAACL 2009
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
ACL 2009
Variational Decoding for Statistical Machine Translation
ACL 2009
Variational Decoding for Statistical Machine Translation
IJCNLP 2009
Machine Translation System Combination using ITG-based Alignments
ACL 2008
Unsupervised Learning of Acoustic Sub-word Units
ACL 2008
Cross-Instance Tuning of Unsupervised Document Clustering Algorithms
NAACL 2007
A Smorgasbord of Features for Statistical Machine Translation
NAACL 2004
Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition
NAACL 2003
Desparately Seeking Cebuano
NAACL 2003
Cross-Lingual Lexical Triggers in Statistical Language Modeling
EMNLP 2003