Co-occurring keywords
Papers
Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
INTERSPEECH 2024
Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language
INTERSPEECH 2024
A Dataset and Two-pass System for Reading Miscue Detection
INTERSPEECH 2024
SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR
INTERSPEECH 2024
SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding
INTERSPEECH 2024
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
CVPR 2024
Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech
INTERSPEECH 2024
Beam-search SIEVE for low-memory speech recognition
INTERSPEECH 2024
Speech Recognition Models are Strong Lip-readers
INTERSPEECH 2024
A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
INTERSPEECH 2024
Quantifying Unintended Memorization in BEST-RQ ASR Encoders
INTERSPEECH 2024