Papers
Spoken-to-written text conversion with Large Language Model
INTERSPEECH 2024
Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models
INTERSPEECH 2024
CaptainA self-study mobile app for practising speaking: task completion assessment and feedback with generative AI
INTERSPEECH 2024
Prompting Large Language Models with Mispronunciation Detection and Diagnosis Abilities
INTERSPEECH 2024
Lightweight Transducer Based on Frame-Level Criterion
INTERSPEECH 2024
A Multitask Training Approach to Enhance Whisper with Open-Vocabulary Keyword Spotting
INTERSPEECH 2024
Multimodal Belief Prediction
INTERSPEECH 2024
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
INTERSPEECH 2023
Improving Joint Speech-Text Representations Without Alignment
INTERSPEECH 2023
Leveraging Cross-Utterance Context For ASR Decoding
INTERSPEECH 2023
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
INTERSPEECH 2023
Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed Speech
INTERSPEECH 2023
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
INTERSPEECH 2023
Blank Collapse: Compressing CTC Emission for the Faster Decoding
INTERSPEECH 2023