Co-occurring keywords
Papers
Adapting an Unadaptable ASR System
INTERSPEECH 2023
Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment
ACL 2023
MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization
INTERSPEECH 2023
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages
EMNLP 2023
PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images
INTERSPEECH 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
EMNLP 2023
Head movements in two- and four-person interactive conversational tasks in noisy and moderately reverberant conditions
INTERSPEECH 2023
ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
INTERSPEECH 2023
Investigating Reproducibility at Interspeech Conferences: A Longitudinal and Comparative Perspective
INTERSPEECH 2023
Towards Multi-Lingual Audio Question Answering
INTERSPEECH 2023
Exploration on HuBERT with Multiple Resolution
INTERSPEECH 2023
Biased Self-supervised Learning for ASR
INTERSPEECH 2023
Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings
INTERSPEECH 2023
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement
INTERSPEECH 2023
Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute
INTERSPEECH 2023
Probing Self-supervised Speech Models for Phonetic and Phonemic Information: A Case Study in Aspiration
INTERSPEECH 2023
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
INTERSPEECH 2023