Co-occurring keywords
Papers
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
EMNLP 2023
Automatic Prediction of Language Learners' Listenability Using Speech and Text Features Extracted from Listening Drills
INTERSPEECH 2023
Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings
INTERSPEECH 2023
VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining
EMNLP 2023
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities
EMNLP 2023
Abusive Speech Detection in Indic Languages Using Acoustic Features
INTERSPEECH 2023
Identifying Stable Sections for Formant Frequency Extraction of French Nasal Vowels Based on Difference Thresholds
INTERSPEECH 2023
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features
INTERSPEECH 2023
Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences
INTERSPEECH 2023
Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment
ACL 2023
Differential Privacy enabled Dementia Classification: An Exploration of the Privacy-Accuracy Trade-off in Speech Signal Data
INTERSPEECH 2023
Topological Data Analysis for Speech Processing
INTERSPEECH 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
ACL 2023
ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs
INTERSPEECH 2023
A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment
INTERSPEECH 2023