speech processing

478 papers

Explore in graph

Co-occurring keywords

multimodal learning (4622) automatic speech recognition (1764) speech recognition (1223) self-supervised learning (3751) representation learning (6174) large language model (12755) acoustic feature (265) neural network (6616) speech analysis (363) feature extraction (1578)

Papers

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations EMNLP 2023

Automatic Prediction of Language Learners' Listenability Using Speech and Text Features Extracted from Listening Drills INTERSPEECH 2023

Dialect Transfer for Swiss German Speech Translation EMNLP 2023

Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings INTERSPEECH 2023

VivesDebate-Speech: A Corpus of Spoken Argumentation to Leverage Audio Features for Argument Mining EMNLP 2023

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities EMNLP 2023

SQuAD-SRC: A Dataset for Multi-Accent Spoken Reading Comprehension IJCAI 2023

Abusive Speech Detection in Indic Languages Using Acoustic Features INTERSPEECH 2023

Identifying Stable Sections for Formant Frequency Extraction of French Nasal Vowels Based on Difference Thresholds INTERSPEECH 2023

Putting Natural in Natural Language Processing ACL 2023

Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features INTERSPEECH 2023

Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences INTERSPEECH 2023

Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment ACL 2023

Differential Privacy enabled Dementia Classification: An Exploration of the Privacy-Accuracy Trade-off in Speech Signal Data INTERSPEECH 2023

Does Listener Gaze in Face-to-Face Interaction Follow the Entropy Rate Constancy Principle: An Empirical Study EMNLP 2023

Topological Data Analysis for Speech Processing INTERSPEECH 2023

When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants ACL 2023

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs INTERSPEECH 2023

Memory Network-Based End-To-End Neural ES-KMeans for Improved Word Segmentation INTERSPEECH 2023

Learning Co-Speech Gesture for Multimodal Aphasia Type Detection EMNLP 2023

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment INTERSPEECH 2023

Consonant-emphasis Method Incorporating Robust Consonant-section Detection to Improve Intelligibility of Bone-conducted speech INTERSPEECH 2023

Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language ACL 2023

Emotions in Spoken Language - Do we need acoustics? ACL 2023

Using ASR-Generated Text for Spoken Language Modeling ACL 2022