Co-occurring keywords
Papers
BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis
INTERSPEECH 2024
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
ACL 2024
Interface Design for Self-Supervised Speech Models
INTERSPEECH 2024
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach
INTERSPEECH 2024
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces
NAACL 2024
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline
COLING 2024
Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation
INTERSPEECH 2024
AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents
EACL 2024
Deep Prosodic Features in Tandem with Perceptual Judgments of Word Reduction for Tone Recognition in Conversed Speech
INTERSPEECH 2024
On The Performance of EMA-synchronized Speech and Stand-alone Speech in Acoustic-to-articulatory Inversion
INTERSPEECH 2024
Linear-Complexity Self-Supervised Learning for Speech Processing
INTERSPEECH 2024
XANE: eXplainable Acoustic Neural Embeddings
INTERSPEECH 2024