Co-occurring keywords
Papers
Team ML_Forge@DravidianLangTech 2025: Multimodal Hate Speech Detection in Dravidian Languages
NAACL 2025
On The Performance of EMA-synchronized Speech and Stand-alone Speech in Acoustic-to-articulatory Inversion
INTERSPEECH 2024
ED-sKWS: Early-Decision Spiking Neural Networks for Rapid, and Energy-Efficient Keyword Spotting
INTERSPEECH 2024
Towards Classifying Mother Tongue from Infant Cries - Findings Substantiating Prenatal Learning Theory
INTERSPEECH 2024
Post-Net: A linguistically inspired sequence-dependent transformed neural architecture for automatic syllable stress detection
INTERSPEECH 2024
On the social bias of speech self-supervised models
INTERSPEECH 2024
Towards Self-Attention Understanding for Automatic Articulatory Processes Analysis in Cleft Lip and Palate Speech
INTERSPEECH 2024
On Comparing Time- and Frequency-Domain Rhythm Measures in Classifying Assamese Dialects
INTERSPEECH 2024
AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents
EACL 2024
A Cross-Attention Layer coupled with Multimodal Fusion Methods for Recognizing Depression from Spontaneous Speech
INTERSPEECH 2024
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios
INTERSPEECH 2024
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces
NAACL 2024
BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis
INTERSPEECH 2024
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline
COLING 2024
Linear-Complexity Self-Supervised Learning for Speech Processing
INTERSPEECH 2024