Co-occurring keywords
Papers
Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss
INTERSPEECH 2024
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
INTERSPEECH 2024
AR-NLU: A Framework for Enhancing Natural Language Understanding Model Robustness against ASR Errors
INTERSPEECH 2024
Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation
INTERSPEECH 2024
Self-Supervised Learning with Multi-Head Multi-Mode Knowledge Distillation for Speaker Verification
INTERSPEECH 2024
M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
INTERSPEECH 2024
HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model
NIPS 2024
Multi-Loss Fusion: Angular and Contrastive Integration for Machine-Generated Text Detection
EMNLP 2024