Papers
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition
INTERSPEECH 2023
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition
INTERSPEECH 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
INTERSPEECH 2023
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?
INTERSPEECH 2023
An extension of disentanglement metrics and its application to voice
INTERSPEECH 2023
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) Representations
INTERSPEECH 2023
SVVAD: Personal Voice Activity Detection for Speaker Verification
INTERSPEECH 2023
MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization
INTERSPEECH 2023
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition
INTERSPEECH 2023
Improving Joint Speech-Text Representations Without Alignment
INTERSPEECH 2023
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
INTERSPEECH 2023
Improving Small Footprint Few-shot Keyword Spotting with Supervision on Auxiliary Data
INTERSPEECH 2023
Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system
INTERSPEECH 2023
Diverse Feature Mapping and Fusion via Multitask Learning for Multilingual Speech Emotion Recognition
INTERSPEECH 2023
Leveraging Label Information for Multimodal Emotion Recognition
INTERSPEECH 2023