Papers
Matching Latent Encoding for Audio-Text based Keyword Spotting
INTERSPEECH 2023
Semantic Enrichment Towards Efficient Speech Representations
INTERSPEECH 2023
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
INTERSPEECH 2023
Multimodal Speech Recognition for Language-Guided Embodied Agents
INTERSPEECH 2023