Co-occurring keywords
Papers
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
INTERSPEECH 2023
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering
INTERSPEECH 2023
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding
INTERSPEECH 2023
Improving Code-Switching and Name Entity Recognition in ASR with Speech Editing based Data Augmentation
INTERSPEECH 2023
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding
INTERSPEECH 2023
Iteratively Improving Speech Recognition and Voice Conversion
INTERSPEECH 2023
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling
INTERSPEECH 2023
Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
INTERSPEECH 2023
MOCKS 1.0: Multilingual Open Custom Keyword Spotting Testset
INTERSPEECH 2023
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
INTERSPEECH 2023
CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning
INTERSPEECH 2023
Don’t Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
INTERSPEECH 2023
Using Random Forests to classify language as a function of syllable timing in two groups: children with cochlear implants and with normal hearing
INTERSPEECH 2023