Papers
Event Camera Data Pre-training
ICCV 2023
Turbo your multi-modal classification with contrastive learning
INTERSPEECH 2023
PromptStyle: Controllable Style Transfer for Text-to-Speech with Natural Language Descriptions
INTERSPEECH 2023
Multi-Scale Attention for Audio Question Answering
INTERSPEECH 2023
Image-driven Audio-visual Universal Source Separation
INTERSPEECH 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
INTERSPEECH 2023