Multi-Modal Learning
3194 directly classified papers
Papers per year
Papers
What Does BERT with Vision Look At?
ACL 2020
FaceFilter: Audio-Visual Speech Separation Using Still Images
INTERSPEECH 2020
Fusion Architectures for Word-Based Audiovisual Speech Recognition
INTERSPEECH 2020
Audio-Visual Multi-Channel Recognition of Overlapped Speech
INTERSPEECH 2020
Vocoder-Based Speech Synthesis from Silent Videos
INTERSPEECH 2020
Domain Adversarial Neural Networks for Dysarthric Speech Recognition
INTERSPEECH 2020