Co-occurring keywords
Papers
FVTTS : Face Based Voice Synthesis for Text-to-Speech
INTERSPEECH 2024
Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning
COLING 2024