Multi-Modal Learning
1213 directly classified papers
Papers per year
Papers
Rethinking the Visual Cues in Audio-Visual Speaker Extraction
INTERSPEECH 2023
Multi-channel separation of dynamic speech and sound events
INTERSPEECH 2023