Video Understanding
2,296 papers
Papers per year
2
2
63
28
59
34
51
69
135
164
277
244
339
321
423
85
'15
'20
'25
Papers
A Transformer-Based Audio Captioning Model with Keyword Estimation
INTERSPEECH 2020
Caption Alignment for Low Resource Audio-Visual Data
INTERSPEECH 2020
Vocoder-Based Speech Synthesis from Silent Videos
INTERSPEECH 2020