Papers
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
CVPR 2023
Audio-Visual Scene Classification Based on Multi-modal Graph Fusion
INTERSPEECH 2022
A Transformer-Based Audio Captioning Model with Keyword Estimation
INTERSPEECH 2020
Multi-Scale Recognition With DAG-CNNs
ICCV 2015