Papers
Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning
ACL 2023
Unsupervised Sounding Pixel Learning
EMNLP 2023
Few-Shot Learning With Visual Distribution Calibration and Cross-Modal Distribution Alignment
CVPR 2023
Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
CVPR 2023