Papers
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes
ACL 2021
Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
NeurIPS 2021
Recognizing Multimodal Entailment
ACL 2021