Co-occurring keywords
Papers
MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering
EMNLP 2023
Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)
AAAI 2022
Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks
ACL 2022
Flexible Visual Grounding
ACL 2022
Neuro-Symbolic Visual Dialog
COLING 2022