Papers
ComprehendEdit: A Comprehensive Dataset and Evaluation Framework for Multimodal Knowledge Editing
AAAI 2025
Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
EMNLP 2025
HALLUCINOGEN: Benchmarking Hallucination in Implicit Reasoning within Large Vision Language Models
EMNLP 2025