Papers
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025
Unveiling the Power of Integration: Block Diagram Summarization through Local-Global Fusion
ACL 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding
EMNLP 2024
“What is the value of templates?” Rethinking Document Information Extraction Datasets for LLMs
EMNLP 2024
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding
COLING 2024