Pei Fu
7 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (21)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
AAAI (2)
ACL (2)
CVPR (2)
ICCV (1)
Top co-authors
Keywords
document understanding (3)
multi-modal large language model (2)
visual question answering (2)
multimodal large language model (2)
scene text detection (1)
instruction tuning (1)
vision language model (1)
vision-language model (1)
multimodal representation (1)
visual foundation model (1)
visual-language alignment (1)
mask generation (1)
token-level prediction (1)
visual language model (1)
optical character recognition (1)
iterative reasoning (1)
visual-text alignment (1)
schema linking (1)
text attribute (1)
text recognition (1)
Papers
Doc-V*: Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA
ACL 2026
AutoLink: Autonomous Schema Exploration and Expansion for Scalable Schema Linking in Text-to-SQL at Scale
AAAI 2026
InstructOCR: Instruction Boosting Scene Text Spotting
AAAI 2025
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
ACL 2025
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
CVPR 2024