Tongkun Guan
7 papers · 2023–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (4) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
CVPR (2)
ECCV (2)
ICCV (2)
ACL (1)
Top co-authors
Keywords
document understanding
(3)
self-supervised learning
(2)
multi-modal large language model
(2)
multimodal large language model
(2)
visual foundation model
(1)
visual-language alignment
(1)
mask generation
(1)
scene text recognition
(1)
token-level prediction
(1)
text segmentation
(1)
visual language model
(1)
visual-text alignment
(1)
text recognition
(1)
visual language alignment
(1)
glyph structure
(1)
implicit attention
(1)
character segmentation
(1)
text-rich image understanding
(1)
character-to-character distillation
(1)
visual question answering
(1)
Papers
Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review
ACL 2025
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
ECCV 2024
Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
ECCV 2024
Self-Supervised Character-to-Character Distillation for Text Recognition
ICCV 2023
Self-Supervised Implicit Glyph Attention for Text Recognition
CVPR 2023