Qintong Zhang
3 papers · 2025–2026 · 3 top CS/AI conferences
Achievements
🌍 Conference Polyglot (2)
🌉 Interdisciplinary Bridge
🐝 Cross-Pollinator (15)
Conferences
ACL (1)
EMNLP (1)
ICCV (1)
Keywords
multimodal learning (1)
document understanding (1)
document parsing (1)
efficient inference (1)
knowledge base (1)
computational efficiency (1)
vision-language model (1)
multimodal large language model (1)
coarse-to-fine strategy (1)
token pruning (1)
retrieval-augmented generation (1)
inference acceleration (1)
optical character recognition (1)
layout analysis (1)
efficient attention (1)
vision token (1)
duplication detection (1)
ocr noise (1)
content recognition (1)
Papers
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
ACL 2026
Stop Looking for “Important Tokens” in Multimodal Language Models: Duplication Matters More
EMNLP 2025
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
ICCV 2025