Sibo Song
5 papers · 2022–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (2) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (16)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
β
The Questioner
Conferences
CVPR (3)
ACL (1)
MICCAI (1)
Top co-authors
Keywords
multimodal learning
(2)
document understanding
(2)
scene text detection
(1)
document analysis
(1)
vision-language model
(1)
vision-language pre-training
(1)
masked language modeling
(1)
cross-modal interaction
(1)
table recognition
(1)
medical visual question answering
(1)
entity recognition
(1)
key information extraction
(1)
logical coherence
(1)
unified framework
(1)
text recognition
(1)
entity labeling
(1)
text spotting
(1)
image-text contrastive learning
(1)
cross-modal encoder
(1)
word-in-image prediction
(1)
Papers
Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering
ACL 2026
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
MICCAI 2025
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition
CVPR 2024
Modeling Entities As Semantic Points for Visual Information Extraction in the Wild
CVPR 2023
Vision-Language Pre-Training for Boosting Scene Text Detectors
CVPR 2022