Zhibin Lan
6 papers · 2023–2025 · 2 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (13)
Keyword Pioneer
Hot Topic Early Bird
Cross-Pollinator (15)
Conferences
ACL (3)
EMNLP (3)
Top co-authors
Keywords
optical character recognition (2)
multimodal learning (2)
text image translation (2)
vision-language model (2)
adaptive inference (1)
diffusion model (1)
large multimodal model (1)
visual text generation (1)
large vision-language model (1)
image understanding (1)
image-text alignment (1)
visual token (1)
cross-attention module (1)
multimodal embedding (1)
hard negative mining (1)
text rendering (1)
visual text (1)
text image machine translation (1)
position-aware translation (1)
region-specific translation (1)
Papers
AVG-LLaVA: An Efficient Large Multimodal Model with Adaptive Visual Granularity
ACL 2025
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
EMNLP 2025
PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models
EMNLP 2025
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation
ACL 2024
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training
EMNLP 2024
Exploring Better Text Image Translation with Multimodal Codebook
ACL 2023