Zhuowan Li
12 papers · 2018–2025 · 6 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Renaissance Researcher
(7)
Taxonomy Completionist
(40)
Hot Topic Early Bird
Conference Polyglot
(6)
Academic Marathon
(7)
Topic Evolution
Century Club
(12)
The Questioner
Unstoppable
(6)
Keyword Collector
(64)
Conferences
CVPR (5)
ICCV (2)
NIPS (2)
EACL (1)
EMNLP (1)
NAACL (1)
Keywords
visual question answering
(6)
multimodal learning
(2)
chart understanding
(2)
representation learning
(2)
data augmentation
(2)
large language model
(2)
pose estimation
(1)
question answering
(1)
domain generalization
(1)
model robustness
(1)
image generation
(1)
prompt engineering
(1)
visual reasoning
(1)
image captioning
(1)
person re-identification
(1)
3d vision
(1)
object detection
(1)
visual representation
(1)
visual context
(1)
contrastive learning
(1)
Papers
Effective Training Data Synthesis for Improving MLLM Chart Understanding
ICCV 2025
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
CVPR 2024
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
CVPR 2024
Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models
EACL 2024
Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach
EMNLP 2024
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
NIPS 2023
Super-CLEVR: A Virtual Benchmark To Diagnose Domain Robustness in Visual Reasoning
CVPR 2023
Visual Commonsense in Pretrained Unimodal and Multimodal Models
NAACL 2022
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
CVPR 2022
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
ICCV 2021
Context-Aware Group Captioning via Self-Attention and Contrastive Features
CVPR 2020
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification
NIPS 2018