Ching-Chen Kuo
4 papers · 2024–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (2) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (14) π Cross-Pollinator (14)
π
Renaissance Researcher
(5)
β
The Questioner
Conferences
ACL (2)
EMNLP (2)
Top co-authors
Keywords
multimodal large language model
(4)
multimodal learning
(3)
multimodal reasoning
(2)
synthetic datum
(1)
semantic mismatch
(1)
graphical user interface
(1)
multipanel image
(1)
inference-time intervention
(1)
layout understanding
(1)
gui navigation
(1)
gui screen reading
(1)
layout awareness
(1)
hierarchical layout tree
(1)
screen point-and-read
(1)
underspecified input
(1)
misspecified scenario
(1)
image comprehension
(1)
screen reading
(1)
inconsistency detection
(1)
layout grounding
(1)
Papers
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
ACL 2025
Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs
EMNLP 2025
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA
ACL 2024
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
EMNLP 2024