Jihao Wu
5 papers · 2023–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(2)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(12)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (3)
EMNLP (2)
Top co-authors
Keywords
screen understanding
(2)
multimodal learning
(2)
image captioning
(2)
gui agent
(2)
knowledge distillation
(2)
action prediction
(1)
autonomous agent
(1)
vision-language model
(1)
controllable generation
(1)
edge computing
(1)
task automation
(1)
visual encoder
(1)
cross-modal fusion
(1)
graphical user interface
(1)
visual feature extraction
(1)
large language model
(1)
screen stream understanding
(1)
mobile gui agent
(1)
ui grounding
(1)
cross-lingual visual question answering
(1)
Papers
Cross-Lingual Text-Rich Visual Comprehension: An Information Theory Perspective
AAAI 2025
UI-Hawk: Unleashing the Screen Stream Understanding for Mobile GUI Agents
EMNLP 2025
Android in the Zoo: Chain-of-Action-Thought for GUI Agents
EMNLP 2024
Efficient Image Captioning for Edge Devices
AAAI 2023
Controllable Image Captioning via Prompting
AAAI 2023