Zhiqi Ge
4 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (12)
Conference Polyglot (4)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (5)
Taxonomy Completionist (12)
Mega-Team (32)
Conferences
ICCV (1)
ICLR (1)
ICML (1)
NIPS (1)
Keywords
multimodal large language model (2)
discriminative training (1)
generative training (1)
dynamic time warping (1)
vision language model (1)
multi-modal large language model (1)
vision-language model (1)
semantic differentiation (1)
zero-shot classification (1)
gui automation (1)
visual agent (1)
graphical user interface automation (1)
adaptive cropping (1)
self-refining learning (1)
large language model (1)
multimodal learning (1)
information-sensitive cropping (1)
Papers
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
ICCV 2025
On Path to Multimodal Generalist: General-Level and General-Bench
ICML 2025
Unified Generative and Discriminative Training for Multi-modal Large Language Models
NIPS 2024
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
ICLR 2024