Co-occurring keywords
Papers
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
CVPR 2025
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
CVPR 2025
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information
AAAI 2025
A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
COLING 2025
Referring to Any Person
ICCV 2025
OVEL: Online Video Entity Linking
COLING 2025