Co-occurring keywords
Papers
Language Decoupling with Fine-grained Knowledge Guidance for Referring Multi-object Tracking
ICCV 2025
Enhancing Spatial Reasoning in Multimodal Large Language Models through Reasoning-based Segmentation
ICCV 2025
KIA: Knowledge-Guided Implicit Vision-Language Alignment for Chest X-Ray Report Generation
COLING 2025
TS-CLIP: Time Series Understanding by CLIP
EMNLP 2025