Papers
Physics Context Builders: A Modular Framework for Physical Reasoning in Vision-Language Models
ICCV 2025
CAPSTONE: Composable Attribute‐Prompted Scene Translation for Zero‐Shot Vision–Language Reasoning
EMNLP 2025
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
ICCV 2025