Co-occurring keywords
Papers
Beyond Visual Understanding Introducing PARROT-360V for Vision Language Model Benchmarking
COLING 2025
Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract)
AAAI 2025
SHIFT: Smoothing Hallucinations by Information Flow Tuning for Multimodal Large Language Models
ICCV 2025
Ambiguity-aware Multi-level Incongruity Fusion Network for Multi-Modal Sarcasm Detection
COLING 2025
What If LLMs Can Smell: A Prototype
IJCAI 2025