Co-occurring keywords
Papers
CoT-VLNBench: A Benchmark for Visual Chain-of-Thought Reasoning in Vision-Language-Navigation Robots
AAAI 2026
BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards
AAAI 2026
CRAF: A Clinical Reasoning-Adaptive Framework via Reinforcement Learning for Similar Case Retrieval
AAAI 2026
GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models
AAAI 2026
TraveLLaMA: A Multimodal Travel Assistant with Large-Scale Dataset and Structured Reasoning
AAAI 2026
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos
WACV 2026
Snap Out of It: A Dual-Process Approach to Mitigating Overthinking in Language Model Reasoning
ACL 2025
IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems
ACL 2025
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
ICCV 2025