Co-occurring keywords
Papers
Knowledge-Enhanced Image Captioning with Adaptive Graph-based Multimodal Alignment and LLM
AAAI 2026
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
IJCAI 2025
Enhancing Large Language Models for Scientific Multimodal Summarization with Multimodal Output
COLING 2025
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
CVPR 2025
JNLP at SemEval-2025 Task 1: Multimodal Idiomaticity Representation with Large Language Models
ACL 2025