Papers
Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models
EMNLP 2024
Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models
ACL 2024
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
EMNLP 2024
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models
EMNLP 2024
Visual Pivoting Unsupervised Multimodal Machine Translation in Low-Resource Distant Language Pairs
EMNLP 2024
Training-free Deep Concept Injection Enables Language Models for Video Question Answering
EMNLP 2024
A Vision Check-up for Language Models
CVPR 2024