Co-occurring keywords
Papers
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024
Transductive Zero-Shot and Few-Shot CLIP
CVPR 2024
Training-free Deep Concept Injection Enables Language Models for Video Question Answering
EMNLP 2024
Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting
NIPS 2024
ABM: Attention before Manipulation
IJCAI 2024