conftrace_

zero-shot learning

3650 papers

Also known as

ZSS, ZSL, ZS, ZERO-SHOT, ZL

Co-occurring keywords

large language model (13587), few-shot learning (3398), vision-language model (2348), transfer learning (5449), cross-lingual transfer (1477), text classification (6793), multimodal learning (4645), contrastive learning (4032), domain adaptation (4595), language model (4599)

Papers

VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens CVPR 2024

Active Prompt Learning in Vision Language Models CVPR 2024

MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers CVPR 2024

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks CVPR 2024

Language Models as Black-Box Optimizers for Vision-Language Models CVPR 2024

Anchor-based Robust Finetuning of Vision-Language Models CVPR 2024

Scaling Laws of Synthetic Images for Model Training ... for Now CVPR 2024

Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision CVPR 2024

General Object Foundation Model for Images and Videos at Scale CVPR 2024

X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization CVPR 2024

Label Propagation for Zero-shot Classification with Vision-Language Models CVPR 2024

Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework CVPR 2024

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation CVPR 2024

Transductive Zero-Shot and Few-Shot CLIP CVPR 2024

AnyDoor: Zero-shot Object-level Image Customization CVPR 2024

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model CVPR 2024

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D CVPR 2024

GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization EMNLP 2024

OpenGraph: Towards Open Graph Foundation Models EMNLP 2024

Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework EMNLP 2024

ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering EMNLP 2024

Training-free Deep Concept Injection Enables Language Models for Video Question Answering EMNLP 2024

Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting NeurIPS 2024

Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition CVPR 2024

ABM: Attention before Manipulation IJCAI 2024