Co-occurring keywords
Papers
Mitigating Endogenous Confirmation Bias in Noisy Label Learning for Vision-Language Models
AAAI 2026
Learning to See through Sound: From VggCaps to Multi2Cap for Richer Automated Audio Captioning
EMNLP 2025
Bridging Semantic and Modality Gaps in Zero-Shot Captioning via Retrieval from Synthetic Data
EMNLP 2025
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval
AAAI 2025
Generative Planning with 3D-Vision Language Pre-training for End-to-End Autonomous Driving
AAAI 2025