Papers
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
CVPR 2025
Learning to See through Sound: From VggCaps to Multi2Cap for Richer Automated Audio Captioning
EMNLP 2025
Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval
AAAI 2025