Co-occurring keywords
Papers
Bidirectional Multi-Step Domain Generalization for Visible-Infrared Person Re-Identification
WACV 2025
3D Denoisers Are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation
AAAI 2025
Learning to See through Sound: From VggCaps to Multi2Cap for Richer Automated Audio Captioning
EMNLP 2025
From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech
CVPR 2025