Papers
Few-Shot Audio-Visual Class-Incremental Learning with Temporal Prompting and Regularization
AAAI 2025
EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics
AAAI 2025
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
CVPR 2024
Continual Audio-Visual Sound Separation
NeurIPS 2024
Segment beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation
AAAI 2024
Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification
INTERSPEECH 2024
Towards Multilingual Audio-Visual Question Answering
INTERSPEECH 2024
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert
INTERSPEECH 2024