Co-occurring keywords
Papers
CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers
CVPR 2024
Referring Expression Counting
CVPR 2024
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models
INTERSPEECH 2024
DINO-VITS: Data-Efficient Zero-Shot TTS with Self-Supervised Speaker Verification Loss for Noise Robustness
INTERSPEECH 2024
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation
ACL 2024
IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS
NIPS 2024
Dense Connector for MLLMs
NIPS 2024