conftrace_
2024 ECCV ECCV 2024

Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning