Co-occurring keywords
Papers
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
NIPS 2024
Towards Text-guided 3D Scene Composition
CVPR 2024
Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
AAAI 2024