Co-occurring keywords
Papers
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning
CVPR 2024
Seek Commonality but Preserve Differences: Dissected Dynamics Modeling for Multi-modal Visual RL
NIPS 2024
Text-conditional Attribute Alignment across Latent Spaces for 3D Controllable Face Image Synthesis
CVPR 2024