Yizhi Song
6 papers · 2023–2026 · 5 top CS/AI conferences
Achievements
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (11)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
CVPR (2)
AAAI (1)
ECCV (1)
ICLR (1)
WACV (1)
Keywords
object compositing (2)
diffusion model (2)
image generation (2)
generative model (2)
conditional generation (1)
image editing (1)
object tracking (1)
image harmonization (1)
vision language model (1)
multimodal large language model (1)
visual instruction tuning (1)
temporal coherence (1)
identity preservation (1)
shape guidance (1)
generative object compositing (1)
detail preservation (1)
diffusion-based image generation (1)
3d visual instruction dataset (1)
representation learning (1)
camera-object relation recognition (1)
Papers
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation
WACV 2026
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
AAAI 2026
Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment
ICLR 2025
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
CVPR 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
ECCV 2024
ObjectStitch: Object Compositing With Diffusion Model
CVPR 2023