Xichen Pan
6 papers · 2022–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Conference Polyglot (6) π Cross-Pollinator (8) π Renaissance Researcher (5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(20)
π
Trend Setter
Conferences
ACL (1)
CVPR (1)
ICLR (1)
ICML (1)
NIPS (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(2)
self-supervised learning
(2)
vision-language alignment
(1)
image editing
(1)
visual grounding
(1)
story visualization
(1)
visual representation
(1)
neural rendering
(1)
object manipulation
(1)
visual representation learning
(1)
generative model
(1)
vision language model
(1)
vision-language model
(1)
multimodal large language model
(1)
visual instruction tuning
(1)
spatial vision aggregator
(1)
vision model
(1)
latent diffusion model
(1)
audio-visual speech recognition
(1)
coherent image generation
(1)
Papers
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop
ICML 2025
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
NIPS 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
CVPR 2024
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
ICLR 2024
Synthesizing Coherent Story With Auto-Regressive Latent Diffusion Models
WACV 2024
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
ACL 2022