Le Zhuo
11 papers · 2023–2026 · 6 top CS/AI conferences
Achievements
Renaissance Researcher (6)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (5)
Taxonomy Completionist (17)
Cross-Pollinator (14)
Mega-Team (20)
Prolific Year (7)
Century Club (10)
Conferences
ICCV (4)
ICLR (3)
AAAI (1)
ACL (1)
CVPR (1)
NeurIPS (1)
Keywords
diffusion transformer (4)
multimodal learning (3)
zero-shot learning (2)
image generation (2)
text-to-image generation (2)
diffusion model (2)
image captioning (1)
cross-modal learning (1)
image synthesis (1)
flow-based diffusion (1)
task generalization (1)
frame selection (1)
flag-dit architecture (1)
text-to-image diffusion (1)
sparse autoencoder (1)
protein language model (1)
progressive training (1)
video question answering (1)
visual in-context learning (1)
in-context learning (1)
Papers
TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation
AAAI 2026
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
CVPR 2025
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
ICCV 2025
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
ICLR 2025
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
ICLR 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
ICCV 2025
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
ICCV 2025
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
ACL 2024
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
NeurIPS 2024
Video Background Music Generation: Dataset, Method and Evaluation
ICCV 2023