Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Models
Deep Learning
›
Models
›
Diffusion Models
4342 directly classified papers
Papers per year
2010: 1
2015: 2
2016: 1
2018: 2
2019: 5
2020: 15
2021: 28
2022: 85
2023: 687
2024: 1279
2025: 1778
2026: 459
Papers
PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution
ICCV 2025
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
ICCV 2025
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions
ICCV 2025
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
ICCV 2025
Diffusion Renderer: Neural Inverse and Forward Rendering with Video Diffusion Models
CVPR 2025
Estimating Body and Hand Motion in an Ego-sensed World
CVPR 2025
Dual Prompting Image Restoration with Diffusion Transformers
CVPR 2025
Enhancing Creative Generation on Stable Diffusion-based Models
CVPR 2025
Denoising Functional Maps: Diffusion Models for Shape Correspondence
CVPR 2025
ProReflow: Progressive Reflow with Decomposed Velocity
CVPR 2025
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention
CVPR 2025
BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction
CVPR 2025
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
CVPR 2025
DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
CVPR 2025
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
CVPR 2025
REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning
CVPR 2025
SketchVideo: Sketch-based Video Generation and Editing
CVPR 2025
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
CVPR 2025
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
CVPR 2025
LaVin-DiT: Large Vision Diffusion Transformer
CVPR 2025
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
CVPR 2025
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
CVPR 2025
S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors
CVPR 2025
Structured 3D Latents for Scalable and Versatile 3D Generation
CVPR 2025
Unbiased Missing-modality Multimodal Learning
ICCV 2025
<
1
…
36
37
38
…
174
>