← Models

Deep Learning › Models ›

Diffusion Models

4342 directly classified papers

Papers per year

Papers

PatchScaler: An Efficient Patch-Independent Diffusion Model for Image Super-Resolution ICCV 2025

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization ICCV 2025

RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions ICCV 2025

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation ICCV 2025

Diffusion Renderer: Neural Inverse and Forward Rendering with Video Diffusion Models CVPR 2025

Estimating Body and Hand Motion in an Ego-sensed World CVPR 2025

Dual Prompting Image Restoration with Diffusion Transformers CVPR 2025

Enhancing Creative Generation on Stable Diffusion-based Models CVPR 2025

Denoising Functional Maps: Diffusion Models for Shape Correspondence CVPR 2025

ProReflow: Progressive Reflow with Decomposed Velocity CVPR 2025

Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention CVPR 2025

BADGR: Bundle Adjustment Diffusion Conditioned by Gradients for Wide-Baseline Floor Plan Reconstruction CVPR 2025

DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture CVPR 2025

DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes CVPR 2025

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation CVPR 2025

REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning CVPR 2025

SketchVideo: Sketch-based Video Generation and Editing CVPR 2025

AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models CVPR 2025

BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations CVPR 2025

LaVin-DiT: Large Vision Diffusion Transformer CVPR 2025

Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization CVPR 2025

SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing CVPR 2025

S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors CVPR 2025

Structured 3D Latents for Scalable and Versatile 3D Generation CVPR 2025

Unbiased Missing-modality Multimodal Learning ICCV 2025