Papers
DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion Models
Karl Holmquist, Bastian Wandt
DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation
Runyang Feng, Yixing Gao, Tze Ho Elden Tse et al.
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers
Mengzhao Chen, Wenqi Shao, Peng Xu et al.
Diff-Retinex: Rethinking Low-light Image Enhancement with A Generative Diffusion Model
Xunpeng Yi, Han Xu, Hao Zhang et al.
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag, Xiatian Zhu, Jiankang Deng et al.
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Weijia Wu, Yuzhong Zhao, Mike Zheng Shou et al.
Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion
Yutao Jiang, Yang Zhou, Yuan Liang et al.
Diffusion Action Segmentation
Daochang Liu, Qiyue Li, Anh-Dung Dinh et al.
Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation
Wenkang Shan, Zhenhua Liu, Xinfeng Zhang et al.
Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation
Duo Peng, Ping Hu, Qiuhong Ke et al.
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen, Peize Sun, Yibing Song et al.
Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips
Yufei Ye, Poorvi Hebbar, Abhinav Gupta et al.
Diffusion in Style
Martin Nicolas Everaert, Marco Bocchio, Sami Arpa et al.
Diffusion Model as Representation Learner
Xingyi Yang, Xinchao Wang
Diffusion Models as Masked Autoencoders
Chen Wei, Karttikeya Mangalam, Po-Yao Huang et al.
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin, Hao Li, Zesen Cheng et al.
Diffusion-SDF: Conditional Generative Modeling of Signed Distance Functions
Gene Chou, Yuval Bahat, Felix Heide
DiffV2S: Diffusion-Based Video-to-Speech Synthesis with Vision-Guided Speaker Embedding
Jeongsoo Choi, Joanna Hong, Yong Man Ro
D-IF: Uncertainty-aware Human Digitization via Implicit Distribution Field
Xueting Yang, Yihao Luo, Yuliang Xiu et al.
DiLiGenT-Pi: Photometric Stereo for Planar Surfaces with Rich Details - Benchmark Dataset and Beyond
Feishi Wang, Jieji Ren, Heng Guo et al.
DIME-FM : DIstilling Multimodal and Efficient Foundation Models
Ximeng Sun, Pengchuan Zhang, Peizhao Zhang et al.
DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars
David Svitov, Dmitrii Gudkov, Renat Bashirov et al.
DIRE for Diffusion-Generated Image Detection
Zhendong Wang, Jianmin Bao, Wengang Zhou et al.
Discovering Spatio-Temporal Rationales for Video Question Answering
Yicong Li, Junbin Xiao, Chun Feng et al.
Discrepant and Multi-Instance Proxies for Unsupervised Person Re-Identification
Chang Zou, Zeqi Chen, Zhichao Cui et al.