Computer Vision › Generation ›

Video Generation

1433 directly classified papers

Papers per year

Papers

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models CVPR 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback CVPR 2024

DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation CVPR 2024

Video Interpolation with Diffusion Models CVPR 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation CVPR 2024

PEEKABOO: Interactive Video Generation via Masked-Diffusion CVPR 2024

FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation CVPR 2024

Bidirectional Autoregessive Diffusion Model for Dance Generation CVPR 2024

BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics CVPR 2024

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis CVPR 2024

VBench: Comprehensive Benchmark Suite for Video Generative Models CVPR 2024

PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion CVPR 2024

CCEdit: Creative and Controllable Video Editing via Diffusion Models CVPR 2024

Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models CVPR 2024

Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text CVPR 2024

MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production ACL 2024

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation CVPR 2024

Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis CVPR 2024

VidToMe: Video Token Merging for Zero-Shot Video Editing CVPR 2024

Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes CVPR 2024

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models CVPR 2024

Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation ACL 2024

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos CVPR 2024

L4GM: Large 4D Gaussian Reconstruction Model NIPS 2024

Vivid-ZOO: Multi-View Video Generation with Diffusion Model NIPS 2024