Research Explorer
Video Generation
1433 directly classified papers
Papers per year
2006: 2
2007: 1
2013: 8
2014: 2
2015: 3
2016: 10
2017: 15
2018: 27
2019: 56
2020: 56
2021: 85
2022: 81
2023: 177
2024: 277
2025: 540
2026: 93
Papers
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
CVPR 2024
InstructVideo: Instructing Video Diffusion Models with Human Feedback
CVPR 2024
DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation
CVPR 2024
Video Interpolation with Diffusion Models
CVPR 2024
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
CVPR 2024
PEEKABOO: Interactive Video Generation via Masked-Diffusion
CVPR 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
CVPR 2024
Bidirectional Autoregressive Diffusion Model for Dance Generation
CVPR 2024
BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics
CVPR 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
CVPR 2024
VBench: Comprehensive Benchmark Suite for Video Generative Models
CVPR 2024
PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion
CVPR 2024
CCEdit: Creative and Controllable Video Editing via Diffusion Models
CVPR 2024
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
CVPR 2024
Neural Sign Actors: A Diffusion Model for 3D Sign Language Production from Text
CVPR 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
ACL 2024
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
CVPR 2024
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
CVPR 2024
VidToMe: Video Token Merging for Zero-Shot Video Editing
CVPR 2024
Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes
CVPR 2024
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
CVPR 2024
Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation
ACL 2024
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
CVPR 2024
L4GM: Large 4D Gaussian Reconstruction Model
NeurIPS 2024
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
NeurIPS 2024