Research Explorer
Video Generation
1433 directly classified papers
Papers per year
2006: 2
2007: 1
2013: 8
2014: 2
2015: 3
2016: 10
2017: 15
2018: 27
2019: 56
2020: 56
2021: 85
2022: 81
2023: 177
2024: 277
2025: 540
2026: 93
Papers
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
ACL 2024
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
AAAI 2024
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
ACL 2024
LLM Knows Body Language, Too: Translating Speech Voices into Human Gestures
ACL 2024
Sign Language Translation with Sentence Embedding Supervision
ACL 2024
Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
AAAI 2024
Deep Hierarchical Video Compression
AAAI 2024
Video Frame Prediction from a Single Image and Events
AAAI 2024
Bridge to Non-Barrier Communication: Gloss-Prompted Fine-Grained Cued Speech Gesture Generation with Diffusion Model
IJCAI 2024
Enhanced Fine-Grained Motion Diffusion for Text-Driven Human Motion Synthesis
AAAI 2024
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization
CVPR 2024
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
GauHuman: Articulated Gaussian Splatting from Monocular Human Videos
CVPR 2024
POPDG: Popular 3D Dance Generation with PopDanceSet
CVPR 2024
Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models
NeurIPS 2024
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
WACV 2024
Controlling Character Motions Without Observable Driving Source
WACV 2024
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024
Motion Diversification Networks
CVPR 2024
ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis
CVPR 2024
DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
CVPR 2024
Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis
CVPR 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
CVPR 2024
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping
AAAI 2024
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
CVPR 2024