Research Explorer
Video Understanding
1592 directly classified papers
Papers per year
2006: 1
2012: 1
2013: 30
2014: 15
2015: 38
2016: 22
2017: 39
2018: 49
2019: 91
2020: 115
2021: 207
2022: 160
2023: 254
2024: 216
2025: 297
2026: 57
Papers
CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes
NeurIPS 2024
SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM
NeurIPS 2024
SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
CVPR 2024
Step Differences in Instructional Video
CVPR 2024
Video-Based Human Pose Regression via Decoupled Space-Time Aggregation
CVPR 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
ACL 2024
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024
Rethink Cross-Modal Fusion in Weakly-Supervised Audio-Visual Video Parsing
WACV 2024
Koala: Key Frame-Conditioned Long Video-LLM
CVPR 2024
Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow
CVPR 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
CVPR 2024
Movie Genre Classification by Language Augmentation and Shot Sampling
WACV 2024
Enhanced Motion-Text Alignment for Image-to-Video Transfer Learning
CVPR 2024
Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model
CVPR 2024
Learning Object State Changes in Videos: An Open-World Perspective
CVPR 2024
Previously on ... From Recaps to Story Summarization
CVPR 2024
Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation
CVPR 2024
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
CVPR 2024
Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection
CVPR 2024
The Background Also Matters: Background-Aware Motion-Guided Objects Discovery
WACV 2024
Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges
CVPR 2024
Vript: A Video Is Worth Thousands of Words
NeurIPS 2024
Implicit Motion Function
CVPR 2024
Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection
CVPR 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
CVPR 2024