Computer Vision › Processing ›

Video Understanding

1592 directly classified papers

Papers per year

Papers

Visual Semantic Role Labeling for Video Understanding CVPR 2021

Multi-Shot Temporal Event Localization: A Benchmark CVPR 2021

Sketch, Ground, and Refine: Top-Down Dense Video Captioning CVPR 2021

STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding ICCV 2021

Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing NIPS 2021

MAU: A Motion-Aware Unit for Video Prediction and Beyond NIPS 2021

Activity Image-to-Video Retrieval by Disentangling Appearance and Motion AAAI 2021

Learning Self-Similarity in Space and Time As Generalized Motion for Video Action Recognition ICCV 2021

Long Short-Term Transformer for Online Action Detection NIPS 2021

Long Short View Feature Decomposition via Contrastive Video Representation Learning ICCV 2021

VidTr: Video Transformer Without Convolutions ICCV 2021

Learning Action Completeness From Points for Weakly-Supervised Temporal Action Localization ICCV 2021

Zero-Shot Natural Language Video Localization ICCV 2021

End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks ICCV 2021

DramaQA: Character-Centered Video Story Understanding with Hierarchical QA AAAI 2021

VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding EMNLP 2021

Intraoperative Adverse Event Detection in Laparoscopic Surgery: Stabilized Multi-Stage Temporal Convolutional Network with Focal-Uncertainty Loss MLHC 2021

FlowCaps: Optical Flow Estimation With Capsule Networks for Action Recognition WACV 2021

Learning To Associate Every Segment for Video Panoptic Segmentation CVPR 2021

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval ICCV 2021

T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval CVPR 2021

Reducing the Annotation Effort for Video Object Segmentation Datasets WACV 2021

Towards Visually Explaining Video Understanding Networks With Perturbation WACV 2021

Crossover Learning for Fast Online Video Instance Segmentation ICCV 2021

SLAMP: Stochastic Latent Appearance and Motion Prediction ICCV 2021