Video Understanding

1098 directly classified papers

Papers

Retrieval-augmented Video Encoding for Instructional Captioning ACL 2023

Look Before You Match: Instance Understanding Matters in Video Object Segmentation CVPR 2023

How You Feelin'? Learning Emotions and Mental States in Movie Scenes CVPR 2023

Continuous Sign Language Recognition With Correlation Network CVPR 2023

Omnimatte3D: Associating Objects and Their Effects in Unconstrained Monocular Video CVPR 2023

Spatial-Temporal Concept Based Explanation of 3D ConvNets CVPR 2023

AutoLabel: CLIP-Based Framework for Open-Set Video Domain Adaptation CVPR 2023

Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding CVPR 2023

Self-Supervised Video Forensics by Audio-Visual Anomaly Detection CVPR 2023

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training CVPR 2023

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ICCV 2023

Breaking the "Object" in Video Object Segmentation CVPR 2023

TriDet: Temporal Action Detection With Relative Boundary Modeling CVPR 2023

Streaming Video Model CVPR 2023

AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders CVPR 2023

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline CVPR 2023

Modular Memorability: Tiered Representations for Video Memorability Prediction CVPR 2023

Selective Structured State-Spaces for Long-Form Video Understanding CVPR 2023

Test of Time: Instilling Video-Language Models With a Sense of Time CVPR 2023

ISLTranslate: Dataset for Translating Indian Sign Language ACL 2023

Spatial-Then-Temporal Self-Supervised Learning for Video Correspondence CVPR 2023

Text-Visual Prompting for Efficient 2D Temporal Video Grounding CVPR 2023

StepFormer: Self-Supervised Step Discovery and Localization in Instructional Videos CVPR 2023

Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization CVPR 2023

A Large-Scale Robustness Analysis of Video Action Recognition Models CVPR 2023