Research Explorer
Video Understanding
1098 directly classified papers
Papers per year
2006: 1
2012: 1
2013: 47
2014: 19
2015: 27
2016: 17
2017: 22
2018: 31
2019: 71
2020: 92
2021: 115
2022: 129
2023: 133
2024: 186
2025: 200
2026: 7
Papers
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
ACL 2022
Searching for fingerspelled content in American Sign Language
ACL 2022
What’s Different between Visual Question Answering for Machine “Understanding” Versus for Accessibility?
AACL 2022
Chop and Change: Anaphora Resolution in Instructional Cooking Videos
AACL 2022
Learning Optical Flow with Adaptive Graph Reasoning
AAAI 2022
Self-Training Multi-Sequence Learning with Transformer for Weakly Supervised Video Anomaly Detection
AAAI 2022
Rethinking Multi-Modal Alignment in Multi-Choice VideoQA from Feature and Sample Perspectives
EMNLP 2022
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers
INTERSPEECH 2022
BEVT: BERT Pretraining of Video Transformers
CVPR 2022
Contrastive Learning for Unsupervised Video Highlight Detection
CVPR 2022
Coarse-To-Fine Feature Mining for Video Semantic Segmentation
CVPR 2022
TransRAC: Encoding Multi-Scale Temporal Correlation With Transformers for Repetitive Action Counting
CVPR 2022
Weakly-Supervised Temporal Article Grounding
EMNLP 2022
CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering
EMNLP 2022
Group Contextualization for Video Recognition
CVPR 2022
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework
ACL 2022
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
ACL 2022
VPAI_Lab at MedVidQA 2022: A Two-Stage Cross-modal Fusion Method for Medical Instructional Video Classification
ACL 2022
Implicit Motion Handling for Video Camouflaged Object Detection
CVPR 2022
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
CVPR 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
CVPR 2022
CISLR: Corpus for Indian Sign Language Recognition
EMNLP 2022
Exposing the Limits of Video-Text Models through Contrast Sets
NAACL 2022
Contrastive Video-Language Learning with Fine-grained Frame Sampling
IJCNLP 2022
TransRank: Self-Supervised Video Representation Learning via Ranking-Based Transformation Recognition
CVPR 2022