Video Understanding

1592 directly classified papers

Papers

Multilevel Language and Vision Integration for Text-to-Clip Retrieval AAAI 2019

More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation NeurIPS 2019

BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames CVPR 2019

Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video ACL 2019

Dense Temporal Convolution Network for Sign Language Translation IJCAI 2019

Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts IJCNLP 2019

EASSE: Easier Automatic Sentence Simplification Evaluation EMNLP 2019

Hallucinating Optical Flow Features for Video Classification IJCAI 2019

Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks IJCAI 2019

Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag EMNLP 2019

Video Interactive Captioning with Human Prompts IJCAI 2019

A Delay Metric for Video Object Detection: What Average Precision Fails to Tell ICCV 2019

STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification AAAI 2019

Semantic Proposal for Activity Localization in Videos via Sentence Query AAAI 2019

Controllable Video Captioning With POS Sequence Guidance Based on Gated Fusion Network ICCV 2019

Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query ICCV 2019

AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation ICCV 2019

Video Instance Segmentation ICCV 2019

Self-Supervised Learning With Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera ICCV 2019

TSM: Temporal Shift Module for Efficient Video Understanding ICCV 2019

STEP: Spatio-Temporal Progressive Learning for Video Action Detection CVPR 2019

Self-Supervised Spatio-Temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics CVPR 2019

End-to-End Dense Video Captioning With Masked Transformer CVPR 2018

VirtualHome: Simulating Household Activities via Programs CVPR 2018

Revisiting Video Saliency: A Large-Scale Benchmark and a New Model CVPR 2018