Research Explorer
video captioning
206 papers
Also known as
MCN
Co-occurring keywords
video understanding (1647)
multimodal learning (4622)
image captioning (728)
recurrent neural network (1790)
video description (25)
action recognition (957)
attention mechanism (3975)
natural language generation (782)
vision-language model (2235)
contrastive learning (3979)
Papers
Improving Generation and Evaluation of Visual Stories via Semantic Consistency (NAACL 2021)
Sketch, Ground, and Refine: Top-Down Dense Video Captioning (CVPR 2021)
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos (EMNLP 2020)
An Efficient Framework for Dense Video Captioning (AAAI 2020)
Screencast Tutorial Video Understanding (CVPR 2020)
SBAT: Video Captioning with Sparse Boundary-Aware Transformer (IJCAI 2020)
Learning to Discretely Compose Reasoning Module Networks for Video Captioning (IJCAI 2020)
Spatio-Temporal Ranked-Attention Networks for Video Captioning (WACV 2020)
Joint Commonsense and Relation Reasoning for Image and Video Captioning (AAAI 2020)
Spatio-Temporal Graph for Video Captioning With Knowledge Distillation (CVPR 2020)
Object Relational Graph With Teacher-Recommended Learning for Video Captioning (CVPR 2020)
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning (ACL 2020)
Domain-Specific Semantics Guided Approach to Video Captioning (WACV 2020)
Better Captioning With Sequence-Level Exploration (CVPR 2020)
Semi-Supervised Learning for Video Captioning (EMNLP 2020)
Syntax-Aware Action Targeting for Video Captioning (CVPR 2020)
Normalized and Geometry-Aware Self-Attention Network for Image Captioning (CVPR 2020)
Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning (EMNLP 2020)
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research (ICCV 2019)
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning (AAAI 2019)
Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag (EMNLP 2019)
Fully Convolutional Video Captioning with Coarse-to-Fine and Inherited Attention (AAAI 2019)
Motion Guided Spatial Attention for Video Captioning (AAAI 2019)
Video Interactive Captioning with Human Prompts (IJCAI 2019)
Low-Rank HOCA: Efficient High-Order Cross-Modal Attention for Video Captioning (IJCNLP 2019)