Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Keywords
image captioning
728 papers
Explore in graph
Also known as
IDC
PIC
IAC
IC
Co-occurring keywords
multimodal learning
(4622)
visual question answering
(1000)
vision-language model
(2235)
text generation
(2903)
attention mechanism
(3975)
visual grounding
(505)
zero-shot learning
(3637)
multi-modal learning
(1276)
vision language model
(752)
natural language generation
(782)
Papers
A Hierarchical Approach for Generating Descriptive Image Paragraphs
CVPR 2017
Obj2Text: Generating Visually Descriptive Language from Object Layouts
EMNLP 2017
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
CVPR 2017
Captioning Images With Diverse Objects
CVPR 2017
Semantic Compositional Networks for Visual Captioning
CVPR 2017
Dense Captioning With Joint Inference and Visual Context
CVPR 2017
Generating Descriptions With Grounded and Co-Referenced People
CVPR 2017
STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
ACL 2017
Self-Critical Sequence Training for Image Captioning
CVPR 2017
StyleNet: Generating Attractive Visual Captions With Styles
CVPR 2017
Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
CVPR 2017
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-In-The-Blank Image Captioning
CVPR 2017
Contrastive Learning for Image Captioning
NIPS 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
EMNLP 2017
Multimodal Machine Learning: Integrating Language, Vision and Speech
ACL 2017
FOIL it! Find One mismatch between Image and Language caption
ACL 2017
Deliberation Networks: Sequence Generation Beyond One-Pass Decoding
NIPS 2017
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
NIPS 2017
Guided Open Vocabulary Image Captioning with Constrained Beam Search
EMNLP 2017
MAT: A Multimodal Attentive Translator for Image Captioning
IJCAI 2017
Context-Aware Captions From Context-Agnostic Supervision
CVPR 2017
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning
CVPR 2017
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
CVPR 2017
Learning Object Interactions and Descriptions for Semantic Image Segmentation
CVPR 2017
The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives
CVPR 2017
<
1
…
26
27
28
29
30
>