Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Generation
Computer Vision
›
Generation
›
Image Captioning
781 directly classified papers
Papers per year
2003: 1
2008: 1
2011: 1
2012: 1
2013: 5
2014: 2
2015: 21
2016: 17
2017: 36
2018: 47
2019: 92
2020: 73
2021: 96
2022: 91
2023: 107
2024: 86
2025: 96
2026: 8
Papers
Automated Generation of Accurate & Fluent Medical X-ray Reports
EMNLP 2021
Retrieval, Analogy, and Composition: A framework for Compositional Generalization in Image Captioning
EMNLP 2021
SciCap: Generating Captions for Scientific Figures
EMNLP 2021
COSMic: A Coherence-Aware Generation Metric for Image Descriptions
EMNLP 2021
QACE: Asking Questions to Evaluate an Image Caption
EMNLP 2021
End-to-End Dense Video Captioning With Parallel Decoding
ICCV 2021
Reference and coreference in situated dialogue
NAACL 2021
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
NAACL 2021
Quality Estimation for Image Captions Based on Large-scale Human Evaluations
NAACL 2021
Joint Commonsense and Relation Reasoning for Image and Video Captioning
AAAI 2020
Overcoming Language Priors in VQA via Decomposed Linguistic Representations
AAAI 2020
Feature Difference Makes Sense: A medical image captioning model exploiting feature difference and tag information
ACL 2020
Improving Image Captioning with Better Use of Caption
ACL 2020
Recurrent Nested Model for Sequence Generation
AAAI 2020
Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning
EMNLP 2020
Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling
AAAI 2020
Are Scene Graphs Good Enough to Improve Image Captioning?
AACL 2020
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
AAAI 2020
What Makes A Good Story? Designing Composite Rewards for Visual Storytelling
AAAI 2020
Language-Driven Region Pointer Advancement for Controllable Image Captioning
COLING 2020
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domain
COLING 2020
Geo-Aware Image Caption Generation
COLING 2020
Image Caption Generation for News Articles
COLING 2020
Ad Lingua: Text Classification Improves Symbolism Prediction in Image Advertisements
COLING 2020
Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption
AAAI 2020
<
1
…
19
20
21
…
32
>