Research Explorer
Multimodal Learning
13,057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Negative Focus Detection via Contextual Attention Mechanism
EMNLP 2019
Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset
EMNLP 2019
Understanding the Effect of Textual Adversaries in Multimodal Machine Translation
EMNLP 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
EMNLP 2019
Modeling Graph Structure in Transformer for Better AMR-to-Text Generation
EMNLP 2019
What You See is What You Get: Visual Pronoun Coreference Resolution in Dialogues
EMNLP 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
EMNLP 2019
Neural News Recommendation with Heterogeneous User Behavior
EMNLP 2019
Jointly Learning to Align and Translate with Transformer Models
EMNLP 2019
Extracting Possessions from Social Media: Images Complement Language
EMNLP 2019
Hierarchy Response Learning for Neural Conversation Generation
EMNLP 2019
Partners in Crime: Multi-view Sequential Inference for Movie Understanding
EMNLP 2019
Aspect-Level Sentiment Analysis Via Convolution over Dependency Tree
EMNLP 2019
Assessing Post Deletion in Sina Weibo: Multi-modal Classification of Hot Topics
EMNLP 2019
Computational Analysis of the Historical Changes in Poetry and Prose
ACL 2019
A Turkish Dataset for Gender Identification of Twitter Users
ACL 2019
Inverse Cooking: Recipe Generation From Food Images
CVPR 2019
Deep Modular Co-Attention Networks for Visual Question Answering
CVPR 2019
Grounded Video Description
CVPR 2019
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
CVPR 2019
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering
CVPR 2019
Recursive Visual Attention in Visual Dialog
CVPR 2019
Text2Scene: Generating Compositional Scenes From Textual Descriptions
CVPR 2019
Audio Visual Scene-Aware Dialog
CVPR 2019
The Pros and Cons: Rank-Aware Temporal Attention for Skill Determination in Long Videos
CVPR 2019