Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
CVPR 2019
Memory-Attended Recurrent Network for Video Captioning
CVPR 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
CVPR 2019
MFAS: Multimodal Fusion Architecture Search
CVPR 2019
Cross-Modality Personalization for Retrieval
CVPR 2019
Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning
CVPR 2019
MSCap: Multi-Style Image Captioning With Unpaired Stylized Text
CVPR 2019
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
CVPR 2019
LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search
CVPR 2019
Learning Words by Drawing Images
CVPR 2019
Actively Seeking and Learning From Live Data
CVPR 2019
It's Not About the Journey; It's About the Destination: Following Soft Paths Under Question-Guidance for Visual Reasoning
CVPR 2019
End-To-End Multi-Task Learning With Attention
CVPR 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection
CVPR 2019
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication
ACL 2019
Detecting Concealed Information in Text and Speech
ACL 2019
An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis
ACL 2019
Context-aware Interactive Attention for Multi-modal Sentiment and Emotion Analysis
EMNLP 2019
Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag
EMNLP 2019
Emotion-Cause Pair Extraction: A New Task to Emotion Analysis in Texts
ACL 2019
Fact-Checking Meets Fauxtography: Verifying Claims About Images
EMNLP 2019
Using Clinical Notes with Time Series Data for ICU Management
EMNLP 2019
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining
EMNLP 2019
Table-to-Text Generation with Effective Hierarchical Encoder on Three Dimensions (Row, Column and Time)
EMNLP 2019
Towards Knowledge-Based Recommender Dialog System
EMNLP 2019
<
1
…
476
477
478
…
523
>