Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Listen to the Image
CVPR 2019
Controllable Text Simplification with Lexical Constraint Loss
ACL 2019
Multimodal Abstractive Summarization for How2 Videos
ACL 2019
Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-ray Reports
ACL 2019
Towards Comprehensive Description Generation from Factual Attribute-value Tables
ACL 2019
Faithful Multimodal Explanation for Visual Question Answering
ACL 2019
Model-Blind Video Denoising via Frame-To-Frame Training
CVPR 2019
Learning to Explain With Complemental Examples
CVPR 2019
Intention Oriented Image Captions With Guiding Objects
CVPR 2019
Visual Query Answering by Entity-Attribute Graph Matching and Reasoning
CVPR 2019
Recurrent Neural Networks With Intra-Frame Iterations for Video Deblurring
CVPR 2019
Speech2Face: Learning the Face Behind a Voice
CVPR 2019
Deep Video Inpainting
CVPR 2019
Detailed Human Shape Estimation From a Single Image by Hierarchical Mesh Deformation
CVPR 2019
Self-Supervised Spatio-Temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
CVPR 2019
Large-Scale Long-Tailed Recognition in an Open World
CVPR 2019
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition With Multimodal Training
CVPR 2019
Pushing the Boundaries of View Extrapolation With Multiplane Images
CVPR 2019
What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues
ACL 2019
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
ACL 2019
Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers
ACL 2019
Expressing Visual Relationships via Language
ACL 2019
Multilingual Unsupervised NMT using Shared Encoder and Language-Specific Decoders
ACL 2019
Dense Procedure Captioning in Narrated Instructional Videos
ACL 2019
Symbolic Inductive Bias for Visually Grounded Learning of Spoken Language
ACL 2019
<
1
…
478
479
480
…
523
>