Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Contextual Inter-modal Attention for Multi-modal Sentiment Analysis
EMNLP 2018
CUNI System for the WMT18 Multimodal Translation Task
EMNLP 2018
The First Multilingual Surface Realisation Shared Task (SR’18): Overview and Evaluation Results
ACL 2018
Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning
NIPS 2018
Graphical Generative Adversarial Networks
NIPS 2018
EmotionX-DLC: Self-Attentive BiLSTM for Detecting Sequential Emotions in Dialogues
ACL 2018
Eyes are the Windows to the Soul: Predicting the Rating of Text Quality Using Gaze Behaviour
ACL 2018
A Neural Architecture for Automated ICD Coding
ACL 2018
Latent Alignment and Variational Attention
NIPS 2018
Self-Supervised Generation of Spatial Audio for 360° Video
NIPS 2018
Multimodal Generative Models for Scalable Weakly-Supervised Learning
NIPS 2018
Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis
NIPS 2018
Attention in Convolutional LSTM for Gesture Recognition
NIPS 2018
Active Matting
NIPS 2018
Dialog-based Interactive Image Retrieval
NIPS 2018
DeepExposure: Learning to Expose Photos with Asynchronously Reinforced Adversarial Learning
NIPS 2018
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
NIPS 2018
Chain of Reasoning for Visual Question Answering
NIPS 2018
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
NIPS 2018
Recognizing Emotions in Video Using Multimodal DNN Feature Fusion
ACL 2018
Enhancing Drug-Drug Interaction Extraction from Texts by Molecular Structure Information
ACL 2018
Affordances in Grounded Language Learning
ACL 2018
ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information
CVPR 2018
Grounding Referring Expressions in Images by Variational Context
CVPR 2018
Learning Answer Embeddings for Visual Question Answering
CVPR 2018
<
1
…
500
501
502
…
523
>