Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Context Dependent Semantic Parsing over Temporally Structured Data
NAACL 2019
Multi-Resolution Weak Supervision for Sequential Data
NIPS 2019
Can Character Embeddings Improve Cause-of-Death Classification for Verbal Autopsy Narratives?
ACL 2019
Multiple Admissibility: Judging Grammaticality using Unlabeled Data in Language Learning
ACL 2019
TIGEr: Text-to-Image Grounding for Image Caption Evaluation
EMNLP 2019
Fusion of Detected Objects in Text for Visual Question Answering
EMNLP 2019
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
EMNLP 2019
HorizonNet: Learning Room Layout With 1D Representation and Pano Stretch Data Augmentation
CVPR 2019
Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching
CVPR 2019
Multi-Target Embodied Question Answering
CVPR 2019
Cycle-Consistency for Robust Visual Question Answering
CVPR 2019
GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering
CVPR 2019
Facial Emotion Distribution Learning by Exploiting Low-Rank Label Correlations Locally
CVPR 2019
Unsupervised Multi-Modal Neural Machine Translation
CVPR 2019
Multi-Task Learning of Hierarchical Vision-Language Representation
CVPR 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
EMNLP 2019
Robust Navigation with Language Pretraining and Stochastic Sampling
EMNLP 2019
Grounding learning of modifier dynamics: An application to color naming
EMNLP 2019
DEBUG: A Dense Bottom-Up Grounding Approach for Natural Language Video Localization
EMNLP 2019
Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention
CVPR 2019
RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion
CVPR 2019
HITSZ-ICRC: A Report for SMM4H Shared Task 2019-Automatic Classification and Extraction of Adverse Effect Mentions in Tweets
ACL 2019
Exploring Deep Multimodal Fusion of Text and Photo for Hate Speech Classification
ACL 2019
Chasing Ghosts: Instruction Following as Bayesian State Tracking
NIPS 2019
Cross Attention Network for Few-shot Classification
NIPS 2019
<
1
…
475
476
477
…
523
>