Research Explorer
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model
NIPS 2017
MalwareTextDB: A Database for Annotated Malware Articles
ACL 2017
Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images
CVPR 2017
Incorporating Global Visual Features into Attention-based Neural Machine Translation
EMNLP 2017
Transformation-Grounded Image Generation Network for Novel 3D View Synthesis
CVPR 2017
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
CVPR 2017
Predictive-Corrective Networks for Action Detection
CVPR 2017
Asynchronous Temporal Fields for Action Recognition
CVPR 2017
Visual Dialog
CVPR 2017
Emotion Recognition in Context
CVPR 2017
Hierarchical Multimodal Metric Learning for Multimodal Classification
CVPR 2017
Learning Neural Representations of Human Cognition across Many fMRI Studies
NIPS 2017
Multichannel End-to-end Speech Recognition
ICML 2017
Towards a Universal Sentiment Classifier in Multiple languages
EMNLP 2017
Multimodal Learning and Reasoning for Visual Question Answering
NIPS 2017
Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols
NIPS 2017
Teaching Machines to Describe Images with Natural Language Feedback
NIPS 2017
Cold-Start Reinforcement Learning with Softmax Policy Gradient
NIPS 2017
Pixels to Graphs by Associative Embedding
NIPS 2017
Coupling Distributed and Symbolic Execution for Natural Language Queries
ICML 2017
Targeting EEG/LFP Synchrony with Neural Nets
NIPS 2017
Reconstructing perceived faces from brain activations with deep adversarial neural decoding
NIPS 2017
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts
NIPS 2017
Multimodal Machine Learning: Integrating Language, Vision and Speech
ACL 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
CVPR 2017