Research Explorer
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
A Radical-Aware Attention-Based Model for Chinese Text Classification
AAAI 2019
Multi-Interactive Memory Network for Aspect Based Multimodal Sentiment Analysis
AAAI 2019
Neural Collective Graphical Models for Estimating Spatio-Temporal Population Flow from Aggregated Data
AAAI 2019
Knowledge Aware Semantic Concept Expansion for Image-Text Matching
IJCAI 2019
HorNet: A Hierarchical Offshoot Recurrent Network for Improving Person Re-ID via Image Captioning
IJCAI 2019
Mappa Mundi: An Interactive Artistic Mind Map Generator with Artificial Imagination
IJCAI 2019
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression
AAAI 2019
Face Photo-Sketch Synthesis via Knowledge Transfer
IJCAI 2019
Conditional GAN with Discriminative Filter Generation for Text-to-Video Synthesis
IJCAI 2019
Exploiting Interaction Links for Node Classification with Deep Graph Neural Networks
IJCAI 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
IJCAI 2019
Swell-and-Shrink: Decomposing Image Captioning by Transformation and Summarization
IJCAI 2019
FACSIMILE: Fast and Accurate Scans From an Image in Less Than a Second
ICCV 2019
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
ICCV 2019
Spatiotemporal Feature Residual Propagation for Action Prediction
ICCV 2019
Tag2Pix: Line Art Colorization Using Text Tag With SECat and Changing Loss
ICCV 2019
Adapting BERT for Target-Oriented Multimodal Sentiment Classification
IJCAI 2019
Relation-Aware Graph Attention Network for Visual Question Answering
ICCV 2019
Joint Optimization for Cooperative Image Captioning
ICCV 2019
Language Features Matter: Effective Language Representations for Vision-Language Tasks
ICCV 2019
Dual Attention Matching for Audio-Visual Event Localization
ICCV 2019
Multi-Modality Latent Interaction Network for Visual Question Answering
ICCV 2019
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching
ICCV 2019
Phrase Localization Without Paired Training Examples
ICCV 2019
Robust Change Captioning
ICCV 2019