Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing
CVPR 2019
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection
AAAI 2019
Semantic Proposal for Activity Localization in Videos via Sentence Query
AAAI 2019
Unsupervised Bilingual Lexicon Induction from Mono-Lingual Multimodal Data
AAAI 2019
HSME: Hypersphere Manifold Embedding for Visible Thermal Person Re-Identification
AAAI 2019
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
AAAI 2019
Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts
INTERSPEECH 2019
Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model
ACL 2019
Keep Meeting Summaries on Topic: Abstractive Multi-Modal Meeting Summarization
ACL 2019
Reasoning Visual Dialogs With Structural and Partial Observations
CVPR 2019
Recursive Visual Attention in Visual Dialog
CVPR 2019
Two Body Problem: Collaborative Visual Task Completion
CVPR 2019
Audio Visual Scene-Aware Dialog
CVPR 2019
Deep Multimodal Clustering for Unsupervised Audiovisual Learning
CVPR 2019
Cooperative Multimodal Approach to Depression Detection in Twitter
AAAI 2019
HireNet: A Hierarchical Attention Model for the Automatic Analysis of Asynchronous Video Job Interviews
AAAI 2019
An Efficient Approach to Informative Feature Extraction from Multimodal Data
AAAI 2019
Deep Robust Unsupervised Multi-Modal Network
AAAI 2019
DAN: Deep Attention Neural Network for News Recommendation
AAAI 2019
Exploring Human-Like Reading Strategy for Abstractive Text Summarization
AAAI 2019
Dance With Flow: Two-In-One Stream Action Detection
CVPR 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
CVPR 2019
Deep Supervised Cross-Modal Retrieval
CVPR 2019
Image-Question-Answer Synergistic Network for Visual Dialog
CVPR 2019
Inverse Cooking: Recipe Generation From Food Images
CVPR 2019
<
1
…
117
118
119
…
128
>