Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Person Tube Retrieval via Language Description
AAAI 2020
Visual Relationship Detection with Low Rank Non-Negative Tensor Decomposition
AAAI 2020
Expressing Objects Just Like Words: Recurrent Visual Embedding for Image-Text Matching
AAAI 2020
Visual Agreement Regularized Training for Multi-Modal Machine Translation
AAAI 2020
Learning Relationships between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis
AAAI 2020
Modelling Form-Meaning Systematicity with Linguistic and Visual Features
AAAI 2020
Modality-Balanced Models for Visual Dialogue
AAAI 2020
Adaptive Adversarial Multi-task Representation Learning
ICML 2020
Knowledge-Enriched Visual Storytelling
AAAI 2020
Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
AAAI 2020
Deep Embedded Complementary and Interactive Information for Multi-View Classification
AAAI 2020
Learning Agent Communication under Limited Bandwidth by Message Pruning
AAAI 2020
Infrared-Visible Cross-Modal Person Re-Identification with an X Modality
AAAI 2020
Dynamic Instance Normalization for Arbitrary Style Transfer
AAAI 2020
EPOC: Efficient Perception via Optimal Communication
AAAI 2020
Cross-Modal Subspace Clustering via Deep Canonical Correlation Analysis
AAAI 2020
Structured and Sparse Annotations for Image Emotion Distribution Learning
AAAI 2019
Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization
ACL 2019
DSTC7 Task 1: Noetic End-to-End Response Selection
ACL 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
AAAI 2019
Learning Individual Styles of Conversational Gesture
CVPR 2019
Grounded Video Description
CVPR 2019
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering
CVPR 2019
Dual Encoding for Zero-Example Video Retrieval
CVPR 2019
Image-To-Image Translation via Group-Wise Deep Whitening-And-Coloring Transformation
CVPR 2019
<
1
…
114
115
116
…
128
>