Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage
ACL 2020
Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition
AACL 2020
All-in-One: A Deep Attentive Multi-task Learning Framework for Humour, Sarcasm, Offensive, Motivation, and Sentiment on Memes
AACL 2020
TMU Japanese-English Multimodal Machine Translation System for WAT 2020
AACL 2020
Multimodal Neural Machine Translation for English to Hindi
AACL 2020
Learning to Correspond Dynamical Systems
L4DC 2020
Swoosh! Rattle! Thump! - Actions that Sound
RSS 2020
Learning Multimodal Representations for Unseen Activities
WACV 2020
ReStGAN: A step towards visually guided shopper experience via text-to-image synthesis
WACV 2020
CookGAN: Meal Image Synthesis from Ingredients
WACV 2020
Bridged Variational Autoencoders for Joint Modeling of Images and Attributes
WACV 2020
Multi-Modal Association based Grouping for Form Structure Extraction
WACV 2020
PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning Methods
COLING 2020
Robust Multi-View Representation Learning (Student Abstract)
AAAI 2020
SpotFake+: A Multimodal Framework for Fake News Detection via Transfer Learning (Student Abstract)
AAAI 2020
Unified Vision-Language Pre-Training for Image Captioning and VQA
AAAI 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
AAAI 2020
Find Objects and Focus on Highlights: Mining Object Semantics for Video Highlight Detection via Graph Neural Networks
AAAI 2020
3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels
AAAI 2020
Rethinking the Image Fusion: A Fast Unified Image Fusion Network based on Proportional Maintenance of Gradient and Intensity
AAAI 2020
PI-RCNN: An Efficient Multi-Sensor 3D Object Detector with Point-Based Attentive Cont-Conv Fusion Module
AAAI 2020
Convolutional Hierarchical Attention Network for Query-Focused Video Summarization
AAAI 2020
3D Single-Person Concurrent Activity Detection Using Stacked Relation Network
AAAI 2020
F³Net: Fusion, Feedback and Focus for Salient Object Detection
AAAI 2020
Adaptive Cross-Modal Embeddings for Image-Text Alignment
AAAI 2020
<
1
…
112
113
114
…
128
>