Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
DevBench: A multimodal developmental benchmark for language learning
NIPS 2024
Unity by Diversity: Improved Representation Learning for Multimodal VAEs
NIPS 2024
READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
AAAI 2024
Adaptive Graph Learning for Multimodal Conversational Emotion Detection
AAAI 2024
Video Event Extraction with Multi-View Interaction Knowledge Distillation
AAAI 2024
Efficient Large Multi-modal Models via Visual Context Compression
NIPS 2024
Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task Learning
NIPS 2024
Automated Defect Report Generation for Enhanced Industrial Quality Control
AAAI 2024
Visual Hallucination Elevates Speech Recognition
AAAI 2024
SheffieldVeraAI at SemEval-2024 Task 4: Prompting and fine-tuning a Large Vision-Language Model for Binary Classification of Persuasion Techniques in Memes
SEMEVAL 2024
Early Detection of Extreme Storm Tide Events Using Multimodal Data Processing
AAAI 2024
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
AAAI 2024
AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System
AAAI 2024
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation
NIPS 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
NIPS 2024
Deep Correlated Prompting for Visual Recognition with Missing Modalities
NIPS 2024
JoLT: Jointly Learned Representations of Language and Time-Series for Clinical Time-Series Interpretation (Student Abstract)
AAAI 2024
HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data
NIPS 2024
THGFormer: Time-Aware Hypergraph Learning for Multimodal Social Media Popularity Prediction (Student Abstract)
AAAI 2024
Multimodal Ensembling for Zero-Shot Image Classification
AAAI 2024
Virtual Try-On: Real-Time Interactive Hybrid Network with High-Fidelity
AAAI 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
NIPS 2024
ReMI: A Dataset for Reasoning with Multiple Images
NIPS 2024
Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model
NIPS 2024
MmCows: A Multimodal Dataset for Dairy Cattle Monitoring
NIPS 2024
<
1
…
57
58
59
…
128
>