Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning
NIPS 2024
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling
NIPS 2024
M2PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
EMNLP 2024
Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data
NIPS 2024
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
AAAI 2024
M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection
EMNLP 2024
On the Comparison between Multi-modal and Single-modal Contrastive Learning
NIPS 2024
Continual Learning in an Open and Dynamic World
AAAI 2024
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
NIPS 2024
Fine-Grained Prediction of Reading Comprehension from Eye Movements
EMNLP 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
NIPS 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
NIPS 2024
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation
EMNLP 2024
Multi-Level Cross-Modal Alignment for Speech Relation Extraction
EMNLP 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
NIPS 2024
Unity by Diversity: Improved Representation Learning for Multimodal VAEs
NIPS 2024
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections
EMNLP 2024
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
NIPS 2024
Cross-modal Representation Flattening for Multi-modal Domain Generalization
NIPS 2024
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
EMNLP 2024
HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data
NIPS 2024
Facilitating Multimodal Classification via Dynamically Learning Modality Gap
NIPS 2024
MmCows: A Multimodal Dataset for Dairy Cattle Monitoring
NIPS 2024
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
NIPS 2024
Hierarchical Aligned Multimodal Learning for NER on Tweet Posts
AAAI 2024
<
1
…
18
19
20
…
49
>