Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
DiG-In-GNN: Discriminative Feature Guided GNN-Based Fraud Detector against Inconsistencies in Multi-Relation Fraud Graph
AAAI 2024
Multimodal Large Language Models Make Text-to-Image Generative Models Align Better
NIPS 2024
Fine-Grained Prediction of Reading Comprehension from Eye Movements
EMNLP 2024
Detection-Based Intermediate Supervision for Visual Question Answering
AAAI 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
NIPS 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
NIPS 2024
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation
EMNLP 2024
Financial Forecasting from Textual and Tabular Time Series
EMNLP 2024
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
CVPR 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
NIPS 2024
Unity by Diversity: Improved Representation Learning for Multimodal VAEs
NIPS 2024
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections
EMNLP 2024
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion
NIPS 2024
Cross-modal Representation Flattening for Multi-modal Domain Generalization
NIPS 2024
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection
EMNLP 2024
Multi-Level Cross-Modal Alignment for Speech Relation Extraction
EMNLP 2024
HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data
NIPS 2024
Facilitating Multimodal Classification via Dynamically Learning Modality Gap
NIPS 2024
PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment
EMNLP 2024
MmCows: A Multimodal Dataset for Dairy Cattle Monitoring
NIPS 2024
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
NIPS 2024
Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA
EMNLP 2024
Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments
EMNLP 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
NIPS 2024
Exploiting Descriptive Completeness Prior for Cross Modal Hashing with Incomplete Labels
NIPS 2024
<
1
…
19
20
21
…
49
>