Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Multi-view Collaborative Gaussian Process Dynamical Systems
JMLR 2023
Loan Fraud Users Detection in Online Lending Leveraging Multiple Data Views
AAAI 2023
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval
AISTATS 2023
Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data
AISTATS 2023
Stochastic Gradient Descent-Ascent: Unified Theory and New Efficient Methods
AISTATS 2023
Where Will Players Move Next? Dynamic Graphs and Hierarchical Fusion for Movement Forecasting in Badminton
AAAI 2023
Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection
AAAI 2023
MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification
AAAI 2023
Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval
AAAI 2023
COLA: Improving Conversational Recommender Systems by Collaborative Augmentation
AAAI 2023
Video-Text Pre-training with Learned Regions for Retrieval
AAAI 2023
Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR
AAAI 2023
Multi-View Action Recognition Using Contrastive Learning
WACV 2023
Pretraining Language Models with Text-Attributed Heterogeneous Graphs
EMNLP 2023
MAFiD: Moving Average Equipped Fusion-in-Decoder for Question Answering over Tabular and Textual Data
EACL 2023
Concept-based Persona Expansion for Improving Diversity of Persona-Grounded Dialogue
EACL 2023
Improving Cross-modal Alignment for Text-Guided Image Inpainting
EACL 2023
Multimodal Event Transformer for Image-guided Story Ending Generation
EACL 2023
Cross-Modal Semantic Enhanced Interaction for Image-Sentence Retrieval
WACV 2023
Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation
AAAI 2023
CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
AAAI 2023
Just Noticeable Visual Redundancy Forecasting: A Deep Multimodal-Driven Approach
AAAI 2023
What to Fuse and How to Fuse: Exploring Emotion and Personality Fusion Strategies for Explainable Mental Disorder Detection
ACL 2023
ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction
EMNLP 2023
Discriminative Class Tokens for Text-to-Image Diffusion Models
ICCV 2023
<
1
…
29
30
31
…
49
>