← Learning Types

Machine Learning › Learning Types ›

Multi-Modal Learning

1213 directly classified papers

Papers per year

Papers

Multi-view Collaborative Gaussian Process Dynamical Systems JMLR 2023

Loan Fraud Users Detection in Online Lending Leveraging Multiple Data Views AAAI 2023

SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval AISTATS 2023

Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data AISTATS 2023

Stochastic Gradient Descent-Ascent: Unified Theory and New Efficient Methods AISTATS 2023

Where Will Players Move Next? Dynamic Graphs and Hierarchical Fusion for Movement Forecasting in Badminton AAAI 2023

Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection AAAI 2023

MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification AAAI 2023

Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval AAAI 2023

COLA: Improving Conversational Recommender Systems by Collaborative Augmentation AAAI 2023

Video-Text Pre-training with Learned Regions for Retrieval AAAI 2023

Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR AAAI 2023

Multi-View Action Recognition Using Contrastive Learning WACV 2023

Pretraining Language Models with Text-Attributed Heterogeneous Graphs EMNLP 2023

MAFiD: Moving Average Equipped Fusion-in-Decoder for Question Answering over Tabular and Textual Data EACL 2023

Concept-based Persona Expansion for Improving Diversity of Persona-Grounded Dialogue EACL 2023

Improving Cross-modal Alignment for Text-Guided Image Inpainting EACL 2023

Multimodal Event Transformer for Image-guided Story Ending Generation EACL 2023

Cross-Modal Semantic Enhanced Interaction for Image-Sentence Retrieval WACV 2023

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation AAAI 2023

CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets AAAI 2023

Just Noticeable Visual Redundancy Forecasting: A Deep Multimodal-Driven Approach AAAI 2023

What to Fuse and How to Fuse: Exploring Emotion and Personality Fusion Strategies for Explainable Mental Disorder Detection ACL 2023

ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction EMNLP 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models ICCV 2023