Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Deep Learning
›
Learning Types
›
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin
ACL 2024
MolTC: Towards Molecular Relational Modeling In Language Models
ACL 2024
ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language Tasks
CVPR 2024
UniHuman: A Unified Model For Editing Human Images in the Wild
CVPR 2024
PeVL: Pose-Enhanced Vision-Language Model for Fine-Grained Human Action Recognition
CVPR 2024
Unified Lexical Representation for Interpretable Visual-Language Alignment
NIPS 2024
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks
ACL 2024
MasonTigers@LT-EDI-2024: An Ensemble Approach Towards Detecting Homophobia and Transphobia in Social Media Comments
EACL 2024
Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers
NIPS 2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation
AAAI 2024
Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment
AAAI 2024
Debiasing Multimodal Sarcasm Detection with Contrastive Learning
AAAI 2024
Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection
AAAI 2024
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks
ACL 2024
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
AAAI 2024
Multi-Modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models
AAAI 2024
Beyond Grounding: Extracting Fine-Grained Event Hierarchies across Modalities
AAAI 2024
Reinforced Adaptive Knowledge Learning for Multimodal Fake News Detection
AAAI 2024
Multi-Modal Disordered Representation Learning Network for Description-Based Person Search
AAAI 2024
ND-MRM: Neuronal Diversity Inspired Multisensory Recognition Model
AAAI 2024
RedCore: Relative Advantage Aware Cross-Modal Representation Learning for Missing Modalities with Imbalanced Missing Rates
AAAI 2024
XKD: Cross-Modal Knowledge Distillation with Domain Alignment for Video Representation Learning
AAAI 2024
Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness
NIPS 2024
Attention-Induced Embedding Imputation for Incomplete Multi-View Partial Multi-Label Classification
AAAI 2024
Noise-Aware Image Captioning with Progressively Exploring Mismatched Words
AAAI 2024
<
1
…
35
36
37
…
128
>