Research Explorer
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Masked Audio Text Encoders are Effective Multi-Modal Rescorers
ACL 2023
Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach
INTERSPEECH 2023
Pay Attention to Implicit Attribute Values: A Multi-modal Generative Framework for AVE Task
ACL 2023
With Prejudice to None: A Few-Shot, Multilingual Transfer Learning Approach to Detect Social Bias in Low Resource Languages
ACL 2023
MultiQG-TI: Towards Question Generation from Multi-modal Sources
ACL 2023
MarsEclipse at SemEval-2023 Task 3: Multi-lingual and Multi-label Framing Detection with Contrastive Learning
ACL 2023
PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion
ACL 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
ACL 2023
Unifying Vision-Language Representation Space with Single-Tower Transformer
AAAI 2023
Modeling Entities As Semantic Points for Visual Information Extraction in the Wild
CVPR 2023
An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity
CVPR 2023
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
CVPR 2023
PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion
SEMEVAL 2023
Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data
CVPR 2023
A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension
AAAI 2023
Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue
EMNLP 2023
EDIS: Entity-Driven Image Search over Multimodal Web Content
EMNLP 2023
Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks
EMNLP 2023
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback
EMNLP 2023
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer
EMNLP 2023
Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition
EMNLP 2023
Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines
EMNLP 2023
Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval
EMNLP 2023
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
EMNLP 2023
MaXM: Towards Multilingual Visual Question Answering
EMNLP 2023