Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding
ACL 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
ACL 2021
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision
ACL 2021
Competence-based Multimodal Curriculum Learning for Medical Report Generation
ACL 2021
MultiMET: A Multimodal Dataset for Metaphor Understanding
ACL 2021
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction
ACL 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
ACL 2021
CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals
ACL 2021
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques
ACL 2021
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words
ACL 2021
Detecting Propaganda Techniques in Memes
ACL 2021
Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines
ACL 2021
More than Text: Multi-modal Chinese Word Segmentation
ACL 2021
X-Fact: A New Benchmark Dataset for Multilingual Fact Checking
ACL 2021
Towards Visual Question Answering on Pathology Images
ACL 2021
Video-guided Machine Translation with Spatial Hierarchical Attention Network
ACL 2021
“I’ve Seen Things You People Wouldn’t Believe”: Hallucinating Entities in GuessWhat?!
ACL 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
ACL 2021
Stretch-VST: Getting Flexible With Visual Stories
ACL 2021
Recognizing Multimodal Entailment
ACL 2021
Textual Representations for Crosslingual Information Retrieval
ACL 2021
Personalized Response Generation with Tensor Factorization
ACL 2021
SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images
ACL 2021
YNU-HPCC at SemEval-2021 Task 6: Combining ALBERT and Text-CNN for Persuasion Detection in Texts and Images
ACL 2021
LT3 at SemEval-2021 Task 6: Using Multi-Modal Compact Bilinear Pooling to Combine Visual and Textual Understanding in Memes
ACL 2021