← Learning Types

Deep Learning › Learning Types ›

Multi-Modal Learning

3194 directly classified papers

Papers per year

Papers

LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding ACL 2021

UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning ACL 2021

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision ACL 2021

Competence-based Multimodal Curriculum Learning for Medical Report Generation ACL 2021

MultiMET: A Multimodal Dataset for Metaphor Understanding ACL 2021

Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction ACL 2021

TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance ACL 2021

CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals ACL 2021

Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques ACL 2021

VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words ACL 2021

Detecting Propaganda Techniques in Memes ACL 2021

Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines ACL 2021

More than Text: Multi-modal Chinese Word Segmentation ACL 2021

X-Fact: A New Benchmark Dataset for Multilingual Fact Checking ACL 2021

Towards Visual Question Answering on Pathology Images ACL 2021

Video-guided Machine Translation with Spatial Hierarchical Attention Network ACL 2021

“I’ve Seen Things You People Wouldn’t Believe”: Hallucinating Entities in GuessWhat?! ACL 2021

CRSLab: An Open-Source Toolkit for Building Conversational Recommender System ACL 2021

Stretch-VST: Getting Flexible With Visual Stories ACL 2021

Recognizing Multimodal Entailment ACL 2021

Textual Representations for Crosslingual Information Retrieval ACL 2021

Personalized Response Generation with Tensor Factorization ACL 2021

SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images ACL 2021

YNU-HPCC at SemEval-2021 Task 6: Combining ALBERT and Text-CNN for Persuasion Detection in Texts and Images ACL 2021

LT3 at SemEval-2021 Task 6: Using Multi-Modal Compact Bilinear Pooling to Combine Visual and Textual Understanding in Memes ACL 2021