← Learning Types

Machine Learning › Learning Types ›

Multi-Modal Learning

1213 directly classified papers

Papers per year

Papers

Masked Audio Text Encoders are Effective Multi-Modal Rescorers ACL 2023

Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach INTERSPEECH 2023

Pay Attention to Implicit Attribute Values: A Multi-modal Generative Framework for AVE Task ACL 2023

With Prejudice to None: A Few-Shot, Multilingual Transfer Learning Approach to Detect Social Bias in Low Resource Languages ACL 2023

MultiQG-TI: Towards Question Generation from Multi-modal Sources ACL 2023

MarsEclipse at SemEval-2023 Task 3: Multi-lingual and Multi-label Framing Detection with Contrastive Learning ACL 2023

PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion ACL 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition ACL 2023

Unifying Vision-Language Representation Space with Single-Tower Transformer AAAI 2023

Modeling Entities As Semantic Points for Visual Information Extraction in the Wild CVPR 2023

An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity CVPR 2023

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training CVPR 2023

PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion SEMEVAL 2023

Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data CVPR 2023

A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension AAAI 2023

Well Begun is Half Done: Generator-agnostic Knowledge Pre-Selection for Knowledge-Grounded Dialogue EMNLP 2023

EDIS: Entity-Driven Image Search over Multimodal Web Content EMNLP 2023

Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks EMNLP 2023

Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback EMNLP 2023

Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer EMNLP 2023

Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimoda Emotion Recognition EMNLP 2023

Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines EMNLP 2023

Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval EMNLP 2023

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization EMNLP 2023

MaXM: Towards Multilingual Visual Question Answering EMNLP 2023