conftrace_

Artificial Intelligence › Core AI ›

Multimodal Learning

13,057 papers

Papers per year

Papers

Visual Cues and Error Correction for Translation Robustness EMNLP 2021

An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog EMNLP 2021

SciCap: Generating Captions for Scientific Figures EMNLP 2021

Coreference-aware Surprisal Predicts Brain Response EMNLP 2021

COSMic: A Coherence-Aware Generation Metric for Image Descriptions EMNLP 2021

MURAL: Multimodal, Multitask Representations Across Languages EMNLP 2021

Exploring a Unified Sequence-To-Sequence Transformer for Medical Product Safety Monitoring in Social Media EMNLP 2021

MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding EMNLP 2021

Task-Oriented Clustering for Dialogues EMNLP 2021

Does Vision-and-Language Pretraining Improve Lexical Grounding? EMNLP 2021

QACE: Asking Questions to Evaluate an Image Caption EMNLP 2021

Image Retrieval for Arguments Using Stance-Aware Query Expansion EMNLP 2021

Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it? EMNLP 2021

Investigating Negation in Pre-trained Vision-and-language Models EMNLP 2021

The CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue EMNLP 2021

Neural Anaphora Resolution in Dialogue EMNLP 2021

Anaphora Resolution in Dialogue: Description of the DFKI-TalkingRobots System for the CODI-CRAC 2021 Shared-Task EMNLP 2021

The Pipeline Model for Resolution of Anaphoric Reference and Resolution of Entity Reference EMNLP 2021

Dependency Induction Through the Lens of Visual Perception EMNLP 2021

VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering EMNLP 2021

Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color EMNLP 2021

Empathetic Dialog Generation with Fine-Grained Intents EMNLP 2021

Enriching Language Models with Visually-grounded Word Vectors and the Lancaster Sensorimotor Norms EMNLP 2021

Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training EMNLP 2021

Does language help generalization in vision models? EMNLP 2021