conftrace
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Visual Cues and Error Correction for Translation Robustness
EMNLP 2021
An animated picture says at least a thousand words: Selecting Gif-based Replies in Multimodal Dialog
EMNLP 2021
SciCap: Generating Captions for Scientific Figures
EMNLP 2021
Coreference-aware Surprisal Predicts Brain Response
EMNLP 2021
COSMic: A Coherence-Aware Generation Metric for Image Descriptions
EMNLP 2021
MURAL: Multimodal, Multitask Representations Across Languages
EMNLP 2021
Exploring a Unified Sequence-To-Sequence Transformer for Medical Product Safety Monitoring in Social Media
EMNLP 2021
MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding
EMNLP 2021
Task-Oriented Clustering for Dialogues
EMNLP 2021
Does Vision-and-Language Pretraining Improve Lexical Grounding?
EMNLP 2021
QACE: Asking Questions to Evaluate an Image Caption
EMNLP 2021
Image Retrieval for Arguments Using Stance-Aware Query Expansion
EMNLP 2021
Transferring Knowledge from Vision to Language: How to Achieve it and how to Measure it?
EMNLP 2021
Investigating Negation in Pre-trained Vision-and-language Models
EMNLP 2021
The CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue
EMNLP 2021
Neural Anaphora Resolution in Dialogue
EMNLP 2021
Anaphora Resolution in Dialogue: Description of the DFKI-TalkingRobots System for the CODI-CRAC 2021 Shared-Task
EMNLP 2021
The Pipeline Model for Resolution of Anaphoric Reference and Resolution of Entity Reference
EMNLP 2021
Dependency Induction Through the Lens of Visual Perception
EMNLP 2021
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
EMNLP 2021
Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color
EMNLP 2021
Empathetic Dialog Generation with Fine-Grained Intents
EMNLP 2021
Enriching Language Models with Visually-grounded Word Vectors and the Lancaster Sensorimotor Norms
EMNLP 2021
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training
EMNLP 2021
Does language help generalization in vision models?
EMNLP 2021