conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Beyond Accuracy: A Consolidated Tool for Visual Question Answering Benchmarking
EMNLP 2021
MeetDot: Videoconferencing with Live Translation Captions
EMNLP 2021
iFacetSum: Coreference-based Interactive Faceted Summarization for Multi-Document Exploration
EMNLP 2021
DRIFT: A Toolkit for Diachronic Analysis of Scientific Literature
EMNLP 2021
A Web Scale Entity Extraction System
EMNLP 2021
Joint Multimedia Event Extraction from Video and Article
EMNLP 2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
EMNLP 2021
Visually Grounded Concept Composition
EMNLP 2021
Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer
EMNLP 2021
Exploring Sentence Community for Document-Level Event Extraction
EMNLP 2021
What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability
EMNLP 2021
Calibrate your listeners! Robust communication-based training for pragmatic speakers
EMNLP 2021
Diversity and Consistency: Exploring Visual Question-Answer Pair Generation
EMNLP 2021
Learning to Ground Visual Objects for Visual Dialog
EMNLP 2021
Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Knowledge
EMNLP 2021
TWT: Table with Written Text for Controlled Data-to-Text Generation
EMNLP 2021
Which is Making the Contribution: Modulating Unimodal and Cross-modal Dynamics for Multimodal Sentiment Analysis
EMNLP 2021
Inconsistency Matters: A Knowledge-guided Dual-inconsistency Network for Multi-modal Rumor Detection
EMNLP 2021
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser
EMNLP 2021
Retrieval, Analogy, and Composition: A framework for Compositional Generalization in Image Captioning
EMNLP 2021
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering
EMNLP 2021
REBEL: Relation Extraction By End-to-end Language generation
EMNLP 2021
Controlled Neural Sentence-Level Reframing of News Articles
EMNLP 2021
Progressive Transformer-Based Generation of Radiology Reports
EMNLP 2021
Data Efficient Masked Language Modeling for Vision and Language
EMNLP 2021
<
1
…
420
421
422
…
523
>