conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Exploiting Image–Text Synergy for Contextual Image Captioning
EACL 2021
Classification of mental illnesses on social media using RoBERTa
EACL 2021
Universal Joy A Data Set and Results for Classifying Emotions Across Languages
EACL 2021
Lightweight Models for Multimodal Sequential Data
EACL 2021
The World of an Octopus: How Reporting Bias Influences a Language Model’s Perception of Color
EMNLP 2021
Predicting emergent linguistic compositions through time: Syntactic frame extension via multimodal chaining
EMNLP 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
EMNLP 2021
Contextual Rephrase Detection for Reducing Friction in Dialogue Systems
EMNLP 2021
Reference-Centric Models for Grounded Collaborative Dialogue
EMNLP 2021
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization
EMNLP 2021
Visual Goal-Step Inference using wikiHow
EMNLP 2021
Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language Models
EMNLP 2021
Thinking Clearly, Talking Fast: Concept-Guided Non-Autoregressive Generation for Open-Domain Dialogue Systems
EMNLP 2021
CSAGN: Conversational Structure Aware Graph Network for Conversational Semantic Role Labeling
EMNLP 2021
Multimodal Phased Transformer for Sentiment Analysis
EMNLP 2021
Entity Relation Extraction as Dependency Parsing in Visually Rich Documents
EMNLP 2021
Beyond Text: Incorporating Metadata and Label Structure for Multi-Label Document Classification using Heterogeneous Graphs
EMNLP 2021
Leveraging Capsule Routing to Associate Knowledge with Medical Literature Hierarchically
EMNLP 2021
Automated Generation of Accurate & Fluent Medical X-ray Reports
EMNLP 2021
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations
EMNLP 2021
Relation-aware Video Reading Comprehension for Temporal Language Grounding
EMNLP 2021
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization
EMNLP 2021
Joint Multi-modal Aspect-Sentiment Analysis with Auxiliary Cross-modal Relation Detection
EMNLP 2021
Region under Discussion for visual dialog
EMNLP 2021
WhyAct: Identifying Action Reasons in Lifestyle Vlogs
EMNLP 2021
<
1
…
418
419
420
…
523
>