conftrace
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Sattiy at SemEval-2021 Task 9: An Ensemble Solution for Statement Verification and Evidence Finding with Tables
ACL 2021
CLUZH at SIGMORPHON 2021 Shared Task on Multilingual Grapheme-to-Phoneme Conversion: Variations on a Baseline
ACL 2021
Improved pronunciation prediction accuracy using morphology
ACL 2021
Visually Grounded Follow-up Questions: a Dataset of Spatial Questions Which Require Dialogue History
ACL 2021
Towards Navigation by Reasoning over Spatial Configurations
ACL 2021
Improvements and Extensions on Metaphor Detection
ACL 2021
Rakuten’s Participation in WAT 2021: Examining the Effectiveness of Pre-trained Models for Multilingual and Multimodal Machine Translation
ACL 2021
Improved English to Hindi Multimodal Neural Machine Translation
ACL 2021
IITP at WAT 2021: System description for English-Hindi Multimodal Translation Task
ACL 2021
ViTA: Visual-Linguistic Translation by Aligning Object Tags
ACL 2021
TMEKU System for the WAT2021 Multimodal Translation Task
ACL 2021
Memes in the Wild: Assessing the Generalizability of the Hateful Memes Challenge Dataset
ACL 2021
VL-BERT+: Detecting Protected Groups in Hateful Multimodal Memes
ACL 2021
Racist or Sexist Meme? Classifying Memes beyond Hateful
ACL 2021
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes
ACL 2021
Neural Graph Filtering for Context-aware Recommendation
ACML 2021
Bridging Code-Text Representation Gap using Explanation
ACML 2021
Relation Also Need Attention: Integrating Relation Information Into Image Captioning
ACML 2021
Neural Function Modules with Sparse Arguments: A Dynamic Approach to Integrating Information across Layers
AISTATS 2021
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
CONLL 2021
Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color
CONLL 2021
Empathetic Dialog Generation with Fine-Grained Intents
CONLL 2021
Enriching Language Models with Visually-grounded Word Vectors and the Lancaster Sensorimotor Norms
CONLL 2021
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training
CONLL 2021
Does language help generalization in vision models?
CONLL 2021