conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
ACL 2020
Cross-Modality Relevance for Reasoning on Language and Vision
ACL 2020
A negative case analysis of visual grounding methods for VQA
ACL 2020
History for Visual Dialog: Do we really need it?
ACL 2020
A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization
ACL 2020
GAIA: A Fine-grained Multimedia Knowledge Extraction System
ACL 2020
Adaptive Transformers for Learning Multimodal Representations
ACL 2020
Non-Topical Coherence in Social Talk: A Call for Dialogue Model Enrichment
ACL 2020
Achieving Common Ground in Multi-modal Dialogue
ACL 2020
Extending ImageNet to Arabic using Arabic WordNet
ACL 2020
On the role of effective and referring questions in GuessWhat?!
ACL 2020
Latent Alignment of Procedural Concepts in Multimodal Recipes
ACL 2020
Dynamic Sentence Boundary Detection for Simultaneous Translation
ACL 2020
BIT’s system for the AutoSimTrans 2020
ACL 2020
Towards Visual Dialog for Radiology
ACL 2020
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
ACL 2020
Multilogue-Net: A Context-Aware RNN for Multi-modal Emotion Detection and Sentiment Analysis in Conversation
ACL 2020
Low Rank Fusion based Transformers for Multimodal Sequences
ACL 2020
Leveraging Multimodal Behavioral Analytics for Automated Job Interview Performance Assessment and Feedback
ACL 2020
Audio-Visual Understanding of Passenger Intents for In-Cabin Conversational Agents
ACL 2020
AI Sensing for Robotics using Deep Learning based Visual and Language Modeling
ACL 2020
Sky + Fire = Sunset. Exploring Parallels between Visually Grounded Metaphors and Image Classifiers
ACL 2020
Statistical Deep Parsing for Spanish Using Neural Networks
ACL 2020
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
ACL 2020
DiDi Labs’ End-to-end System for the IWSLT 2020 Offline Speech TranslationTask
ACL 2020
<
1
…
448
449
450
…
523
>