conftrace
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Beyond Words: Multilingual and Multimodal Red Teaming of MLLMs
ACL 2025
MultiReflect: Multimodal Self-Reflective RAG-based Automated Fact-Checking
ACL 2025
CollEX – A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections
ACL 2025
Cross-modal Clustering-based Retrieval for Scalable and Robust Image Captioning
ACL 2025
Multimodal Retrieval-Augmented Generation: Unified Information Processing Across Text, Image, Table, and Video Modalities
ACL 2025
Making LVLMs Look Twice: Contrastive Decoding with Contrast Images
ACL 2025
MT2ST: Adaptive Multi-Task to Single-Task Learning
ACL 2025
CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ)
ACL 2025
Adaptive Linguistic Prompting (ALP) Enhances Phishing Webpage Detection in Multimodal Large Language Models
ACL 2025
Tapping into Social Media in Crisis: A Survey
ACL 2025
Hidden Forms: A Dataset to Fill Masked Interfaces from Language Commands
ACL 2025
A Conversational Agent Framework for Multimodal Knowledge Retrieval: A Case Study in FHWA InfoHighway Web Portal Queries
ACL 2025
VisTRA: Visual Tool-use Reasoning Analyzer for Small Object Visual Question Answering
ACL 2025
Inductive Learning on Heterogeneous Graphs Enhanced by LLMs for Software Mention Detection
ACL 2025
SciVQA 2025: Overview of the First Scientific Visual Question Answering Shared Task
ACL 2025
Visual Question Answering on Scientific Charts Using Fine-Tuned Vision-Language Models
ACL 2025
Coling-UniA at SciVQA 2025: Few-Shot Example Retrieval and Confidence-Informed Ensembling for Multimodal Large Language Models
ACL 2025
Instruction-tuned QwenChart for Chart Question Answering
ACL 2025
Enhancing Scientific Visual Question Answering through Multimodal Reasoning and Ensemble Modeling
ACL 2025
CTYUN-AI at SemEval-2025 Task 1: Learning to Rank for Idiomatic Expressions
ACL 2025
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
ACL 2025
YNU-HPCC at SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Using Multiple Prediction Headers
ACL 2025
daalft at SemEval-2025 Task 1: Multi-step Zero-shot Multimodal Idiomaticity Ranking
ACL 2025
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation
ACL 2025
YNU-HPCC at SemEval-2025 Task 2: Local Cache and Online Retrieval-Based method for Entity-Aware Machine Translation
ACL 2025