conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
MemeDetoxNet: Balancing Toxicity Reduction and Context Preservation
ACL 2025
From Perception to Reasoning: Enhancing Vision-Language Models for Mobile UI Understanding
ACL 2025
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration
ACL 2025
Can Hallucination Correction Improve Video-Language Alignment?
ACL 2025
Blinded by Context: Unveiling the Halo Effect of MLLM in AI Hiring
ACL 2025
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
ACL 2025
M2-TabFact: Multi-Document Multi-Modal Fact Verification with Visual and Textual Representations of Tabular Data
ACL 2025
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
ACL 2025
NBDESCRIB: A Dataset for Text Description Generation from Tables and Code in Jupyter Notebooks with Guidelines
ACL 2025
EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations
ACL 2025
Can Vision Language Models Understand Mimed Actions?
ACL 2025
MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
ACL 2025
Challenging Multimodal LLMs with African Standardized Exams: A Document VQA Evaluation
ACL 2025
Modeling Background Knowledge with Frame Semantics for Fine-grained Sentiment Classification
ACL 2025
Simulating Emotional Intelligence in LLMs through Behavioral Conditioning and Analogical Retrieval
ACL 2025
Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies
ACL 2025
Overview of MM-ArgFallacy2025 on Multimodal Argumentative Fallacy Detection and Classification in Political Debates
ACL 2025
Argumentative Fallacy Detection in Political Debates
ACL 2025
Multimodal Argumentative Fallacy Classification in Political Debates
ACL 2025
Prompt-Guided Augmentation and Multi-modal Fusion for Argumentative Fallacy Classification in Political Debates
ACL 2025
Leveraging Context for Multimodal Fallacy Classification in Political Debates
ACL 2025
MateInfoUB: A Real-World Benchmark for Testing LLMs in Competitive, Multilingual, and Multimodal Educational Tasks
ACL 2025
Challenges for AI in Multimodal STEM Assessments: a Human-AI Comparison
ACL 2025
Enhancing Stress Detection on Social Media Through Multi-Modal Fusion of Text and Synthesized Visuals
ACL 2025
UniBuc-SB at ArchEHR-QA 2025: A Resource-Constrained Pipeline for Relevance Classification and Grounded Answer Synthesis
ACL 2025
<
1
…
81
82
83
…
523
>