Multimodal Learning

13,057 papers

Papers

MemeDetoxNet: Balancing Toxicity Reduction and Context Preservation ACL 2025

From Perception to Reasoning: Enhancing Vision-Language Models for Mobile UI Understanding ACL 2025

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration ACL 2025

Can Hallucination Correction Improve Video-Language Alignment? ACL 2025

Blinded by Context: Unveiling the Halo Effect of MLLM in AI Hiring ACL 2025

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation ACL 2025

M2-TabFact: Multi-Document Multi-Modal Fact Verification with Visual and Textual Representations of Tabular Data ACL 2025

CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization ACL 2025

NBDESCRIB: A Dataset for Text Description Generation from Tables and Code in Jupyter Notebooks with Guidelines ACL 2025

EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations ACL 2025

Can Vision Language Models Understand Mimed Actions? ACL 2025

MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models ACL 2025

Challenging Multimodal LLMs with African Standardized Exams: A Document VQA Evaluation ACL 2025

Modeling Background Knowledge with Frame Semantics for Fine-grained Sentiment Classification ACL 2025

Simulating Emotional Intelligence in LLMs through Behavioral Conditioning and Analogical Retrieval ACL 2025

Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies ACL 2025

Overview of MM-ArgFallacy2025 on Multimodal Argumentative Fallacy Detection and Classification in Political Debates ACL 2025

Argumentative Fallacy Detection in Political Debates ACL 2025

Multimodal Argumentative Fallacy Classification in Political Debates ACL 2025

Prompt-Guided Augmentation and Multi-modal Fusion for Argumentative Fallacy Classification in Political Debates ACL 2025

Leveraging Context for Multimodal Fallacy Classification in Political Debates ACL 2025

MateInfoUB: A Real-World Benchmark for Testing LLMs in Competitive, Multilingual, and Multimodal Educational Tasks ACL 2025

Challenges for AI in Multimodal STEM Assessments: a Human-AI Comparison ACL 2025

Enhancing Stress Detection on Social Media Through Multi-Modal Fusion of Text and Synthesized Visuals ACL 2025

UniBuc-SB at ArchEHR-QA 2025: A Resource-Constrained Pipeline for Relevance Classification and Grounded Answer Synthesis ACL 2025