Multimodal Learning

13,057 papers

Papers

Beyond Words: Multilingual and Multimodal Red Teaming of MLLMs ACL 2025

MultiReflect: Multimodal Self-Reflective RAG-based Automated Fact-Checking ACL 2025

CollEX – A Multimodal Agentic RAG System Enabling Interactive Exploration of Scientific Collections ACL 2025

Cross-modal Clustering-based Retrieval for Scalable and Robust Image Captioning ACL 2025

Multimodal Retrieval-Augmented Generation: Unified Information Processing Across Text, Image, Table, and Video Modalities ACL 2025

Making LVLMs Look Twice: Contrastive Decoding with Contrast Images ACL 2025

MT2ST: Adaptive Multi-Task to Single-Task Learning ACL 2025

CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ) ACL 2025

Adaptive Linguistic Prompting (ALP) Enhances Phishing Webpage Detection in Multimodal Large Language Models ACL 2025

Tapping into Social Media in Crisis: A Survey ACL 2025

Hidden Forms: A Dataset to Fill Masked Interfaces from Language Commands ACL 2025

A Conversational Agent Framework for Multimodal Knowledge Retrieval: A Case Study in FHWA InfoHighway Web Portal Queries ACL 2025

VisTRA: Visual Tool-use Reasoning Analyzer for Small Object Visual Question Answering ACL 2025

Inductive Learning on Heterogeneous Graphs Enhanced by LLMs for Software Mention Detection ACL 2025

SciVQA 2025: Overview of the First Scientific Visual Question Answering Shared Task ACL 2025

Visual Question Answering on Scientific Charts Using Fine-Tuned Vision-Language Models ACL 2025

Coling-UniA at SciVQA 2025: Few-Shot Example Retrieval and Confidence-Informed Ensembling for Multimodal Large Language Models ACL 2025

Instruction-tuned QwenChart for Chart Question Answering ACL 2025

Enhancing Scientific Visual Question Answering through Multimodal Reasoning and Ensemble Modeling ACL 2025

CTYUN-AI at SemEval-2025 Task 1: Learning to Rank for Idiomatic Expressions ACL 2025

JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models ACL 2025

YNU-HPCC at SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Using Multiple Prediction Headers ACL 2025

daalft at SemEval-2025 Task 1: Multi-step Zero-shot Multimodal Idiomaticity Ranking ACL 2025

UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation ACL 2025

YNU-HPCC at SemEval-2025 Task 2: Local Cache and Online Retrieval-Based method for Entity-Aware Machine Translation ACL 2025