conftrace
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Improving Multilingual Sign Language Translation with Automatically Clustered Language Family Information
COLING 2025
Investigating the Impact of Incremental Processing and Voice Activity Projection on Spoken Dialogue Systems
COLING 2025
HyperHatePrompt: A Hypergraph-based Prompting Fusion Model for Multimodal Hate Detection
COLING 2025
StoryLLaVA: Enhancing Visual Storytelling with Multi-Modal Large Language Models
COLING 2025
Unified Grid Tagging Scheme for Aspect Sentiment Quad Prediction
COLING 2025
ACE-M3: Automatic Capability Evaluator for Multimodal Medical Models
COLING 2025
A Dual Contrastive Learning Framework for Enhanced Multimodal Conversational Emotion Recognition
COLING 2025
KIA: Knowledge-Guided Implicit Vision-Language Alignment for Chest X-Ray Report Generation
COLING 2025
On the Human-level Performance of Visual Question Answering
COLING 2025
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
COLING 2025
MPID: A Modality-Preserving and Interaction-Driven Fusion Network for Multimodal Sentiment Analysis
COLING 2025
Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models
COLING 2025
Fusion meets Function: The Adaptive Selection-Generation Approach in Event Argument Extraction
COLING 2025
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
COLING 2025
Modal Feature Optimization Network with Prompt for Multimodal Sentiment Analysis
COLING 2025
Multimodal Fact-Checking with Vision Language Models: A Probing Classifier based Solution with Embedding Strategies
COLING 2025
Improving the Efficiency of Visually Augmented Language Models
COLING 2025
A Knowledge Graph Reasoning-Based Model for Computerized Adaptive Testing
COLING 2025
Multimodal Extraction and Recognition of Arabic Implicit Discourse Relations
COLING 2025
RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents
COLING 2025
Persona-Consistent Dialogue Generation via Pseudo Preference Tuning
COLING 2025
Towards Cross-Lingual Audio Abuse Detection in Low-Resource Settings with Few-Shot Learning
COLING 2025
Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models
COLING 2025
CateEA: Enhancing Entity Alignment via Implicit Category Supervision
COLING 2025
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation
COLING 2025