Research Explorer
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use
AAAI 2026
Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models
AAAI 2026
DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt
AAAI 2026
SatSolarCast: A Flexible Framework for Multimodal Solar Irradiance Forecasting via Memory-Alignment Learning
AAAI 2026
Building Instance Segmentation for Dense Urban Settlements
AAAI 2026
Multivariate Gaussian Representation Learning for Medical Action Evaluation
AAAI 2026
10 Open Challenges Steering the Future of Vision-Language-Action Models
AAAI 2026
Data-Efficient and Contact-Rich Manipulation Through Diffusion Augmentation and Vision-Language Models
AAAI 2026
From Representation to Reasoning: Toward General-Purpose Visual Intelligence
AAAI 2026
Talking Trails: LLM-Enhanced Spatiotemporal Trajectory Modeling for E-Bike Delivery Route Planning
AAAI 2026
ConstructAI: From Real-Time Safety Insight to Skill Growth in Deployed Construction AI Systems
AAAI 2026
Automated Unified Reasoning with Vision-Language Models for Multi-modal Burn Assessment
AAAI 2026
Speaker Anonymization for Children's Oral Reading Assessment
AAAI 2026
A Data-Centric Analysis of the Impact of Training Data Quality vs. Quantity on P300 Brain-Computer Interface Performance (Student Abstract)
AAAI 2026
Q-MoFusion: A Quantum Classifier for Mosquito Species Classification (Student Abstract)
AAAI 2026
An Approach Towards Developing Relationally Intelligent Multimodal Framework for Stock Movement Prediction (Student Abstract)
AAAI 2026
Federated Cross-Modal Style-Aware Prompt Generation (Student Abstract)
AAAI 2026
Multimodal Digital Phenotyping for Early Prediction of Manic Episodes Through Keystroke Dynamics and Circadian Pattern Analysis
AAAI 2026
Building Interpretable, Trust-worthy Systems for Neural Signal Decoding
AAAI 2026
MulTiCast: A Multimodal Time Series Forecasting System
AAAI 2026
VitalDiagnosis: AI-Driven Ecosystem for 24/7 Vital Monitoring and Chronic Disease Management
AAAI 2026
Drifting Away from Truth: GenAI-Driven News Diversity Challenges LVLM-Based Misinformation Detection
AAAI 2026
anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding
AAAI 2026
Multimodal Table Understanding with Difficulty-aware Reinforcement Learning
AAAI 2026
OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
AAAI 2026