Research Explorer
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Mind Your Special Tokens! On the Importance of Dedicated Sequence-End Tokens in Vision-Language Embedding Models
EACL 2026
Colorism in Multimodal AI: An Empirical Exploration of Socioeconomic Linguistic Bias in Text-to-Image Generation
EACL 2026
From Paper to Structured JSON: An Agentic AI Workflow for Compliant BMR Digital Transformation
EACL 2026
MobileCity: An Efficient Framework for Large-Scale Urban Behavior Simulation
EACL 2026
PatentVision: A multimodal method for drafting patent applications
EACL 2026
MemeWeaver: Inter-Meme Graph Reasoning for Sexism and Misogyny Detection
EACL 2026
Do GUI Grounders Truly Understand UI Elements?
EACL 2026
TABED: Test-Time Adaptive Ensemble Drafting for Robust Speculative Decoding in LVLMs
EACL 2026
VIGiA: Instructional Video Guidance via Dialogue Reasoning and Retrieval
EACL 2026
BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
EACL 2026
Crafting Adversarial Inputs for Large Vision-Language Models Using Black-Box Optimization
EACL 2026
HACS-TL: Cross-Script Transfer Learning for Hausa Ajami Hate Speech Detection Using Transformer-Based Architecture
EACL 2026
Arabic-Adapted One-Step Speech-to-Diacritized ASR: Evaluation and Error Analysis
EACL 2026
Data-Centric Approach at the LoResMT 2026 Turkic Translation Challenge: Russian-Kyrgyz
EACL 2026
PolyFrame at MWE-2026 AdMIRe 2: When Words Are Not Enough: Multimodal Idiom Disambiguation
EACL 2026
VisAffect at MWE-2026 AdMIRe 2: IMMCAN Idiom Multimodal Cross-Attention Network
EACL 2026
ITUNLP2 at MWE-2026 AdMIRe 2: Modular Zero-Shot Pipelines for Multimodal Idiom Grounding and Ranking
EACL 2026
Towards Robust Evaluation of Visual Activity Recognition: Resolving Verb Ambiguity with Sense Clustering
EACL 2026
RTMol: Rethinking Molecule-text Alignment in a Round-trip View
AAAI 2026
3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale
AAAI 2026
S²Drug: Bridging Protein Sequence and 3D Structure in Contrastive Representation Learning for Virtual Screening
AAAI 2026
PDE-Driven Spatiotemporal Generative Modeling for Multilead ECG Synthesis
AAAI 2026
Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra
AAAI 2026
SyncBrain: Exploring Brain Functional Dynamics Through Neural Oscillatory Synchronization
AAAI 2026
Game Ground Bench: Probing the Limits of LVLMs in Complex Semantic Grounding Across Game Universes
AAAI 2026