Research Explorer
Multi-Modal Learning
3194 directly classified papers
Papers per year
2003: 1
2010: 1
2011: 1
2013: 5
2014: 3
2015: 9
2016: 23
2017: 49
2018: 78
2019: 158
2020: 223
2021: 261
2022: 354
2023: 471
2024: 705
2025: 835
2026: 17
Papers
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer
AAAI 2024
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance
AAAI 2024
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
AAAI 2024
Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
AAAI 2024
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger
AAAI 2024
Improving Audio-Visual Segmentation with Bidirectional Generation
AAAI 2024
Deep Correlated Prompting for Visual Recognition with Missing Modalities
NeurIPS 2024
Mixture of In-Context Experts Enhance LLMs' Long Context Awareness
NeurIPS 2024
Multimodal Ensembling for Zero-Shot Image Classification
AAAI 2024
THGFormer: Time-Aware Hypergraph Learning for Multimodal Social Media Popularity Prediction (Student Abstract)
AAAI 2024
Virtual Try-On: Real-Time Interactive Hybrid Network with High-Fidelity
AAAI 2024
LAMM: Label Alignment for Multi-Modal Prompt Learning
AAAI 2024
Prompting Multi-Modal Image Segmentation with Semantic Grouping
AAAI 2024
Learning Representations for Robust Human-Robot Interaction
AAAI 2024
AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System
AAAI 2024
Towards Holistic, Pragmatic and Multimodal Conversational Systems
AAAI 2024
Early Detection of Extreme Storm Tide Events Using Multimodal Data Processing
AAAI 2024
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
AAAI 2024
Tell Me What Is Good about This Property: Leveraging Reviews for Segment-Personalized Image Collection Summarization
AAAI 2024
Automated Defect Report Generation for Enhanced Industrial Quality Control
AAAI 2024
Visual Hallucination Elevates Speech Recognition
AAAI 2024
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection
AAAI 2024
Bidirectional Contrastive Split Learning for Visual Question Answering
AAAI 2024
JoLT: Jointly Learned Representations of Language and Time-Series for Clinical Time-Series Interpretation (Student Abstract)
AAAI 2024