Research Explorer
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Hallucination Detection for Grounded Instruction Generation
EMNLP 2023
Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
EMNLP 2023
That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
EMNLP 2023
Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
EMNLP 2023
Context or Knowledge is Not Always Necessary: A Contrastive Learning Framework for Emotion Recognition in Conversations
ACL 2023
See How You Read? Multi-Reading Habits Fusion Reasoning for Multi-Modal Fake News Detection
AAAI 2023
Pic2Word: Mapping Pictures to Words for Zero-Shot Composed Image Retrieval
CVPR 2023
Dynamic Regularization in UDA for Transformers in Multimodal Classification
ACL 2023
Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
ACL 2023
Class-Incremental Grouping Network for Continual Audio-Visual Learning
ICCV 2023
Compositional Mathematical Encoding for Math Word Problems
ACL 2023
MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System
ACL 2023
Efficient RGB-T Tracking via Cross-Modality Distillation
CVPR 2023
Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
CVPR 2023
Delivering Arbitrary-Modal Semantic Segmentation
CVPR 2023
ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis
SEMEVAL 2023
DN at SemEval-2023 Task 12: Low-Resource Language Text Classification via Multilingual Pretrained Language Model Fine-tuning
SEMEVAL 2023
SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
AAAI 2023
Cross-Modal Distillation for Speaker Recognition
AAAI 2023
Correct for Whom? Subjectivity and the Evaluation of Personalized Image Aesthetics Assessment Models
AAAI 2023
Multi-Level Confidence Learning for Trustworthy Multimodal Classification
AAAI 2023
Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
AAAI 2023
Multiview Clickbait Detection via Jointly Modeling Subjective and Objective Preference
EMNLP 2023
KeFVP: Knowledge-enhanced Financial Volatility Prediction
EMNLP 2023
Improving the Cross-Lingual Generalisation in Visual Question Answering
AAAI 2023