Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Multi-Modal Learning
1213 directly classified papers
Papers per year
2007: 2
2008: 1
2009: 1
2011: 2
2012: 5
2013: 5
2014: 1
2015: 5
2016: 8
2017: 21
2018: 42
2019: 42
2020: 69
2021: 72
2022: 149
2023: 143
2024: 258
2025: 370
2026: 17
Papers
Learning From Temporal Gradient for Semi-Supervised Action Recognition
CVPR 2022
Personalized Image Aesthetics Assessment With Rich Attributes
CVPR 2022
Multimodal Token Fusion for Vision Transformers
CVPR 2022
Balanced Multimodal Learning via On-the-Fly Gradient Modulation
CVPR 2022
PoseKernelLifter: Metric Lifting of 3D Human Pose Using Sound
CVPR 2022
MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation
CVPR 2022
Learning Based Multi-Modality Image and Video Compression
CVPR 2022
Dynamic 3D Gaze From Afar: Deep Gaze Estimation From Temporal Eye-Head-Body Coordination
CVPR 2022
Expressive Talking Head Generation With Granular Audio-Visual Control
CVPR 2022
Multi-Modal Alignment Using Representation Codebook
CVPR 2022
Efficient Two-Stage Detection of Human-Object Interactions With a Novel Unary-Pairwise Transformer
CVPR 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
CVPR 2022
Effective Conditioned and Composed Image Retrieval Combining CLIP-Based Features
CVPR 2022
Mutual Quantization for Cross-Modal Search With Noisy Labels
CVPR 2022
Multi-View Transformer for 3D Visual Grounding
CVPR 2022
Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading
CVPR 2022
Learning Modal-Invariant and Temporal-Memory for Video-Based Visible-Infrared Person Re-Identification
CVPR 2022
Decoupling Zero-Shot Semantic Segmentation
CVPR 2022
Negative-Aware Attention Framework for Image-Text Matching
CVPR 2022
HSC4D: Human-Centered 4D Scene Capture in Large-Scale Indoor-Outdoor Space Using Wearable IMUs and LiDAR
CVPR 2022
ClothFormer: Taming Video Virtual Try-On in All Module
CVPR 2022
KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning
CVPR 2022
Guiding Visual Question Generation
NAACL 2022
RecInDial: A Unified Framework for Conversational Recommendation with Pretrained Language Models
AACL 2022
Persona or Context? Towards Building Context adaptive Personalized Persuasive Virtual Sales Assistant
AACL 2022
<
1
…
32
33
34
…
49
>