Multi-Modal Learning

1213 directly classified papers

Papers

Learning From Temporal Gradient for Semi-Supervised Action Recognition CVPR 2022

Personalized Image Aesthetics Assessment With Rich Attributes CVPR 2022

Multimodal Token Fusion for Vision Transformers CVPR 2022

Balanced Multimodal Learning via On-the-Fly Gradient Modulation CVPR 2022

PoseKernelLifter: Metric Lifting of 3D Human Pose Using Sound CVPR 2022

MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation CVPR 2022

Learning Based Multi-Modality Image and Video Compression CVPR 2022

Dynamic 3D Gaze From Afar: Deep Gaze Estimation From Temporal Eye-Head-Body Coordination CVPR 2022

Expressive Talking Head Generation With Granular Audio-Visual Control CVPR 2022

Multi-Modal Alignment Using Representation Codebook CVPR 2022

Efficient Two-Stage Detection of Human-Object Interactions With a Novel Unary-Pairwise Transformer CVPR 2022

Text2Pos: Text-to-Point-Cloud Cross-Modal Localization CVPR 2022

Effective Conditioned and Composed Image Retrieval Combining CLIP-Based Features CVPR 2022

Mutual Quantization for Cross-Modal Search With Noisy Labels CVPR 2022

Multi-View Transformer for 3D Visual Grounding CVPR 2022

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading CVPR 2022

Learning Modal-Invariant and Temporal-Memory for Video-Based Visible-Infrared Person Re-Identification CVPR 2022

Decoupling Zero-Shot Semantic Segmentation CVPR 2022

Negative-Aware Attention Framework for Image-Text Matching CVPR 2022

HSC4D: Human-Centered 4D Scene Capture in Large-Scale Indoor-Outdoor Space Using Wearable IMUs and LiDAR CVPR 2022

ClothFormer: Taming Video Virtual Try-On in All Module CVPR 2022

KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning CVPR 2022

Guiding Visual Question Generation NAACL 2022

RecInDial: A Unified Framework for Conversational Recommendation with Pretrained Language Models AACL 2022

Persona or Context? Towards Building Context adaptive Personalized Persuasive Virtual Sales Assistant AACL 2022