Multimodal Learning

13,057 papers

Papers

Positive Sample Propagation Along the Audio-Visual Event Line CVPR 2021

StylePeople: A Generative Model of Fullbody Human Avatars CVPR 2021

Intentonomy: A Dataset and Study Towards Human Intent Understanding CVPR 2021

Hybrid Message Passing With Performance-Driven Structures for Facial Action Unit Detection CVPR 2021

HOTR: End-to-End Human-Object Interaction Detection With Transformers CVPR 2021

ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References CVPR 2021

Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and 3D Room Layouts CVPR 2021

Learning Triadic Belief Dynamics in Nonverbal Communication From Videos CVPR 2021

GANmut: Learning Interpretable Conditional Space for Gamut of Emotions CVPR 2021

TediGAN: Text-Guided Diverse Face Image Generation and Manipulation CVPR 2021

Towards Diverse Paragraph Captioning for Untrimmed Videos CVPR 2021

Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement CVPR 2021

Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval CVPR 2021

Roses Are Red, Violets Are Blue... but Should VQA Expect Them To? CVPR 2021

Understanding Object Dynamics for Interactive Image-to-Video Synthesis CVPR 2021

Semantic Audio-Visual Navigation CVPR 2021

PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds CVPR 2021

Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering CVPR 2021

Structured Scene Memory for Vision-Language Navigation CVPR 2021

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting CVPR 2021

Predicting Human Scanpaths in Visual Question Answering CVPR 2021

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision CVPR 2021

Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting With Their Explanations CVPR 2021

Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval CVPR 2021

Learning Camera Localization via Dense Scene Matching CVPR 2021