Multimodal Learning

13057 directly classified papers

Papers

Listen to the Image CVPR 2019

Controllable Text Simplification with Lexical Constraint Loss ACL 2019

Multimodal Abstractive Summarization for How2 Videos ACL 2019

Show, Describe and Conclude: On Exploiting the Structure Information of Chest X-ray Reports ACL 2019

Towards Comprehensive Description Generation from Factual Attribute-value Tables ACL 2019

Faithful Multimodal Explanation for Visual Question Answering ACL 2019

Model-Blind Video Denoising via Frame-To-Frame Training CVPR 2019

Learning to Explain With Complemental Examples CVPR 2019

Intention Oriented Image Captions With Guiding Objects CVPR 2019

Visual Query Answering by Entity-Attribute Graph Matching and Reasoning CVPR 2019

Recurrent Neural Networks With Intra-Frame Iterations for Video Deblurring CVPR 2019

Speech2Face: Learning the Face Behind a Voice CVPR 2019

Deep Video Inpainting CVPR 2019

Detailed Human Shape Estimation From a Single Image by Hierarchical Mesh Deformation CVPR 2019

Self-Supervised Spatio-Temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics CVPR 2019

Large-Scale Long-Tailed Recognition in an Open World CVPR 2019

Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition With Multimodal Training CVPR 2019

Pushing the Boundaries of View Extrapolation With Multiplane Images CVPR 2019

What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues ACL 2019

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations ACL 2019

Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers ACL 2019

Expressing Visual Relationships via Language ACL 2019

Multilingual Unsupervised NMT using Shared Encoder and Language-Specific Decoders ACL 2019

Dense Procedure Captioning in Narrated Instructional Videos ACL 2019

Symbolic Inductive Bias for Visually Grounded Learning of Spoken Language ACL 2019