conftrace_

Artificial Intelligence › Core AI ›

Multimodal Learning

13,057 papers

Papers per year

Papers

Dynamic Refinement Network for Oriented and Densely Packed Object Detection CVPR 2020

Universal Weighting Metric Learning for Cross-Modal Matching CVPR 2020

PhraseCut: Language-Based Image Segmentation in the Wild CVPR 2020

Learning User Representations for Open Vocabulary Image Hashtag Prediction CVPR 2020

DAVD-Net: Deep Audio-Aided Video Decompression of Talking Heads CVPR 2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension CVPR 2020

The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction CVPR 2020

Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning CVPR 2020

Q-learning with Language Model for Edit-based Unsupervised Summarization EMNLP 2020

Learning to Represent Image and Text with Denotation Graph EMNLP 2020

Does my multimodal model learn cross-modal interactions? It’s harder to tell than you might think! EMNLP 2020

Reading Between the Lines: Exploring Infilling in Visual Narratives EMNLP 2020

Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics EMNLP 2020

Table Fact Verification with Structure-Aware Transformer EMNLP 2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis EMNLP 2020

Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos EMNLP 2020

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues EMNLP 2020

Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents EMNLP 2020

HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training EMNLP 2020

Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision EMNLP 2020

Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News EMNLP 2020

Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product EMNLP 2020

Neural Deepfake Detection with Factual Structure of Text EMNLP 2020

TED-CDB: A Large-Scale Chinese Discourse Relation Dataset on TED Talks EMNLP 2020

STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question Answering EMNLP 2020