Artificial Intelligence › Core AI ›

Multimodal Learning

13057 directly classified papers

Papers per year

Papers

Noise-Blind Image Deblurring CVPR 2017

Neural Response Generation via GAN with an Approximate Embedding Layer EMNLP 2017

Material Classification Using Frequency- and Depth-Dependent Time-Of-Flight Distortion CVPR 2017

Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking CVPR 2017

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images CVPR 2017

openXBOW -- Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit JMLR 2017

Multilingual Hierarchical Attention Networks for Document Classification IJCNLP 2017

Dataset for a Neural Natural Language Interface for Databases (NNLIDB) IJCNLP 2017

Joint Learning of Dialog Act Segmentation and Recognition in Spoken Dialog Using Neural Networks IJCNLP 2017

Named Entity Recognition with Stack Residual LSTM and Trainable Bias Decoding IJCNLP 2017

Extracting Visual Knowledge from the Web with Multimodal Learning IJCAI 2017

ES-LDA: Entity Summarization using Knowledge-based Topic Modeling IJCNLP 2017

Improving Black-box Speech Recognition using Semantic Parsing IJCNLP 2017

NLPSA at IJCNLP-2017 Task 2: Imagine Scenario: Leveraging Supportive Images for Dimensional Sentiment Analysis IJCNLP 2017

Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing IJCNLP 2017

Tag-Enhanced Tree-Structured Neural Networks for Implicit Discourse Relation Classification IJCNLP 2017

Guided Open Vocabulary Image Captioning with Constrained Beam Search EMNLP 2017

Deriving continous grounded meaning representations from referentially structured multimodal contexts EMNLP 2017

Computational Imaging on the Electric Grid CVPR 2017

Video Highlight Prediction Using Audience Chat Reactions EMNLP 2017

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension CVPR 2017

An Analysis of Action Recognition Datasets for Language and Vision Tasks ACL 2017

Classifying Temporal Relations by Bidirectional LSTM over Dependency Paths ACL 2017

ROAM: A Rich Object Appearance Model With Application to Rotoscoping CVPR 2017

Where is Misty? Interpreting Spatial Descriptors by Modeling Regions in Space EMNLP 2017