SNAG: Spoken Narratives and Gaze Dataset

Preethi Vaidyanathan; Emily T. Prud’hommeaux; Jeff B. Pelz; Cecilia O. Alm

2018 ACL ACL 2018

SNAG: Spoken Narratives and Gaze Dataset

Abstract

AbstractHumans rely on multiple sensory modalities when examining and reasoning over images. In this paper, we describe a new multimodal dataset that consists of gaze measurements and spoken descriptions collected in parallel during an image inspection task. The task was performed by multiple participants on 100 general-domain images showing everyday objects and activities. We demonstrate the usefulness of the dataset by applying an existing visual-linguistic data fusion framework in order to label important image regions with appropriate linguistic labels.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning

🧭 Keyword Pioneer — multimodal dataset

🐣 Hot Topic Early Bird — multimodal dataset

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Preethi Vaidyanathan , Emily T. Prud’hommeaux , Jeff B. Pelz , Cecilia O. Alm

Topics

Artificial Intelligence > Core AI > Multimodal Learning Computer Vision > Core AI > Multimodal Learning Machine Learning > Learning Types > Multi-Modal Learning Artificial Intelligence > Core AI > Multi-Modal Learning

Keywords

multimodal learning eye tracking multimodal dataset image understanding spoken language gaze tracking visual-linguistic fusion gaze measurement spoken description image region labeling visual linguistic fusion spoken narrative visual-linguistic data fusion

Download PDF

Related papers

Economic Event Detection in Company-Specific News Text 2018

Investigating Effective Parameters for Fine-tuning of Word Embeddings Using Only a Small Corpus 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment 2018

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer 2018

Affordances in Grounded Language Learning 2018