Papers
88 papers found
Expressing Objects Just Like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen, Jiebo Luo
Interactive Visualizations of Word Embeddings for K-12 Students
Saptarashmi Bandyopadhyay, Jason Xu, Neel Pawar et al.
Seeing the advantage: visually grounding word embeddings to better capture human semantic knowledge
Danny Merkx, Stefan Frank, Mirjam Ernestus
PoliTo at SemEval-2023 Task 1: CLIP-based Visual-Word Sense Disambiguation Based on Back-Translation
Lorenzo Vaiani, Luca Cagliero, Paolo Garza
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training
Hassan Shahmohammadi, Hendrik P. A. Lensch, R. Harald Baayen
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training
Hassan Shahmohammadi, Hendrik P. A. Lensch, R. Harald Baayen
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis
Hengshun Zhou, Jun Du, Gongzhen Zou et al.
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting
Haotian Wang, Jun Du, Hengshun Zhou et al.
PoliTo at SemEval-2023 Task 1: CLIP-based Visual-Word Sense Disambiguation Based on Back-Translation
Lorenzo Vaiani, Luca Cagliero, Paolo Garza
Seeing Words Differently: Visual Embeddings for Robust English-Arabic Machine Translation
Mahdi Alshaikh Saleh, Irfan Ahmad
Obtaining referential word meanings from visual and distributional information: Experiments on object naming
Sina Zarrieß, David Schlangen
Generating Pedagogically Meaningful Visuals for Math Word Problems: A New Benchmark and Analysis of Text-to-Image Models
Junling Wang, Anna Rutkiewicz, April Wang et al.
From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing
Jingxuan Wei, Cheng Tan, Qi Chen et al.
ViCo: Word Embeddings From Visual Co-Occurrences
Tanmay Gupta, Alexander Schwing, Derek Hoiem
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Puyuan Peng, David Harwath
Quantifying the Visual Concreteness of Words and Topics in Multimodal Datasets
Jack Hessel, David Mimno, Lillian Lee
Wikipedia2Vec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from Wikipedia
Ikuya Yamada, Akari Asai, Jin Sakuma et al.
Waffling Around for Performance: Visual Classification with Random Words and Broad Concepts
Karsten Roth, Jae Myung Kim, A. Sophia Koepke et al.
VCWE: Visual Character-Enhanced Word Embeddings
Chi Sun, Xipeng Qiu, Xuanjing Huang
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas
Unsupervised Learning of Visual Sense Models for Polysemous Words
Kate Saenko, Trevor Darrell
Sub-Word Level Lip Reading With Visual Attention
K R Prajwal, Triantafyllos Afouras, Andrew Zisserman
More Than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech
Michael Hassid, Michelle Tadmor Ramanovich, Brendan Shillingford et al.