Spandana Gella

33 papers · 2012–2026 · 8 top CS/AI conferences

Achievements


🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (13)

🗺️ Taxonomy Completionist (52) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🔬 Deep Specialist (13) 🧬 Topic Evolution 👥 Mega-Team (39) 🌱 Topic Pioneer 🗃️ Keyword Collector (107) 📈 Trend Setter 💎 Century Club (31) 🔥 Unstoppable (10) ⚡ Prolific Year (5)

Conferences

EMNLP (13) ACL (8) NAACL (4) EACL (3) ICML (2) AAAI (1) ICLR (1) SEMEVAL (1)

Top co-authors

Siva Reddy (7) Dilek Hakkani-Tur (6) Sai Rajeswar (5) Frank Keller (5) Perouz Taslakian (4) Juan A. Rodriguez (4) David Vázquez (4) Christopher Pal (4) Arjun Akula (3) Rabiul Awal (3)

Keywords

multimodal learning (8) embodied ai (3) referring expression (2) contrastive learning (2) visual grounding (2) multimodal large language model (2) few-shot learning (2) dialogue safety (2) dialogue system (2) in-context learning (2) image retrieval (1) question answering (1) prompt engineering (1) dialogue generation (1) natural language generation (1) domain adaptation (1) multi-task learning (1) code generation (1) knowledge distillation (1) bias detection (1)

Papers

StarFlow: Generating Structured Workflow Outputs From Sketch Images (EACL 2026)
Multimodal Large Language Models for Human-AI Interaction: Foundations, Agents, and Inclusive Applications (EACL 2026)
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction (ICML 2025)
BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks (ICLR 2025)
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering (EMNLP 2025)
ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval (EMNLP 2025)
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation (EMNLP 2025)
SafeArena: Evaluating the Safety of Autonomous Web Agents (ICML 2025)
PG-Story: Taxonomy, Dataset, and Evaluation for Ensuring Child-Safe Content for Story Generation (EMNLP 2024)
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue (EMNLP 2023)
Using In-Context Learning to Improve Dialogue Safety (EMNLP 2023)
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines (EMNLP 2023)
ALFRED-L: Investigating the Role of Language for Action Learning in Interactive Visual Environments (EMNLP 2022)
Analyzing the Limits of Self-Supervision in Handling Bias in Language (EMNLP 2022)
TEACh: Task-Driven Embodied Agents That Chat (AAAI 2022)
Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions (EMNLP 2021)
Words Aren’t Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions (ACL 2020)
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) (ACL 2019)
Cross-lingual Visual Verb Sense Disambiguation (NAACL 2019)
Proceedings of the Second Workshop on Shortcomings in Vision and Language (NAACL 2019)
Multimodal Abstractive Summarization for How2 Videos (ACL 2019)
Neural Word Decomposition Models for Abusive Language Detection (ACL 2019)
A Dataset for Telling the Stories of Social Media Videos (EMNLP 2018)
Proceedings of the Third Workshop on Representation Learning for NLP (ACL 2018)
An Evaluation of Image-Based Verb Prediction Models against Human Eye-Tracking Data (NAACL 2018)
Proceedings of ACL 2017, Student Research Workshop (ACL 2017)
Image Pivoting for Learning Multilingual Multimodal Representations (EMNLP 2017)
An Analysis of Action Recognition Datasets for Language and Vision Tasks (ACL 2017)
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings (NAACL 2016)
Learning Word Sense Distributions, Detecting Unattested Senses and Identifying Novel Senses Using Topic Models (ACL 2014)
POS Tagging of English-Hindi Code-Mixed Social Media Content (EMNLP 2014)
One Sense per Tweeter ... and Other Lexical Semantic Tales of Twitter (EACL 2014)
DSS: Text Similarity Using Lexical Alignments of Form, Distributional Semantics and Grammatical Relations (SEMEVAL 2012)