Spandana Gella
33 papers · 2012–2026 · 8 top CS/AI conferences
Achievements
Conference Polyglot
(8)
Keyword Pioneer
Academic Marathon
(13)
Taxonomy Completionist
(52)
Hot Topic Early Bird
Interdisciplinary Bridge
Deep Specialist
(13)
Topic Evolution
Mega-Team
(39)
Topic Pioneer
Keyword Collector
(107)
Trend Setter
Century Club
(31)
Unstoppable
(10)
Prolific Year
(5)
Conferences
EMNLP (13)
ACL (8)
NAACL (4)
EACL (3)
ICML (2)
AAAI (1)
ICLR (1)
SEMEVAL (1)
Keywords
multimodal learning
(8)
embodied ai
(3)
referring expression
(2)
contrastive learning
(2)
visual grounding
(2)
multimodal large language model
(2)
few-shot learning
(2)
dialogue safety
(2)
dialogue system
(2)
in-context learning
(2)
image retrieval
(1)
question answering
(1)
prompt engineering
(1)
dialogue generation
(1)
natural language generation
(1)
domain adaptation
(1)
multi-task learning
(1)
code generation
(1)
knowledge distillation
(1)
bias detection
(1)
Papers
StarFlow: Generating Structured Workflow Outputs From Sketch Images
EACL 2026
Multimodal Large Language Models for Human-AI Interaction: Foundations, Agents, and Inclusive Applications
EACL 2026
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction
ICML 2025
BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks
ICLR 2025
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
EMNLP 2025
ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval
EMNLP 2025
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
EMNLP 2025
SafeArena: Evaluating the Safety of Autonomous Web Agents
ICML 2025
PG-Story: Taxonomy, Dataset, and Evaluation for Ensuring Child-Safe Content for Story Generation
EMNLP 2024
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue
EMNLP 2023
Using In-Context Learning to Improve Dialogue Safety
EMNLP 2023
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
EMNLP 2023
ALFRED-L: Investigating the Role of Language for Action Learning in Interactive Visual Environments
EMNLP 2022
Analyzing the Limits of Self-Supervision in Handling Bias in Language
EMNLP 2022
TEACh: Task-Driven Embodied Agents That Chat
AAAI 2022
Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions
EMNLP 2021
Words Aren't Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions
ACL 2020
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)
ACL 2019
Cross-lingual Visual Verb Sense Disambiguation
NAACL 2019
Proceedings of the Second Workshop on Shortcomings in Vision and Language
NAACL 2019
Multimodal Abstractive Summarization for How2 Videos
ACL 2019
Neural Word Decomposition Models for Abusive Language Detection
ACL 2019
A Dataset for Telling the Stories of Social Media Videos
EMNLP 2018
Proceedings of the Third Workshop on Representation Learning for NLP
ACL 2018
An Evaluation of Image-Based Verb Prediction Models against Human Eye-Tracking Data
NAACL 2018
Proceedings of ACL 2017, Student Research Workshop
ACL 2017
Image Pivoting for Learning Multilingual Multimodal Representations
EMNLP 2017
An Analysis of Action Recognition Datasets for Language and Vision Tasks
ACL 2017
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings
NAACL 2016
Learning Word Sense Distributions, Detecting Unattested Senses and Identifying Novel Senses Using Topic Models
ACL 2014
POS Tagging of English-Hindi Code-Mixed Social Media Content
EMNLP 2014
One Sense per Tweeter ... and Other Lexical Semantic Tales of Twitter
EACL 2014
DSS: Text Similarity Using Lexical Alignments of Form, Distributional Semantics and Grammatical Relations
SEMEVAL 2012