Sandro Pezzelle
35 papers · 2016–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Academic Marathon (9) π Conference Polyglot (6) π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird
π
Renaissance Researcher
(6)
π
Conference Polyglot
(6)
π
Academic Marathon
(9)
π
Keyword Trendsetter Combo
(4)
π₯
Mega-Team
(20)
π€
Dynamic Duo
(16)
π¬
Deep Specialist
(12)
π§¬
Topic Evolution
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(163)
β‘
Prolific Year
(5)
π₯
Unstoppable
(10)
π
Century Club
(33)
β
The Questioner
(4)
Conferences
ACL (12)
EMNLP (12)
EACL (5)
NAACL (3)
IJCNLP (2)
COLING (1)
Top co-authors
Keywords
multimodal learning
(13)
visual grounding
(5)
large language model
(5)
image captioning
(4)
relational reasoning
(4)
human evaluation
(2)
visual scene understanding
(2)
dialogue system
(2)
lexical semantics
(2)
cross-modal representation
(2)
pre-trained language model
(2)
multimodal model
(2)
cognitive modeling
(2)
cross-modal alignment
(2)
semantic analysis
(2)
visual storytelling
(2)
referring expression
(2)
visual reasoning
(2)
visual context
(2)
vision-language model
(2)
Papers
Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
EACL 2026
Vision-Language Models Align with Human Neural Representations in Concept Processing
EACL 2026
They want to pretend not to understand: The Limits of Current LLMs in Interpreting Implicit Content of Political Discourse
ACL 2025
If I feel smart, I will do the right thing: Combining Complementary Multimodal Information in Visual Language Models
COLING 2025
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions
ACL 2025
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
ACL 2025
Detecting and Translating Language Ambiguity with Multilingual LLMs
EMNLP 2024
Naming, Describing, and Quantifying Visual Objects in Humans and LLMs
ACL 2024
Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!
ACL 2024
Describing Images Fast and Slow: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes
EACL 2024
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
EMNLP 2024
The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models
EMNLP 2023
Dealing with Semantic Underspecification in Multimodal NLP
ACL 2023
Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind
ACL 2023
A Psycholinguistic Analysis of BERTβs Representations of Compounds
EACL 2023
GROOViST: A Metric for Grounding Objects in Visual Storytelling
EMNLP 2023
When Language Models Fall in Love: Animacy Processing in Transformer Language Models
EMNLP 2023
Less Descriptive yet Discriminative: Quantifying the Properties of Multimodal Referring Utterances via CLIP
ACL 2022
Controllable Text Generation for All Ages: Evaluating a Plug-and-Play Approach to Age-Adapted Dialogue
EMNLP 2022
Probing Cross-Modal Representations in Multi-Step Relational Reasoning
IJCNLP 2021
Probing Cross-Modal Representations in Multi-Step Relational Reasoning
ACL 2021
EaSe: A Diagnostic Tool for VQA based on Answer Diversity
NAACL 2021
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze
EMNLP 2020
Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts
EMNLP 2020
Be Different to Be Better! A Benchmark to Leverage the Complementarity of Language and Vision
EMNLP 2020
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN)
EMNLP 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts
EMNLP 2019
Big Generalizations with Small Data: Exploring the Role of Training Samples in Learning Adjectives of Size
EMNLP 2019
Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts
IJCNLP 2019
Quantifiers in a Multimodal World: Hallucinating Vision with Language and Sound
NAACL 2019
Comparatives, Quantifiers, Proportions: a Multi-Task Model for the Learning of Quantities from Vision
NAACL 2018
Some of Them Can be Guessed! Exploring the Effect of Linguistic Context in Predicting Quantifiers
ACL 2018
FOIL it! Find One mismatch between Image and Language caption
ACL 2017
Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers from Vision
EACL 2017
The LAMBADA dataset: Word prediction requiring a broad discourse context
ACL 2016