Alessandro Suglia
19 papers · 2020–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Cross-Pollinator (13)
π£
Hot Topic Early Bird
π
Conference Polyglot
(5)
π
Academic Marathon
(5)
π¬
Deep Specialist
(11)
π₯
Mega-Team
(20)
π₯
Unstoppable
(6)
β‘
Prolific Year
(8)
π
Century Club
(19)
β
The Questioner
(2)
ποΈ
Keyword Collector
(98)
Conferences
EMNLP (8)
ACL (5)
COLING (3)
NAACL (2)
EACL (1)
Top co-authors
Keywords
multimodal learning
(6)
embodied ai
(4)
vision language model
(4)
visual question answering
(4)
large language model
(4)
vision-language model
(3)
benchmark evaluation
(3)
robotic manipulation
(2)
zero-shot learning
(2)
text generation
(2)
diagnostic classifier
(2)
visual grounding
(2)
visual reasoning
(2)
model evaluation
(1)
prompt engineering
(1)
conversational ai
(1)
imitation learning
(1)
knowledge distillation
(1)
video understanding
(1)
direct preference optimization
(1)
Papers
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
EMNLP 2025
Playpen: An Environment for Exploring Learning From Dialogue Game Feedback
EMNLP 2025
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts
NAACL 2025
Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
EMNLP 2025
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
ACL 2025
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models
EMNLP 2024
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
EMNLP 2024
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding
EMNLP 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
EMNLP 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
ACL 2024
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
ACL 2024
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers
NAACL 2024
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks
COLING 2024
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion
EMNLP 2023
ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated Environments
COLING 2022
Combine to Describe: Evaluating Compositional Generalization in Image Captioning
ACL 2022
An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games
EACL 2021
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning
ACL 2020
Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games
COLING 2020