Alessandro Suglia

19 papers · 2020–2025 · 5 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (13)

🐣 Hot Topic Early Bird 🔬 Deep Specialist (11) 👥 Mega-Team (20) 🔥 Unstoppable (6) ⚡ Prolific Year (8) 💎 Century Club (19) ❓ The Questioner (2) 🗃️ Keyword Collector (98)

Conferences

EMNLP (8) ACL (5) COLING (3) NAACL (2) EACL (1)

Top co-authors

Oliver Lemon (9) Ioannis Konstas (8) Georgios Pantazopoulos (7) Arash Eshghi (6) Malvina Nikandrou (5) Amit Parekh (4) Raffaella Bernardi (4) Raquel Fernández (3) Andrea Vanzo (3) Alberto Testoni (3)

Keywords

multimodal learning (6) embodied ai (4) vision language model (4) visual question answering (4) large language model (4) vision-language model (3) benchmark evaluation (3) robotic manipulation (2) zero-shot learning (2) text generation (2) diagnostic classifier (2) visual grounding (2) visual reasoning (2) model evaluation (1) prompt engineering (1) conversational ai (1) imitation learning (1) knowledge distillation (1) video understanding (1) direct preference optimization (1)

Papers

FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks · EMNLP 2025
Playpen: An Environment for Exploring Learning From Dialogue Game Feedback · EMNLP 2025
CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts · NAACL 2025
Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests · EMNLP 2025
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks · ACL 2025
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models · EMNLP 2024
Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling · EMNLP 2024
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding · EMNLP 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks · EMNLP 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space · ACL 2024
Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation · ACL 2024
Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers · NAACL 2024
Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks · COLING 2024
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion · EMNLP 2023
ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated Environments · COLING 2022
Combine to Describe: Evaluating Compositional Generalization in Image Captioning · ACL 2022
An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games · EACL 2021
CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning · ACL 2020
Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games · COLING 2020