Sara Sarto
7 papers · 2023–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (6) π Cross-Pollinator (10) π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (22)
π§
Keyword Pioneer
Conferences
CVPR (2)
ACL (1)
ECCV (1)
ICCV (1)
IJCAI (1)
WACV (1)
Top co-authors
Keywords
image captioning
(3)
multimodal large language model
(2)
multimodal learning
(2)
feature extraction
(1)
video captioning
(1)
image editing
(1)
visual grounding
(1)
cross-modal retrieval
(1)
instruction following
(1)
semantic representation
(1)
prompt learning
(1)
visual recognition
(1)
semantic space
(1)
human judgment
(1)
document analysis
(1)
semantic information
(1)
vision language model
(1)
evaluation metric
(1)
multimodal document
(1)
hallucination detection
(1)
Papers
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios
WACV 2025
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
IJCAI 2025
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
CVPR 2025
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues
ECCV 2024
The Revolution of Multimodal Large Language Models: A Survey
ACL 2024
With a Little Help from Your Own Past: Prototypical Memory Networks for Image Captioning
ICCV 2023
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
CVPR 2023