Yaya Shi
8 papers · 2020–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Conference Polyglot (6) π Academic Marathon (5) π Renaissance Researcher (5) π Interdisciplinary Bridge π Cross-Pollinator (15)
πΊοΈ
Taxonomy Completionist
(21)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Trend Setter
Conferences
COLING (2)
CVPR (2)
ACL (1)
EMNLP (1)
ICLR (1)
ICML (1)
Top co-authors
Keywords
video captioning
(2)
benchmark evaluation
(1)
self-supervised learning
(1)
in-context learning
(1)
video understanding
(1)
visual representation
(1)
cross-modal retrieval
(1)
semantic alignment
(1)
instruction tuning
(1)
spatiotemporal modeling
(1)
cross-modal alignment
(1)
vision-language model
(1)
multimodal large language model
(1)
multi-image understanding
(1)
reference-free evaluation
(1)
video-text retrieval
(1)
multi-image reasoning
(1)
semantic grounding
(1)
fine-grained evaluation
(1)
visual language model
(1)
Papers
iMOVE : Instance-Motion-Aware Video Understanding
ACL 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
ICLR 2025
MIBench: Evaluating Multimodal Large Language Models over Multiple Images
EMNLP 2024
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
COLING 2024
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval
COLING 2024
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
ICML 2023
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
CVPR 2022
Object Relational Graph With Teacher-Recommended Learning for Video Captioning
CVPR 2020