conftrace_

Yaya Shi

8 papers · 2020–2025 · 6 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (15)

🗺️ Taxonomy Completionist (21) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 📈 Trend Setter

Conferences

COLING (2) CVPR (2) ACL (1) EMNLP (1) ICLR (1) ICML (1)

Top co-authors

Chunfeng Yuan (5) Bing Li (5) Weiming Hu (5) Haiyang Xu (5) Ji Zhang (4) Ming Yan (4) Fei Huang (4) Haowei Liu (3) Qinghao Ye (3) Chenliang Li (3)

Keywords

video captioning (2) benchmark evaluation (1) self-supervised learning (1) in-context learning (1) video understanding (1) visual representation (1) cross-modal retrieval (1) semantic alignment (1) instruction tuning (1) spatiotemporal modeling (1) cross-modal alignment (1) vision-language model (1) multimodal large language model (1) multi-image understanding (1) reference-free evaluation (1) video-text retrieval (1) multi-image reasoning (1) semantic grounding (1) fine-grained evaluation (1) visual language model (1)

Papers

iMOVE: Instance-Motion-Aware Video Understanding · ACL 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types · ICLR 2025
MIBench: Evaluating Multimodal Large Language Models over Multiple Images · EMNLP 2024
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training · COLING 2024
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval · COLING 2024
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video · ICML 2023
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching · CVPR 2022
Object Relational Graph With Teacher-Recommended Learning for Video Captioning · CVPR 2020