Xilin Wei
4 papers · 2023–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (2) π Cross-Pollinator (8)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(13)
β
The Questioner
Conferences
NIPS (3)
ICML (1)
Top co-authors
Keywords
multimodal learning
(2)
large language model
(2)
video captioning
(1)
multi-modal learning
(1)
video understanding
(1)
instruction tuning
(1)
benchmark dataset
(1)
vision-language model
(1)
large vision-language model
(1)
multi-turn dialogue
(1)
multi-image understanding
(1)
chain-of-thought prompting
(1)
dense caption
(1)
large video-language model
(1)
text-to-video model
(1)
temporal description
(1)
dialogue system
(1)
step-by-step reasoning
(1)
tool-augmented reasoning
(1)
math reasoning
(1)
Papers
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
ICML 2025
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs
NIPS 2024
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
NIPS 2024
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning
NIPS 2023