Yuanhan Zhang
13 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Renaissance Researcher (5) π Interdisciplinary Bridge π Academic Marathon (5) π Conference Polyglot (6) πΊοΈ Taxonomy Completionist (22)
π
Academic Marathon
(5)
π
Cross-Pollinator
(15)
π
Renaissance Researcher
(5)
π₯
Mega-Team
(22)
π€
Dynamic Duo
(10)
π
Century Club
(12)
β‘
Prolific Year
(5)
β
The Questioner
(2)
Conferences
ECCV (5)
CVPR (2)
NAACL (2)
ACL (1)
ICCV (1)
ICLR (1)
NIPS (1)
Top co-authors
Keywords
multimodal learning
(5)
video understanding
(3)
video question answering
(2)
benchmark evaluation
(2)
large language model
(2)
video generation
(1)
egocentric vision
(1)
human perception
(1)
question answering
(1)
few-shot learning
(1)
in-context learning
(1)
generative model
(1)
visual reasoning
(1)
efficient computing
(1)
preference modeling
(1)
direct preference optimization
(1)
diffusion model
(1)
large multimodal model
(1)
reward model
(1)
adversarial robustness
(1)
Papers
Video-MMMU: Evaluating Knowledge Acquisition from Multidisciplinary Professional Videos
ACL 2026
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
NAACL 2025
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding
ICCV 2025
LLaVA-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
ICLR 2025
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models
NAACL 2025
EgoLife: Towards Egocentric Life Assistant
CVPR 2025
FunQA: Towards Surprising Video Comprehension
ECCV 2024
VBench: Comprehensive Benchmark Suite for Video Generative Models
CVPR 2024
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
ECCV 2024
MMBENCH: Is Your Multi-Modal Model an All-around Player?
ECCV 2024
What Makes Good Examples for Visual In-Context Learning?
NIPS 2023
Benchmarking Omni-Vision Representation through the Lens of Visual Realms
ECCV 2022
CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations
ECCV 2020