Weihan Wang
10 papers · 2023–2026 · 6 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (5)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (14)
Keyword Pioneer
Cross-Pollinator (13)
Conferences
CVPR (3)
ICCV (2)
ICLR (2)
ACL (1)
ECCV (1)
NIPS (1)
Keywords
visual question answering (3)
visual language model (2)
multimodal large language model (2)
video question answering (2)
multimodal learning (1)
image captioning (1)
video understanding (1)
visual grounding (1)
stereo matching (1)
vision language model (1)
multi-modal large language model (1)
vision-language model (1)
uncertainty estimation (1)
vision-language pre-training (1)
soft label (1)
context window (1)
long video understanding (1)
video benchmark (1)
masked language modeling (1)
image-text matching (1)
Papers
Glyph: Scaling Context Windows via Visual-Text Compression
ACL 2026
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models
CVPR 2025
CogVLM: Visual Expert for Pretrained Language Models
NIPS 2024
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
ECCV 2024
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation
ICCV 2023
Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation
CVPR 2023