Zhaokai Wang
10 papers · 2021–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (6) π Renaissance Researcher (7) πΊοΈ Taxonomy Completionist (24) π§ Keyword Pioneer
π
Cross-Pollinator
(10)
π
Academic Marathon
(5)
π₯
Mega-Team
(29)
π
Century Club
(10)
ποΈ
Keyword Collector
(54)
Conferences
CVPR (3)
AAAI (2)
EMNLP (2)
ACL (1)
ICCV (1)
NIPS (1)
Top co-authors
Keywords
large language model
(3)
image generation
(2)
multimodal large language model
(2)
multimodal learning
(2)
object detection
(1)
foundation model
(1)
semantic segmentation
(1)
sparse reward
(1)
trajectory analysis
(1)
reward design
(1)
synthetic datum
(1)
vision language model
(1)
progressive learning
(1)
multi-modal large language model
(1)
vision-language model
(1)
mixture of expert
(1)
image captioning
(1)
model generalization
(1)
benchmark evaluation
(1)
catastrophic forgetting
(1)
Papers
TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation
AAAI 2026
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
ACL 2025
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
CVPR 2025
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
CVPR 2025
Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Spatial Reasoning
EMNLP 2025
Parameter-Inverted Image Pyramid Networks
NIPS 2024
ItiNera: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning
EMNLP 2024
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
CVPR 2024
Video Background Music Generation: Dataset, Method and Evaluation
ICCV 2023
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
AAAI 2021