Gengyuan Zhang
9 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (14) πΊοΈ Taxonomy Completionist (24) π§ Keyword Pioneer π Conference Polyglot (5) π Renaissance Researcher (6)
π
Interdisciplinary Bridge
β‘
Prolific Year
(5)
β
The Questioner
Conferences
WACV (3)
CVPR (2)
EMNLP (2)
ACL (1)
ICCV (1)
Top co-authors
Keywords
vision-language model
(4)
large language model
(2)
video question answering
(2)
video understanding
(2)
image generation
(1)
domain adaptation
(1)
zero-shot learning
(1)
question answering
(1)
temporal modeling
(1)
multimodal learning
(1)
temporal reasoning
(1)
visual reasoning
(1)
graph representation
(1)
diffusion model
(1)
joint representation
(1)
entity embedding
(1)
data heterogeneity
(1)
text-to-image model
(1)
visual question answering
(1)
link prediction
(1)
Papers
Multimodal Pragmatic Jailbreak on Text-to-image Models
ACL 2025
Localizing Events in Videos with Multimodal Queries
CVPR 2025
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models
CVPR 2025
Perceive Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries
WACV 2025
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering
WACV 2025
Can Vision-Language Models Be a Good Guesser? Exploring VLMs for Times and Location Reasoning
WACV 2024
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
EMNLP 2024
Multi-Event Video-Text Retrieval
ICCV 2023
Time-dependent Entity Embedding is not All You Need: A Re-evaluation of Temporal Knowledge Graph Completion Models under a Unified Framework
EMNLP 2021