Zhecan Wang
15 papers · 2017–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (8) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12)
🌈
Renaissance Researcher
(7)
🌍
Conference Polyglot
(8)
🏃
Academic Marathon
(8)
🤝
Dynamic Duo
(11)
🧬
Topic Evolution
💎
Century Club
(15)
📈
Trend Setter
❓
The Questioner
🚀
Conference Pioneer
🗃️
Keyword Collector
(71)
🔥
Unstoppable
(6)
Conferences
EMNLP (4)
ACL (2)
ECCV (2)
NAACL (2)
NIPS (2)
AAAI (1)
ICLR (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(3)
visual question answering
(3)
vision-language model
(3)
visual reasoning
(2)
visual commonsense
(2)
image generation
(2)
commonsense reasoning
(2)
visual commonsense reasoning
(2)
vision language model
(2)
zero-shot learning
(2)
adversarial training
(1)
face recognition
(1)
cross-modal learning
(1)
transfer learning
(1)
benchmark evaluation
(1)
generative adversarial network
(1)
image captioning
(1)
graph transformer
(1)
commonsense knowledge
(1)
3d face model
(1)
Papers
PuzzleGPT: Emulating Human Puzzle-Solving Ability for Time and Location Prediction
NAACL 2025
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
ICLR 2024
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images
NIPS 2024
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning
ECCV 2024
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
ACL 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
EMNLP 2023
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
EMNLP 2023
SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning
AAAI 2022
Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks
ACL 2022
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense
EMNLP 2022
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding
EMNLP 2022
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
NAACL 2021
Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild
WACV 2020
Learning Visual Commonsense for Robust Scene Graph Generation
ECCV 2020
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis
NIPS 2017