Zhecan Wang

15 papers · 2017–2025 · 8 top CS/AI conferences

Achievements

🏃 Academic Marathon (8) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12)

🌈 Renaissance Researcher (7) 🤝 Dynamic Duo (11) 🧬 Topic Evolution 💎 Century Club (15) 📈 Trend Setter ❓ The Questioner 🚀 Conference Pioneer 🗃️ Keyword Collector (71) 🔥 Unstoppable (6)

Conferences

EMNLP (4) ACL (2) ECCV (2) NAACL (2) NIPS (2) AAAI (1) ICLR (1) WACV (1)

Top co-authors

Shih-Fu Chang (11) Haoxuan You (11) Kai-Wei Chang (9) Rui Sun (4) Hammad Ayyubi (3) Wenhao Li (3) Alireza Zareian (3) Noel Codella (2) Yicheng He (2) Liunian Harold Li (2)

Keywords

multimodal learning (3) visual question answering (3) vision-language model (3) visual reasoning (2) visual commonsense (2) image generation (2) commonsense reasoning (2) visual commonsense reasoning (2) vision language model (2) zero-shot learning (2) adversarial training (1) face recognition (1) cross-modal learning (1) transfer learning (1) benchmark evaluation (1) generative adversarial network (1) image captioning (1) graph transformer (1) commonsense knowledge (1) 3D face model (1)

Papers

PuzzleGPT: Emulating Human Puzzle-Solving Ability for Time and Location Prediction · NAACL 2025
CoBIT: A Contrastive Bi-directional Image-Text Generation Model · ICLR 2024
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images · NIPS 2024
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning · ECCV 2024
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding · ACL 2023
Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond · EMNLP 2023
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models · EMNLP 2023
SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning · AAAI 2022
Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks · ACL 2022
Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense · EMNLP 2022
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding · EMNLP 2022
Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions · NAACL 2021
Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild · WACV 2020
Learning Visual Commonsense for Robust Scene Graph Generation · ECCV 2020
Dual-Agent GANs for Photorealistic and Identity Preserving Profile Face Synthesis · NIPS 2017