Can Zhang

18 papers · 2021–2026 · 9 conferences · across top CS/AI conferences

Achievements

+6 more ↓

🗺️ Taxonomy Completionist (39) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🧭 Keyword Pioneer

🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (6) 💎 Century Club (17) ⚡ Prolific Year (6) 🔥 Unstoppable (5) 🗃️ Keyword Collector (78)

Conferences

CVPR (6) ICCV (3) AAAI (2) ACL (2) ECCV (1) EMNLP (1) ICLR (1) ICML (1) IJCAI (1)

Top co-authors

Yuexian Zou (7) Meng Cao (7) Gim Hee Lee (4) Long Chen (3) Jie Chen (2) Dongming Yang (2) Junwu Weng (2) Hao Zhang (2) Jue Wang (2) Tianyu Yang (2)

Keywords

vision-language model (3) temporal action localization (2) video understanding (2) weakly-supervised learning (2) domain generalization (2) contrastive learning (2) benchmark evaluation (2) temporal localization (2) video grounding (2) visual question answering (1) prototype learning (1) label propagation (1) semi-supervised learning (1) video captioning (1) scene understanding (1) test-time adaptation (1) pose estimation (1) depth estimation (1) weakly supervised learning (1) anomaly detection (1)

Papers

GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs ACL 2026 econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians ICLR 2025 CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models CVPR 2025 IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments CVPR 2025 FedSMU: Communication-Efficient and Generalization-Enhanced Federated Learning through Symbolic Model Updates ICML 2025 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry AAAI 2025 Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models ICCV 2025 RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter ACL 2024 Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly CVPR 2024 GeT: Generative Target Structure Debiasing for Domain Adaptation ICCV 2023 Iterative Proposal Refinement for Weakly-Supervised Video Grounding CVPR 2023 Unsupervised Feature Representation Learning for Domain-generalized Cross-domain Image Retrieval ICCV 2023 LocVTP: Video-Text Pre-training for Temporal Localization ECCV 2022 Unsupervised Pre-Training for Temporal Action Localization Tasks CVPR 2022 On Pursuit of Designing Multi-modal Transformer for Video Grounding EMNLP 2021 Non-Autoregressive Coarse-to-Fine Video Captioning AAAI 2021 RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection IJCAI 2021 CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning CVPR 2021