Can Zhang
18 papers · 2021–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (39) π Conference Polyglot (9) π Interdisciplinary Bridge π Renaissance Researcher (6) π§ Keyword Pioneer
π
Conference Polyglot
(9)
π
Renaissance Researcher
(6)
π
Century Club
(17)
β‘
Prolific Year
(6)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(78)
Conferences
CVPR (6)
ICCV (3)
AAAI (2)
ACL (2)
ECCV (1)
EMNLP (1)
ICLR (1)
ICML (1)
IJCAI (1)
Top co-authors
Keywords
vision-language model
(3)
temporal action localization
(2)
video understanding
(2)
weakly-supervised learning
(2)
domain generalization
(2)
contrastive learning
(2)
benchmark evaluation
(2)
temporal localization
(2)
video grounding
(2)
visual question answering
(1)
prototype learning
(1)
label propagation
(1)
semi-supervised learning
(1)
video captioning
(1)
scene understanding
(1)
test-time adaptation
(1)
pose estimation
(1)
depth estimation
(1)
weakly supervised learning
(1)
anomaly detection
(1)
Papers
GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs
ACL 2026
econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians
ICLR 2025
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
CVPR 2025
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments
CVPR 2025
FedSMU: Communication-Efficient and Generalization-Enhanced Federated Learning through Symbolic Model Updates
ICML 2025
Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry
AAAI 2025
Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models
ICCV 2025
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
ACL 2024
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
CVPR 2024
GeT: Generative Target Structure Debiasing for Domain Adaptation
ICCV 2023
Iterative Proposal Refinement for Weakly-Supervised Video Grounding
CVPR 2023
Unsupervised Feature Representation Learning for Domain-generalized Cross-domain Image Retrieval
ICCV 2023
LocVTP: Video-Text Pre-training for Temporal Localization
ECCV 2022
Unsupervised Pre-Training for Temporal Action Localization Tasks
CVPR 2022
On Pursuit of Designing Multi-modal Transformer for Video Grounding
EMNLP 2021
Non-Autoregressive Coarse-to-Fine Video Captioning
AAAI 2021
RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection
IJCAI 2021
CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning
CVPR 2021