Long Zhao

32 papers · 2015–2025 · 9 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10) 🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (59)

🗺️ Taxonomy Completionist (59) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Grand Slam 🤝 Dynamic Duo (12) 🧬 Topic Evolution ⚡ Prolific Year (9) 🔥 Unstoppable (8) 💎 Century Club (32) 📈 Trend Setter 🚀 Conference Pioneer 🗃️ Keyword Collector (116) ❓ The Questioner

Conferences

CVPR (10) ECCV (5) AAAI (4) ICML (3) NIPS (3) ICCV (2) IJCAI (2) WACV (2) ICLR (1)

Top co-authors

Ting Liu (12) Xi Peng (11) Dimitris N. Metaxas (10) Dimitris Metaxas (6) Liangzhe Yuan (6) Yu Tian (5) Hartwig Adam (5) Boqing Gong (5) Florian Schroff (5) Mubbasir Kapadia (4)

Keywords

contrastive learning (4) generative adversarial network (3) multimodal learning (2) object detection (2) representation learning (2) attention mechanism (2) knowledge distillation (2) missing modality (2) graph convolutional network (2) open-vocabulary object detection (2) catastrophic forgetting (1) domain generalization (1) maximum entropy (1) transfer learning (1) image generation (1) continual learning (1) transformer architecture (1) action recognition (1) feature learning (1) similarity learning (1)

Papers

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering ICML 2025 Epsilon-VAE: Denoising as Visual Decoding ICML 2025 Steering Prototypes With Prompt-Tuning for Rehearsal-Free Continual Learning WACV 2024 MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs AAAI 2024 Sample-Level Cross-View Similarity Learning for Incomplete Multi-View Clustering AAAI 2024 Generating Enhanced Negatives for Training Language-Based Object Detectors CVPR 2024 Distilling Vision-Language Models on Millions of Videos CVPR 2024 Taming Self-Training for Open-Vocabulary Object Detection CVPR 2024 Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models ECCV 2024 Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding ICLR 2024 VideoPrism: A Foundational Visual Encoder for Video Understanding ICML 2024 Unified Visual Relationship Detection with Vision and Language Models ICCV 2023 More Than Just Attention: Improving Cross-Modal Attentions With Contrastive Constraints for Image-Text Matching WACV 2023 Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition ICCV 2023 Are Multimodal Transformers Robust to Missing Modality? CVPR 2022 Global Matching With Overlapping Attention for Optical Flow Estimation CVPR 2022 Exploiting Unlabeled Data with Vision and Language Models for Object Detection ECCV 2022 Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning ECCV 2022 COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality ECCV 2022 Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding AAAI 2022 Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization CVPR 2021 SMIL: Multimodal Learning with Severely Missing Modality AAAI 2021 Improved Transformer for High-Resolution GANs NIPS 2021 Learning to Learn Single Domain Generalization CVPR 2020 Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness NIPS 2020 Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge CVPR 2020 Rethinking Kernel Methods for Node Representation Learning on Graphs NIPS 2019 Semantic Graph Convolutional Networks for 3D Human Pose Regression CVPR 2019 Learning to Forecast and Refine Residual Motion for Image-to-Video Generation ECCV 2018 CR-GAN: Learning Complete Representations for Multi-view Generation IJCAI 2018 Bridging Saliency Detection to Weakly Supervised Object Detection Based on Self-Paced Curriculum Learning IJCAI 2016 Object Proposal by Multi-Branch Hierarchical Segmentation CVPR 2015