Long Zhao
32 papers · 2015–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (10) π Conference Polyglot (9) π Renaissance Researcher (7) πΊοΈ Taxonomy Completionist (59)
πΊοΈ
Taxonomy Completionist
(59)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Grand Slam
π€
Dynamic Duo
(12)
π§¬
Topic Evolution
β‘
Prolific Year
(9)
π₯
Unstoppable
(8)
π
Century Club
(32)
π
Trend Setter
π
Conference Pioneer
ποΈ
Keyword Collector
(116)
β
The Questioner
Conferences
CVPR (10)
ECCV (5)
AAAI (4)
ICML (3)
NIPS (3)
ICCV (2)
IJCAI (2)
WACV (2)
ICLR (1)
Top co-authors
Keywords
contrastive learning
(4)
generative adversarial network
(3)
multimodal learning
(2)
object detection
(2)
representation learning
(2)
attention mechanism
(2)
knowledge distillation
(2)
missing modality
(2)
graph convolutional network
(2)
open-vocabulary object detection
(2)
catastrophic forgetting
(1)
domain generalization
(1)
maximum entropy
(1)
transfer learning
(1)
image generation
(1)
continual learning
(1)
transformer architecture
(1)
action recognition
(1)
feature learning
(1)
similarity learning
(1)
Papers
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering
ICML 2025
Epsilon-VAE: Denoising as Visual Decoding
ICML 2025
Steering Prototypes With Prompt-Tuning for Rehearsal-Free Continual Learning
WACV 2024
MINES: Message Intercommunication for Inductive Relation Reasoning over Neighbor-Enhanced Subgraphs
AAAI 2024
Sample-Level Cross-View Similarity Learning for Incomplete Multi-View Clustering
AAAI 2024
Generating Enhanced Negatives for Training Language-Based Object Detectors
CVPR 2024
Distilling Vision-Language Models on Millions of Videos
CVPR 2024
Taming Self-Training for Open-Vocabulary Object Detection
CVPR 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
ECCV 2024
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding
ICLR 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
ICML 2024
Unified Visual Relationship Detection with Vision and Language Models
ICCV 2023
More Than Just Attention: Improving Cross-Modal Attentions With Contrastive Constraints for Image-Text Matching
WACV 2023
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition
ICCV 2023
Are Multimodal Transformers Robust to Missing Modality?
CVPR 2022
Global Matching With Overlapping Attention for Optical Flow Estimation
CVPR 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection
ECCV 2022
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning
ECCV 2022
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
ECCV 2022
Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding
AAAI 2022
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
CVPR 2021
SMIL: Multimodal Learning with Severely Missing Modality
AAAI 2021
Improved Transformer for High-Resolution GANs
NIPS 2021
Learning to Learn Single Domain Generalization
CVPR 2020
Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness
NIPS 2020
Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge
CVPR 2020
Rethinking Kernel Methods for Node Representation Learning on Graphs
NIPS 2019
Semantic Graph Convolutional Networks for 3D Human Pose Regression
CVPR 2019
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation
ECCV 2018
CR-GAN: Learning Complete Representations for Multi-view Generation
IJCAI 2018
Bridging Saliency Detection to Weakly Supervised Object Detection Based on Self-Paced Curriculum Learning
IJCAI 2016
Object Proposal by Multi-Branch Hierarchical Segmentation
CVPR 2015