Zhiding Yu
62 papers · 2014–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏠
Conference Loyalist
(20)
🤝
Dynamic Duo
(24)
👑
Triple Crown
🧬
Topic Evolution
🔥
Unstoppable
(13)
🚀
Conference Pioneer
⚡
Prolific Year
(8)
📈
Trend Setter
🗃️
Keyword Collector
(224)
❓
The Questioner
(3)
💎
Century Club
(62)
Conferences
CVPR (20)
NIPS (11)
ICML (8)
ICCV (7)
ECCV (6)
ICLR (5)
IJCAI (2)
WACV (2)
EMNLP (1)
Top co-authors
Research topics
Keywords
semantic segmentation
(9)
convolutional neural network
(8)
object detection
(8)
instance segmentation
(6)
self-supervised learning
(5)
adversarial robustness
(4)
representation learning
(4)
autonomous driving
(3)
weakly supervised learning
(3)
domain adaptation
(3)
few-shot learning
(3)
visual reasoning
(3)
feature learning
(3)
neural network
(3)
vision transformer
(3)
domain generalization
(3)
feature embedding
(3)
metric learning
(2)
depth estimation
(2)
trajectory prediction
(2)
Papers
GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark
WACV 2026
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
ICLR 2025
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training
ICCV 2025
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
ICML 2025
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
CVPR 2025
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
CVPR 2025
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025
Neural Eulerian Scene Flow Fields
ICLR 2025
Improving Distant 3D Object Detection Using 2D Box Supervision
CVPR 2024
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
CVPR 2024
LITA: Language Instructed Temporal-Localization Assistant
ECCV 2024
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
ECCV 2024
Memorize What Matters: Emergent Scene Decomposition from Multitraverse
NIPS 2024
Differentially Private Video Activity Recognition
WACV 2024
Learning Calibrated Uncertainties for Domain Shift: A Distributionally Robust Learning Approach
IJCAI 2023
A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification
ICML 2023
Vision Transformers Are Good Mask Auto-Labelers
CVPR 2023
VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion
CVPR 2023
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
EMNLP 2023
Fully Attentional Networks with Self-emerging Token Labeling
ICCV 2023
End-to-end 3D Tracking with Decoupled Queries
ICCV 2023
FB-BEV: BEV Representation from Forward-Backward View Transformations
ICCV 2023
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection
ICCV 2023
Panoptic SegFormer: Delving Deeper Into Panoptic Segmentation With Transformers
CVPR 2022
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks
CVPR 2022
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
NIPS 2022
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
NIPS 2022
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
ICLR 2022
Understanding The Robustness in Vision Transformers
ICML 2022
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions
CVPR 2022
FreeSOLO: Learning To Segment Objects Without Annotations
CVPR 2022
CoordGAN: Self-Supervised Dense Correspondences Emerge From GANs
CVPR 2022
Not All Labels Are Equal: Rationalizing the Labeling Costs for Training Object Detection
CVPR 2022
Coupled Segmentation and Edge Learning via Dynamic Graph Propagation
NIPS 2021
AugMax: Adversarial Composition of Random Augmentations for Robust Training
NIPS 2021
DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence From Box Supervision
ICCV 2021
Contrastive Syn-to-Real Generalization
ICLR 2021
Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions
NIPS 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
NIPS 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
ICML 2021
Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection
ICML 2021
Neural Networks with Recurrent Generative Feedback
NIPS 2020
UFO²: A Unified Framework towards Omni-supervised Object Detection
ECCV 2020
Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning
NIPS 2020
Automated Synthetic-to-Real Generalization
ICML 2020
Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification
ECCV 2020
Angular Visual Hardness
ICML 2020
Regularizing Neural Networks via Minimizing Hyperspherical Energy
CVPR 2020
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection
CVPR 2020
Confidence Regularized Self-Training
ICCV 2019
Joint Discriminative and Generative Learning for Person Re-Identification
CVPR 2019
Learning towards Minimum Hyperspherical Energy
NIPS 2018
Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training
ECCV 2018
Simultaneous Edge Alignment and Learning
ECCV 2018
Learning Strict Identity Mappings in Deep Residual Networks
CVPR 2018
Decoupled Networks
CVPR 2018
CASENet: Deep Category-Aware Semantic Edge Detection
CVPR 2017
Deep Hyperspherical Learning
NIPS 2017
SphereFace: Deep Hypersphere Embedding for Face Recognition
CVPR 2017
Large-Margin Softmax Loss for Convolutional Neural Networks
ICML 2016
Generalized Transitive Distance with Minimum Spanning Random Forest
IJCAI 2015
Transitive Distance Clustering with K-Means Duality
CVPR 2014