Zhiding Yu

62 papers · 2014–2026 · 9 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (20) 🤝 Dynamic Duo (24) 👑 Triple Crown 🧬 Topic Evolution 🔥 Unstoppable (13) 🚀 Conference Pioneer ⚡ Prolific Year (8) 📈 Trend Setter 🗃️ Keyword Collector (224) ❓ The Questioner (3) 💎 Century Club (62)

Conferences

CVPR (20) NIPS (11) ICML (8) ICCV (7) ECCV (6) ICLR (5) IJCAI (2) WACV (2) EMNLP (1)

Top co-authors

Anima Anandkumar (24) Jose M. Alvarez (18) Jan Kautz (14) Chaowei Xiao (10) De-An Huang (9) Shiyi Lan (9) Weiyang Liu (9) Animashree Anandkumar (7) Weili Nie (7) Yuke Zhu (6)

Research topics

Differential Privacy (1) Core AI (1)

Keywords

semantic segmentation (9) convolutional neural network (8) object detection (8) instance segmentation (6) self-supervised learning (5) adversarial robustness (4) representation learning (4) autonomous driving (3) weakly supervised learning (3) domain adaptation (3) few-shot learning (3) visual reasoning (3) feature learning (3) neural network (3) vision transformer (3) domain generalization (3) feature embedding (3) metric learning (2) depth estimation (2) trajectory prediction (2)

Papers

GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark WACV 2026 T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching ICLR 2025 Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training ICCV 2025 RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression ICML 2025 Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought CVPR 2025 OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning CVPR 2025 Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders ICLR 2025 Neural Eulerian Scene Flow Fields ICLR 2025 Improving Distant 3D Object Detection Using 2D Box Supervision CVPR 2024 Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving? CVPR 2024 LITA: Language Instructed Temporal-Localization Assistant ECCV 2024 A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties ECCV 2024 Memorize What Matters: Emergent Scene Decomposition from Multitraverse NIPS 2024 Differentially Private Video Activity Recognition WACV 2024 Learning Calibrated Uncertainties for Domain Shift: A Distributionally Robust Learning Approach IJCAI 2023 A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification ICML 2023 Vision Transformers Are Good Mask Auto-Labelers CVPR 2023 VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion CVPR 2023 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning EMNLP 2023 Fully Attentional Networks with Self-emerging Token Labeling ICCV 2023 End-to-end 3D Tracking with Decoupled Queries ICCV 2023 FB-BEV: BEV Representation from Forward-Backward View Transformations ICCV 2023 FocalFormer3D: Focusing on Hard Instance for 3D Object Detection ICCV 2023 Panoptic SegFormer: Delving Deeper Into Panoptic Segmentation With Transformers CVPR 2022 How Much More Data Do I Need? Estimating Requirements for Downstream Tasks CVPR 2022 MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training NIPS 2022 Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models NIPS 2022 RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ICLR 2022 Understanding The Robustness in Vision Transformers ICML 2022 Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions CVPR 2022 FreeSOLO: Learning To Segment Objects Without Annotations CVPR 2022 CoordGAN: Self-Supervised Dense Correspondences Emerge From GANs CVPR 2022 Not All Labels Are Equal: Rationalizing the Labeling Costs for Training Object Detection CVPR 2022 Coupled Segmentation and Edge Learning via Dynamic Graph Propagation NIPS 2021 AugMax: Adversarial Composition of Random Augmentations for Robust Training NIPS 2021 DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence From Box Supervision ICCV 2021 Contrastive Syn-to-Real Generalization ICLR 2021 Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions NIPS 2021 SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers NIPS 2021 SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies ICML 2021 Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection ICML 2021 Neural Networks with Recurrent Generative Feedback NIPS 2020 UFO²: A Unified Framework towards Omni-supervised Object Detection ECCV 2020 Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning NIPS 2020 Automated Synthetic-to-Real Generalization ICML 2020 Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification ECCV 2020 Angular Visual Hardness ICML 2020 Regularizing Neural Networks via Minimizing Hyperspherical Energy CVPR 2020 Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection CVPR 2020 Confidence Regularized Self-Training ICCV 2019 Joint Discriminative and Generative Learning for Person Re-Identification CVPR 2019 Learning towards Minimum Hyperspherical Energy NIPS 2018 Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training ECCV 2018 Simultaneous Edge Alignment and Learning ECCV 2018 Learning Strict Identity Mappings in Deep Residual Networks CVPR 2018 Decoupled Networks CVPR 2018 CASENet: Deep Category-Aware Semantic Edge Detection CVPR 2017 Deep Hyperspherical Learning NIPS 2017 SphereFace: Deep Hypersphere Embedding for Face Recognition CVPR 2017 Large-Margin Softmax Loss for Convolutional Neural Networks ICML 2016 Generalized Transitive Distance with Minimum Spanning Random Forest IJCAI 2015 Transitive Distance Clustering with K-Means Duality CVPR 2014