Nanning Zheng

99 papers · 2013–2026 · 13 top CS/AI conferences

Achievements

🏃 Academic Marathon (12) 🌍 Conference Polyglot (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (21) 🤝 Dynamic Duo (14) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (18) 🧬 Topic Evolution 🚀 Conference Pioneer ⚡ Prolific Year (15) ❓ The Questioner (5) 📈 Trend Setter 🗃️ Keyword Collector (384) 🔥 Unstoppable (11) 💎 Century Club (96)

Conferences

CVPR (21) NIPS (18) AAAI (16) ICCV (16) ECCV (8) ICML (5) ICLR (4) ACL (3) IJCAI (3) EMNLP (2) IJCNLP (1) MICCAI (1) RSS (1)

Top co-authors

Le Wang (14) Sanping Zhou (14) Gang Hua (12) Ping Wei (9) Shengnan An (8) Jian Sun (8) Jian-Guang Lou (8) Zeqi Lin (8) Yongqiang Ma (7) Wei Tang (7)

Keywords

object detection (9) semantic segmentation (7) video understanding (7) metric learning (7) neural network (7) feature extraction (6) temporal action localization (6) person re-identification (6) attention mechanism (5) action recognition (5) knowledge distillation (4) convolutional neural network (4) 3d object detection (4) large language model (4) zero-shot learning (3) weakly supervised learning (3) vision-language model (3) transfer learning (3) trajectory prediction (3) few-shot learning (3)

Papers

EVOKE: Efficient and High-Fidelity EEG-to-Video Reconstruction via Decoupling Implicit Neural Representation · AAAI 2026
Think before Go: Hierarchical Reasoning for Image-goal Navigation · ACL 2026
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space · AAAI 2026
ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images · CVPR 2025
DAMap: Distance-aware MapNet for High Quality HD Map Construction · ICCV 2025
See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI · AAAI 2025
On the Statistical Mechanisms of Distributional Compositional Generalization · ICML 2025
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers · ICCV 2025
Beyond Brain Decoding: Visual-Semantic Reconstructions to Mental Creation Extension Based on fMRI · ICCV 2025
Unveiling Multi-View Anomaly Detection: Intra-view Decoupling and Inter-view Fusion · AAAI 2025
Beyond Single-Modal Boundary: Cross-Modal Anomaly Detection through Visual Prototype and Harmonization · CVPR 2025
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning · CVPR 2025
MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning · ECCV 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis · CVPR 2024
Make Your LLM Fully Utilize the Context · NIPS 2024
Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement · NIPS 2024
Molecule Design by Latent Prompt Transformer · NIPS 2024
TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning · NIPS 2024
Neural P$^3$M: A Long-Range Interaction Modeling Enhancer for Geometric GNNs · NIPS 2024
Breaking through the learning plateaus of in-context learning in Transformer · ICML 2024
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection · ICLR 2024
Class-aware Mutual Mixup with Triple Alignments for Semi-Supervised Cross-domain Segmentation · MICCAI 2024
Self-Consistency Training for Density-Functional-Theory Hamiltonian Prediction · ICML 2024
Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection · AAAI 2024
GSO-Net: Grid Surface Optimization via Learning Geometric Constraints · AAAI 2024
IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance · AAAI 2024
Can LLMs Learn From Mistakes? An Empirical Study on Reasoning Tasks · EMNLP 2024
PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation · ECCV 2024
AugDETR: Improving Multi-scale Learning for Detection Transformer · ECCV 2024
Skill-Based Few-Shot Selection for In-Context Learning · EMNLP 2023
Geometric Transformer with Interatomic Positional Encoding · NIPS 2023
Closing the gap between the upper bound and lower bound of Adam's iteration complexity · NIPS 2023
StructVPR: Distill Structural Knowledge With Weighting Samples for Visual Place Recognition · CVPR 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework · ICLR 2023
DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection · ICLR 2023
Learning Trajectories are Generalization Indicators · NIPS 2023
How Do In-Context Examples Affect Compositional Generalization? · ACL 2023
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models · NIPS 2023
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering · CVPR 2023
DETR Does Not Need Multi-Scale or Locality Design · ICCV 2023
Inverse Compositional Learning for Weakly-supervised Relation Grounding · ICCV 2023
Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning · ICML 2022
Could Giant Pre-trained Image Models Extract Universal Representations? · NIPS 2022
Visual Concepts Tokenization · NIPS 2022
Construct Effective Geometry Aware Feature Pyramid Network for Multi-Scale Object Detection · AAAI 2022
Social Interpretable Tree for Pedestrian Trajectory Prediction · AAAI 2022
LGD: Label-Guided Self-Distillation for Object Detection · AAAI 2022
Learning Disentangled Classification and Localization Representations for Temporal Action Localization · AAAI 2022
Learning To Refactor Action and Co-Occurrence Features for Temporal Action Localization · CVPR 2022
TransVPR: Transformer-Based Place Recognition With Multi-Level Attention Aggregation · CVPR 2022
Asymmetric Relation Consistency Reasoning for Video Relation Grounding · ECCV 2022
Towards Building A Group-based Unsupervised Representation Disentanglement Framework · ICLR 2022
SE(3) Equivariant Graph Neural Networks with Complete Local Frames · ICML 2022
Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction · ICCV 2021
Practical Relative Order Attack in Deep Ranking · ICCV 2021
End-to-End Object Detection With Fully Convolutional Network · CVPR 2021
INVIGORATE: Interactive Visual Grounding and Grasping in Clutter · RSS 2021
Dynamic Grained Encoder for Vision Transformers · NIPS 2021
Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context · AAAI 2021
Semantic Consistency Networks for 3D Object Detection · AAAI 2021
Learning Algebraic Recombination for Compositional Generalization · ACL 2021
Meta Pairwise Relationship Distillation for Unsupervised Person Re-Identification · ICCV 2021
Enriching Local and Global Contexts for Temporal Action Localization · ICCV 2021
Learning Algebraic Recombination for Compositional Generalization · IJCNLP 2021
Hindsight Trust Region Policy Optimization · IJCAI 2021
Co-evolution Transformer for Protein Contact Prediction · NIPS 2021
Instance-Conditional Knowledge Distillation for Object Detection · NIPS 2021
ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization · AAAI 2021
A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning · ECCV 2020
Compositional Generalization by Learning Analytical Expressions · NIPS 2020
Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition · CVPR 2020
Rethinking Learnable Tree Filter for Generic Feature Transform · NIPS 2020
Fine-Grained Dynamic Head for Object Detection · NIPS 2020
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks · ICCV 2019
Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification · CVPR 2019
Recognizing Unseen Attribute-Object Pair with Generative Model · AAAI 2019
Learnable Tree Filter for Structure-preserving Feature Transform · NIPS 2019
Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos · AAAI 2019
SR-LSTM: State Refinement for LSTM Towards Pedestrian Trajectory Prediction · CVPR 2019
Adding Attentiveness to the Neurons in Recurrent Neural Networks · ECCV 2018
Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks · CVPR 2018
Transductive Semi-Supervised Deep Learning using Min-Max Features · ECCV 2018
Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification · ECCV 2018
Kernelized Subspace Pooling for Deep Local Descriptors · CVPR 2018
Inferring Human Attention by Learning Latent Intentions · IJCAI 2017
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification · CVPR 2017
View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data · ICCV 2017
ER3: A Unified Framework for Event Retrieval, Recognition and Recounting · CVPR 2017
Discriminative Dictionary Learning With Ranking Metric Embedded for Person Re-Identification · IJCAI 2017
Similarity Learning With Spatial Constraints for Person Re-Identification · CVPR 2016
Person Re-Identification by Multi-Channel Parts-Based CNN With Improved Triplet Loss Function · CVPR 2016
Contour Guided Hierarchical Model for Shape Matching · ICCV 2015
Illumination Robust Color Naming via Label Propagation · ICCV 2015
Saturation-Preserving Specular Reflection Separation · CVPR 2015
Similarity Learning on an Explicit Polynomial Kernel Feature Map for Person Re-Identification · CVPR 2015
Modeling 4D Human-Object Interactions for Event and Object Recognition · ICCV 2013
Salient Object Detection: A Discriminative Regional Feature Integration Approach · CVPR 2013
Concurrent Action Detection with Structural Prediction · ICCV 2013
Constructing Adaptive Complex Cells for Robust Visual Tracking · ICCV 2013