Nanning Zheng
99 papers · 2013–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🏃 Academic Marathon (12) 🌍 Conference Polyglot (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏠
Conference Loyalist
(21)
🤝
Dynamic Duo
(14)
👑
Triple Crown
🏆
Grand Slam
🔬
Deep Specialist
(18)
🧬
Topic Evolution
🚀
Conference Pioneer
⚡
Prolific Year
(15)
❓
The Questioner
(5)
📈
Trend Setter
🗃️
Keyword Collector
(384)
🔥
Unstoppable
(11)
💎
Century Club
(96)
Conferences
CVPR (21)
NIPS (18)
AAAI (16)
ICCV (16)
ECCV (8)
ICML (5)
ICLR (4)
ACL (3)
IJCAI (3)
EMNLP (2)
IJCNLP (1)
MICCAI (1)
RSS (1)
Top co-authors
Keywords
object detection
(9)
semantic segmentation
(7)
video understanding
(7)
metric learning
(7)
neural network
(7)
feature extraction
(6)
temporal action localization
(6)
person re-identification
(6)
attention mechanism
(5)
action recognition
(5)
knowledge distillation
(4)
convolutional neural network
(4)
3d object detection
(4)
large language model
(4)
zero-shot learning
(3)
weakly supervised learning
(3)
vision-language model
(3)
transfer learning
(3)
trajectory prediction
(3)
few-shot learning
(3)
Papers
EVOKE: Efficient and High-Fidelity EEG-to-Video Reconstruction via Decoupling Implicit Neural Representation
AAAI 2026
Think before Go: Hierarchical Reasoning for Image-goal Navigation
ACL 2026
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
AAAI 2026
ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images
CVPR 2025
DAMap: Distance-aware MapNet for High Quality HD Map Construction
ICCV 2025
See Through Their Minds: Learning Transferable Brain Decoding Models from Cross-Subject fMRI
AAAI 2025
On the Statistical Mechanisms of Distributional Compositional Generalization
ICML 2025
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers
ICCV 2025
Beyond Brain Decoding: Visual-Semantic Reconstructions to Mental Creation Extension Based on fMRI
ICCV 2025
Unveiling Multi-View Anomaly Detection: Intra-view Decoupling and Inter-view Fusion
AAAI 2025
Beyond Single-Modal Boundary: Cross-Modal Anomaly Detection through Visual Prototype and Harmonization
CVPR 2025
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning
CVPR 2025
MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning
ECCV 2024
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis
CVPR 2024
Make Your LLM Fully Utilize the Context
NIPS 2024
Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement
NIPS 2024
Molecule Design by Latent Prompt Transformer
NIPS 2024
TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning
NIPS 2024
Neural P$^3$M: A Long-Range Interaction Modeling Enhancer for Geometric GNNs
NIPS 2024
Breaking through the learning plateaus of in-context learning in Transformer
ICML 2024
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection
ICLR 2024
Class-aware Mutual Mixup with Triple Alignments for Semi-Supervised Cross-domain Segmentation
MICCAI 2024
Self-Consistency Training for Density-Functional-Theory Hamiltonian Prediction
ICML 2024
Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection
AAAI 2024
GSO-Net: Grid Surface Optimization via Learning Geometric Constraints
AAAI 2024
IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance
AAAI 2024
Can LLMs Learn From Mistakes? An Empirical Study on Reasoning Tasks
EMNLP 2024
PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation
ECCV 2024
AugDETR: Improving Multi-scale Learning for Detection Transformer
ECCV 2024
Skill-Based Few-Shot Selection for In-Context Learning
EMNLP 2023
Geometric Transformer with Interatomic Positional Encoding
NIPS 2023
Closing the gap between the upper bound and lower bound of Adam's iteration complexity
NIPS 2023
StructVPR: Distill Structural Knowledge With Weighting Samples for Visual Place Recognition
CVPR 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
ICLR 2023
DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
ICLR 2023
Learning Trajectories are Generalization Indicators
NIPS 2023
How Do In-Context Examples Affect Compositional Generalization?
ACL 2023
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models
NIPS 2023
MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering
CVPR 2023
DETR Does Not Need Multi-Scale or Locality Design
ICCV 2023
Inverse Compositional Learning for Weakly-supervised Relation Grounding
ICCV 2023
Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning
ICML 2022
Could Giant Pre-trained Image Models Extract Universal Representations?
NIPS 2022
Visual Concepts Tokenization
NIPS 2022
Construct Effective Geometry Aware Feature Pyramid Network for Multi-Scale Object Detection
AAAI 2022
Social Interpretable Tree for Pedestrian Trajectory Prediction
AAAI 2022
LGD: Label-Guided Self-Distillation for Object Detection
AAAI 2022
Learning Disentangled Classification and Localization Representations for Temporal Action Localization
AAAI 2022
Learning To Refactor Action and Co-Occurrence Features for Temporal Action Localization
CVPR 2022
TransVPR: Transformer-Based Place Recognition With Multi-Level Attention Aggregation
CVPR 2022
Asymmetric Relation Consistency Reasoning for Video Relation Grounding
ECCV 2022
Towards Building A Group-based Unsupervised Representation Disentanglement Framework
ICLR 2022
SE(3) Equivariant Graph Neural Networks with Complete Local Frames
ICML 2022
Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction
ICCV 2021
Practical Relative Order Attack in Deep Ranking
ICCV 2021
End-to-End Object Detection With Fully Convolutional Network
CVPR 2021
INVIGORATE: Interactive Visual Grounding and Grasping in Clutter
RSS 2021
Dynamic Grained Encoder for Vision Transformers
NIPS 2021
Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context
AAAI 2021
Semantic Consistency Networks for 3D Object Detection
AAAI 2021
Learning Algebraic Recombination for Compositional Generalization
ACL 2021
Meta Pairwise Relationship Distillation for Unsupervised Person Re-Identification
ICCV 2021
Enriching Local and Global Contexts for Temporal Action Localization
ICCV 2021
Learning Algebraic Recombination for Compositional Generalization
IJCNLP 2021
Hindsight Trust Region Policy Optimization
IJCAI 2021
Co-evolution Transformer for Protein Contact Prediction
NIPS 2021
Instance-Conditional Knowledge Distillation for Object Detection
NIPS 2021
ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
AAAI 2021
A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning
ECCV 2020
Compositional Generalization by Learning Analytical Expressions
NIPS 2020
Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
CVPR 2020
Rethinking Learnable Tree Filter for Generic Feature Transform
NIPS 2020
Fine-Grained Dynamic Head for Object Detection
NIPS 2020
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks
ICCV 2019
Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification
CVPR 2019
Recognizing Unseen Attribute-Object Pair with Generative Model
AAAI 2019
Learnable Tree Filter for Structure-preserving Feature Transform
NIPS 2019
Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos
AAAI 2019
SR-LSTM: State Refinement for LSTM Towards Pedestrian Trajectory Prediction
CVPR 2019
Adding Attentiveness to the Neurons in Recurrent Neural Networks
ECCV 2018
Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks
CVPR 2018
Transductive Semi-Supervised Deep Learning using Min-Max Features
ECCV 2018
Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification
ECCV 2018
Kernelized Subspace Pooling for Deep Local Descriptors
CVPR 2018
Inferring Human Attention by Learning Latent Intentions
IJCAI 2017
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification
CVPR 2017
View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data
ICCV 2017
ER3: A Unified Framework for Event Retrieval, Recognition and Recounting
CVPR 2017
Discriminative Dictionary Learning With Ranking Metric Embedded for Person Re-Identification
IJCAI 2017
Similarity Learning With Spatial Constraints for Person Re-Identification
CVPR 2016
Person Re-Identification by Multi-Channel Parts-Based CNN With Improved Triplet Loss Function
CVPR 2016
Contour Guided Hierarchical Model for Shape Matching
ICCV 2015
Illumination Robust Color Naming via Label Propagation
ICCV 2015
Saturation-Preserving Specular Reflection Separation
CVPR 2015
Similarity Learning on an Explicit Polynomial Kernel Feature Map for Person Re-Identification
CVPR 2015
Modeling 4D Human-Object Interactions for Event and Object Recognition
ICCV 2013
Salient Object Detection: A Discriminative Regional Feature Integration Approach
CVPR 2013
Concurrent Action Detection with Structural Prediction
ICCV 2013
Constructing Adaptive Complex Cells for Robust Visual Tracking
ICCV 2013