Feng Zheng

80 papers · 2016–2026 · 13 top CS/AI conferences

Achievements

🌈 Renaissance Researcher (9) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13) 🏃 Academic Marathon (10) 🗺️ Taxonomy Completionist (129)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🔬 Deep Specialist (10) 🏆 Grand Slam 🤝 Dynamic Duo (11) 👑 Triple Crown 💎 Century Club (77) 🔥 Unstoppable (9) 📈 Trend Setter 🗃️ Keyword Collector (334) ⚡ Prolific Year (13) 🚀 Conference Pioneer

Conferences

AAAI (17) CVPR (17) IJCAI (11) ICCV (10) ECCV (7) ICLR (4) NIPS (4) ICML (3) ACL (2) EMNLP (2) MICCAI (1) UAI (1) WACV (1)

Top co-authors

Teng Wang (12) Chengjie Wang (8) Rongrong Ji (8) Hong Chen (8) Ling Shao (8) Yong Liu (7) Cheng Deng (5) Jinbao Wang (5) Xing Sun (5) Zhenyu He (5)

Keywords

person re-identification (8) multimodal learning (6) video understanding (5) unsupervised learning (4) adversarial attack (4) embedding learning (4) metric learning (4) generative model (4) semantic segmentation (4) representation learning (3) vision-language model (3) large language model (3) object tracking (3) image segmentation (3) feature learning (3) visual object tracking (3) contrastive learning (3) domain generalization (3) stochastic gradient descent (2) medical imaging (2)

Papers

CAPruner: Conceptual-Adjacent Scene Graph Pruner for Enhancing 3D Spatial Reasoning of Large Language Models ACL 2026
Transferability of Adversarial Attacks in Video-based MLLMs: A Cross-modal Image-to-Video Approach AAAI 2026
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios AAAI 2026
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning WACV 2026
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models ICLR 2025
An Information-theoretic Perspective of Hierarchical Clustering on Graphs UAI 2025
On the Generalization Ability of Next-Token-Prediction Pretraining ICML 2025
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection ICLR 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation ICCV 2025
Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors EMNLP 2025
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos CVPR 2025
Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization IJCAI 2024
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt AAAI 2024
Self-guided Knowledge-injected Graph Neural Network for Alzheimer’s Diseases MICCAI 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion ECCV 2024
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models ECCV 2024
Tuning-Free Image Customization with Image and Text Guidance ECCV 2024
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting EMNLP 2024
Place Anything into Any Video IJCAI 2024
On the Noise Robustness of In-Context Learning for Text Generation NIPS 2024
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning AAAI 2024
Block Image Compressive Sensing with Local and Global Information Interaction AAAI 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production ACL 2024
Negative Label Guided OOD Detection with Pretrained Vision-Language Models ICLR 2024
Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes CVPR 2024
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples ICCV 2023
Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore ICLR 2023
On the Stability and Generalization of Triplet Learning AAAI 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models ICCV 2023
Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs AAAI 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning ICCV 2023
Real3D-AD: A Dataset of Point Cloud Anomaly Detection NIPS 2023
Accelerating Vision-Language Pretraining With Free Language Modeling CVPR 2023
Resource-Efficient RGBD Aerial Tracking CVPR 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline CVPR 2023
Detecting Out-of-distribution Data through In-distribution Class Prior ICML 2023
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models ICCV 2023
Meta Distribution Alignment for Generalizable Person Re-Identification CVPR 2022
SoftPatch: Unsupervised Anomaly Detection with Noisy Data NIPS 2022
VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization AAAI 2022
GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference AAAI 2022
Error-Based Knockoffs Inference for Controlled Feature Selection AAAI 2022
Class-Aware Contrastive Semi-Supervised Learning CVPR 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression CVPR 2022
S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning ECCV 2022
Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline ECCV 2022
Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks ECCV 2022
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix ICML 2022
FREE: Feature Refinement for Generalized Zero-Shot Learning ICCV 2021
DepthTrack: Unveiling the Power of RGBD Tracking ICCV 2021
End-to-End Dense Video Captioning With Parallel Decoding ICCV 2021
Saliency-Associated Object Tracking ICCV 2021
Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net CVPR 2021
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification CVPR 2021
Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval IJCAI 2021
Constructing a Fair Classifier with Generated Fair Data AAAI 2021
Distributed Ranking with Communications: Approximation Analysis and Applications AAAI 2021
One for More: Selecting Generalizable Samples for Generalizable ReID Model AAAI 2021
A Unified Multi-Scenario Attacking Network for Visual Object Tracking AAAI 2021
Dual Distribution Alignment Network for Generalizable Person Re-Identification AAAI 2021
Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation ICCV 2021
Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect AAAI 2020
Super-Resolution and Inpainting with Degraded and Upgraded Generative Adversarial Networks IJCAI 2020
Zero-Shot Object Detection via Learning an Embedding from Semantic Space to Visual Space IJCAI 2020
Enabling Deep Residual Networks for Weakly Supervised Object Detection ECCV 2020
Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery NIPS 2020
Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification AAAI 2020
Noise-Aware Fully Webly Supervised Object Detection CVPR 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification CVPR 2020
Deep Asymmetric Metric Learning via Rich Relationship Mining CVPR 2019
Equally-Guided Discriminative Hashing for Cross-modal Retrieval IJCAI 2019
Automatic Grassland Degradation Estimation Using Deep Learning IJCAI 2019
Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training CVPR 2019
Deep Spectral Clustering Using Dual Autoencoder Network CVPR 2019
Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss IJCAI 2019
A Part Power Set Model for Scale-Free Person Retrieval IJCAI 2019
Unsupervised Deep Generative Adversarial Hashing Network CVPR 2018
Fast Vehicle Identification in Surveillance via Ranked Semantic Sampling Based Embedding IJCAI 2018
Learning Cross-View Binary Identities for Fast Person Re-Identification IJCAI 2016