Feng Zheng
80 papers · 2016–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Renaissance Researcher (9) π Interdisciplinary Bridge π Conference Polyglot (13) π Academic Marathon (10) πΊοΈ Taxonomy Completionist (129)
πΊοΈ
Taxonomy Completionist
(129)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π¬
Deep Specialist
(10)
π
Grand Slam
π€
Dynamic Duo
(11)
π
Triple Crown
π
Century Club
(77)
π₯
Unstoppable
(9)
π
Trend Setter
ποΈ
Keyword Collector
(334)
β‘
Prolific Year
(13)
π
Conference Pioneer
Conferences
AAAI (17)
CVPR (17)
IJCAI (11)
ICCV (10)
ECCV (7)
ICLR (4)
NIPS (4)
ICML (3)
ACL (2)
EMNLP (2)
MICCAI (1)
UAI (1)
WACV (1)
Top co-authors
Keywords
person re-identification
(8)
multimodal learning
(6)
video understanding
(5)
unsupervised learning
(4)
adversarial attack
(4)
embedding learning
(4)
metric learning
(4)
generative model
(4)
semantic segmentation
(4)
representation learning
(3)
vision-language model
(3)
large language model
(3)
object tracking
(3)
image segmentation
(3)
feature learning
(3)
visual object tracking
(3)
contrastive learning
(3)
domain generalization
(3)
stochastic gradient descent
(2)
medical imaging
(2)
Papers
CAPruner: Conceptual-Adjacent Scene Graph Pruner for Enhancing 3D Spatial Reasoning of Large Language Models
ACL 2026
Transferability of Adversarial Attacks in Video-based MLLMs: A Cross-modal Image-to-Video Approach
AAAI 2026
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios
AAAI 2026
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
WACV 2026
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models
ICLR 2025
An Information-theoretic Perspective of Hierarchical Clustering on Graphs
UAI 2025
On the Generalization Ability of Next-Token-Prediction Pretraining
ICML 2025
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
ICLR 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
ICCV 2025
Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors
EMNLP 2025
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos
CVPR 2025
Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization
IJCAI 2024
Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt
AAAI 2024
Self-guided Knowledge-injected Graph Neural Network for Alzheimerβs Diseases
MICCAI 2024
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion
ECCV 2024
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
ECCV 2024
Tuning-Free Image Customization with Image and Text Guidance
ECCV 2024
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting
EMNLP 2024
Place Anything into Any Video
IJCAI 2024
On the Noise Robustness of In-Context Learning for Text Generation
NIPS 2024
Beyond Prototypes: Semantic Anchor Regularization for Better Representation Learning
AAAI 2024
Block Image Compressive Sensing with Local and Global Information Interaction
AAAI 2024
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
ACL 2024
Negative Label Guided OOD Detection with Pretrained Vision-Language Models
ICLR 2024
Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes
CVPR 2024
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
ICCV 2023
Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore
ICLR 2023
On the Stability and Generalization of Triplet Learning
AAAI 2023
Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models
ICCV 2023
Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs
AAAI 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
ICCV 2023
Real3D-AD: A Dataset of Point Cloud Anomaly Detection
NIPS 2023
Accelerating Vision-Language Pretraining With Free Language Modeling
CVPR 2023
Resource-Efficient RGBD Aerial Tracking
CVPR 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
CVPR 2023
Detecting Out-of-distribution Data through In-distribution Class Prior
ICML 2023
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models
ICCV 2023
Meta Distribution Alignment for Generalizable Person Re-Identification
CVPR 2022
SoftPatch: Unsupervised Anomaly Detection with Noisy Data
NIPS 2022
VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization
AAAI 2022
GuidedMix-Net: Semi-supervised Semantic Segmentation by Using Labeled Images as Reference
AAAI 2022
Error-Based Knockoffs Inference for Controlled Feature Selection
AAAI 2022
Class-Aware Contrastive Semi-Supervised Learning
CVPR 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
CVPR 2022
S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning
ECCV 2022
Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline
ECCV 2022
Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks
ECCV 2022
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix
ICML 2022
FREE: Feature Refinement for Generalized Zero-Shot Learning
ICCV 2021
DepthTrack: Unveiling the Power of RGBD Tracking
ICCV 2021
End-to-End Dense Video Captioning With Parallel Decoding
ICCV 2021
Saliency-Associated Object Tracking
ICCV 2021
Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net
CVPR 2021
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification
CVPR 2021
Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval
IJCAI 2021
Constructing a Fair Classifier with Generated Fair Data
AAAI 2021
Distributed Ranking with Communications: Approximation Analysis and Applications
AAAI 2021
One for More: Selecting Generalizable Samples for Generalizable ReID Model
AAAI 2021
A Unified Multi-Scenario Attacking Network for Visual Object Tracking
AAAI 2021
Dual Distribution Alignment Network for Generalizable Person Re-Identification
AAAI 2021
Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
ICCV 2021
Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect
AAAI 2020
Super-Resolution and Inpainting with Degraded and Upgraded Generative Adversarial Networks
IJCAI 2020
Zero-Shot Object Detection via Learning an Embedding from Semantic Space to Visual Space
IJCAI 2020
Enabling Deep Residual Networks for Weakly Supervised Object Detection
ECCV 2020
Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery
NIPS 2020
Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification
AAAI 2020
Noise-Aware Fully Webly Supervised Object Detection
CVPR 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention
CVPR 2020
Salience-Guided Cascaded Suppression Network for Person Re-Identification
CVPR 2020
Deep Asymmetric Metric Learning via Rich Relationship Mining
CVPR 2019
Equally-Guided Discriminative Hashing for Cross-modal Retrieval
IJCAI 2019
Automatic Grassland Degradation Estimation Using Deep Learning
IJCAI 2019
Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training
CVPR 2019
Deep Spectral Clustering Using Dual Autoencoder Network
CVPR 2019
Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss
IJCAI 2019
A Part Power Set Model for Scale-Free Person Retrieval
IJCAI 2019
Unsupervised Deep Generative Adversarial Hashing Network
CVPR 2018
Fast Vehicle Identification in Surveillance via Ranked Semantic Sampling Based Embedding
IJCAI 2018
Learning Cross-View Binary Identities for Fast Person Re-Identification
IJCAI 2016