Nenghai Yu

104 papers · 2009–2026 · across 12 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🏃 Academic Marathon (16) 🏠 Conference Loyalist (20) 🌟 Keyword Trendsetter Combo (7) 🤝 Dynamic Duo (47) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (12) 🏆 Keyword Champion 🔥 Unstoppable (11) 🚀 Conference Pioneer 🗃️ Keyword Collector (392) ⚡ Prolific Year (10) 💎 Century Club (101) 📈 Trend Setter

Conferences

CVPR (26) AAAI (22) ICCV (13) ICML (7) IJCAI (7) ACL (6) ECCV (6) NIPS (6) EMNLP (4) ICLR (4) NAACL (2) ACML (1)

Top co-authors

Weiming Zhang (47) Dongdong Chen (31) Qi Chu (29) Bin Liu (25) Xiaoyi Dong (15) Lu Yuan (15) Wenbo Zhou (14) Tie-yan Liu (12) Kejiang Chen (12) Tao Gong (11)

Research topics

Privacy (4) Techniques (1) Computer Vision (1) Security & Privacy (1)

Keywords

adversarial attack (10) large language model (9) image generation (7) semantic segmentation (6) generative adversarial network (5) zero-shot learning (5) image editing (4) diffusion model (4) neural network (4) transfer learning (4) convolutional neural network (4) few-shot learning (4) person re-identification (4) point cloud (4) anomaly detection (3) feature extraction (3) image inpainting (3) contrastive learning (3) vision transformer (3) neural machine translation (3)

Papers

MagicPaint: Operate Anything for Image Inpainting with Diffusion Model AAAI 2026
When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors ACL 2026
EARG-Net: Edge-Aware Reconstruction-Guided Network for Image Manipulation Detection and Localization AAAI 2026
BinMetric: A Comprehensive Binary Code Analysis Benchmark for Large Language Models IJCAI 2025
Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling ICCV 2025
Rethinking Masked Data Reconstruction Pretraining for Strong 3D Action Representation Learning AAAI 2025
Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching AAAI 2025
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity ICCV 2025
FE-CLIP: Frequency Enhanced CLIP Model for Zero-Shot Anomaly Detection and Segmentation ICCV 2025
CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System ACL 2025
SQL Injection Jailbreak: A Structural Disaster of Large Language Models ACL 2025
EvoBench: Towards Real-world LLM-Generated Text Detection Benchmarking for Evolving Large Language Models ACL 2025
Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate ICCV 2025
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation EMNLP 2025
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG NAACL 2025
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks ICML 2025
Towards Anytime Retrieval: A Benchmark for Anytime Person Re-Identification IJCAI 2025
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery CVPR 2025
On the Vulnerability of Text Sanitization NAACL 2025
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws EMNLP 2024
Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models CVPR 2024
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation CVPR 2024
Towards More Unified In-context Visual Understanding CVPR 2024
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection NIPS 2024
Transferable Facial Privacy Protection against Blind Face Restoration via Domain-Consistent Adversarial Obfuscation ICML 2024
AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA ICML 2024
Boosting Vanilla Lightweight Vision Transformers via Re-parameterization ICLR 2024
TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection AAAI 2024
MuST: Robust Image Watermarking for Multi-Source Tracing AAAI 2024
Data-Free Hard-Label Robustness Stealing Attack AAAI 2024
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators AAAI 2024
FaceRSA: RSA-Aware Facial Identity Cryptography Framework AAAI 2024
Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification AAAI 2024
Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection EMNLP 2024
Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features EMNLP 2024
A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability ECCV 2024
Diversity-Aware Meta Visual Prompting CVPR 2023
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers AAAI 2023
AutoStegaFont: Synthesizing Vector Fonts for Hiding Information in Documents AAAI 2023
Pseudo Label-Guided Model Inversion Attack via Conditional Generative Adversarial Network AAAI 2023
DeAR: A Deep-Learning-Based Audio Re-recording Resilient Watermarking AAAI 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining CVPR 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting ICCV 2023
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending ICCV 2023
Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping ICLR 2023
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion ICML 2023
Fluid Dynamics-Inspired Network for Infrared Small Target Detection IJCAI 2023
UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection ECCV 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting CVPR 2022
Shape-Invariant 3D Adversarial Point Clouds CVPR 2022
HairCLIP: Design Your Hair by Text and Reference Image CVPR 2022
Protecting Celebrities From DeepFake With Identity Consistency Transformer CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows CVPR 2022
Tracing Text Provenance via Context-Aware Lexical Substitution AAAI 2022
Bootstrapped Masked Autoencoders for Vision BERT Pretraining ECCV 2022
Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification ECCV 2022
Initiative Defense against Facial Manipulation AAAI 2021
Improve Unsupervised Pretraining for Few-Label Transfer ICCV 2021
ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation ICCV 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning ICLR 2021
Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification AAAI 2021
Temporal ROI Align for Video Object Recognition AAAI 2021
Diverse Semantic Image Synthesis via Probability Distribution Modeling CVPR 2021
Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain CVPR 2021
Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation CVPR 2021
Multi-Attentional Deepfake Detection CVPR 2021
Passport-aware Normalization for Deep Model Protection NIPS 2020
LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks CVPR 2020
Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer CVPR 2020
Robust Superpixel-Guided Attentional Adversarial Attack CVPR 2020
GSM: Graph Similarity Model for Multi-Object Tracking IJCAI 2020
Density-Aware Graph for Deep Semi-Supervised Visual Recognition CVPR 2020
DASOT: A Unified Framework Integrating Data Association and Single Object Tracking for Online Multi-Object Tracking AAAI 2020
Self-Robust 3D Point Recognition via Gather-Vector Guidance CVPR 2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack NIPS 2020
Model Watermarking for Image Processing Networks AAAI 2020
Memory-Based Neighbourhood Embedding for Visual Recognition ICCV 2019
Context and Attribute Grounded Dense Captioning CVPR 2019
Trust Region Evolution Strategies AAAI 2019
Semantics Disentangling for Text-To-Image Generation CVPR 2019
Detection Based Defense Against Adversarial Examples From the Steganalysis Point of View CVPR 2019
Capacity Control of ReLU Neural Networks by Basis-Path Norm AAAI 2019
G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space ICLR 2019
DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense ICCV 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once ICCV 2019
Model-Level Dual Learning ICML 2018
Decouple Learning for Parameterized Image Operators ECCV 2018
Stereoscopic Neural Style Transfer CVPR 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition ECCV 2018
Dual Supervised Learning ICML 2017
StyleBank: An Explicit Representation for Neural Image Style Transfer CVPR 2017
Coherent Online Video Style Transfer ICCV 2017
Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism ICCV 2017
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification CVPR 2017
Asynchronous Stochastic Gradient Descent with Delay Compensation ICML 2017
Deliberation Networks: Sequence Generation Beyond One-Pass Decoding NIPS 2017
Dual Inference for Machine Learning IJCAI 2017
Dual Learning for Machine Translation NIPS 2016
Budgeted Multi-Armed Bandits with Multiple Plays IJCAI 2016
Budgeted Bandit Problems with Continuous Random Costs ACML 2015
Thompson Sampling for Budgeted Multi-Armed Bandits IJCAI 2015
Word Alignment Modeling with Context Dependent Deep Neural Network ACL 2013
A Ranking-based Approach to Word Reordering for Statistical Machine Translation ACL 2012
Learning Bregman Distance Functions and Its Application for Semi-Supervised Clustering NIPS 2009