Nenghai Yu
104 papers · 2009–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (11) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
π
Academic Marathon
(16)
πΊοΈ
Taxonomy Completionist
(11)
π
Conference Loyalist
(20)
π
Keyword Trendsetter Combo
(7)
π€
Dynamic Duo
(47)
π
Triple Crown
π
Grand Slam
π¬
Deep Specialist
(12)
π
Keyword Champion
π₯
Unstoppable
(11)
π
Conference Pioneer
ποΈ
Keyword Collector
(392)
β‘
Prolific Year
(10)
π
Century Club
(101)
π
Trend Setter
Conferences
CVPR (26)
AAAI (22)
ICCV (13)
ICML (7)
IJCAI (7)
ACL (6)
ECCV (6)
NIPS (6)
EMNLP (4)
ICLR (4)
NAACL (2)
ACML (1)
Top co-authors
Research topics
Keywords
adversarial attack
(10)
large language model
(9)
image generation
(7)
semantic segmentation
(6)
generative adversarial network
(5)
zero-shot learning
(5)
image editing
(4)
diffusion model
(4)
neural network
(4)
transfer learning
(4)
convolutional neural network
(4)
few-shot learning
(4)
person re-identification
(4)
point cloud
(4)
anomaly detection
(3)
feature extraction
(3)
image inpainting
(3)
contrastive learning
(3)
vision transformer
(3)
neural machine translation
(3)
Papers
MagicPaint: Operate Anything for Image Inpainting with Diffusion Model
AAAI 2026
When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors
ACL 2026
EARG-Net: Edge-Aware Reconstruction-Guided Network for Image Manipulation Detection and Localization
AAAI 2026
BinMetric: A Comprehensive Binary Code Analysis Benchmark for Large Language Models
IJCAI 2025
Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling
ICCV 2025
Rethinking Masked Data Reconstruction Pretraining for Strong 3D Action Representation Learning
AAAI 2025
Training-free Open-Vocabulary Semantic Segmentation via Diverse Prototype Construction and Sub-region Matching
AAAI 2025
TAG-WM: Tamper-Aware Generative Image Watermarking via Diffusion Inversion Sensitivity
ICCV 2025
FE-CLIP: Frequency Enhanced CLIP Model for Zero-Shot Anomaly Detection and Segmentation
ICCV 2025
CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System
ACL 2025
SQL Injection Jailbreak: A Structural Disaster of Large Language Models
ACL 2025
EvoBench: Towards Real-world LLM-Generated Text Detection Benchmarking for Evolving Large Language Models
ACL 2025
Deciphering Cross-Modal Alignment in Large Vision-Language Models via Modality Integration Rate
ICCV 2025
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
EMNLP 2025
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG
NAACL 2025
De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks
ICML 2025
Towards Anytime Retrieval: A Benchmark for Anytime Person Re-Identification
IJCAI 2025
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
CVPR 2025
On the Vulnerability of Text Sanitization
NAACL 2025
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws
EMNLP 2024
Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models
CVPR 2024
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
CVPR 2024
Towards More Unified In-context Visual Understanding
CVPR 2024
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection
NIPS 2024
Transferable Facial Privacy Protection against Blind Face Restoration via Domain-Consistent Adversarial Obfuscation
ICML 2024
AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA
ICML 2024
Boosting Vanilla Lightweight Vision Transformers via Re-parameterization
ICLR 2024
TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection
AAAI 2024
MuST: Robust Image Watermarking for Multi-Source Tracing
AAAI 2024
Data-Free Hard-Label Robustness Stealing Attack
AAAI 2024
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators
AAAI 2024
FaceRSA: RSA-Aware Facial Identity Cryptography Framework
AAAI 2024
Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification
AAAI 2024
Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection
EMNLP 2024
Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features
EMNLP 2024
A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability
ECCV 2024
Diversity-Aware Meta Visual Prompting
CVPR 2023
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
AAAI 2023
AutoStegaFont: Synthesizing Vector Fonts for Hiding Information in Documents
AAAI 2023
Pseudo Label-Guided Model Inversion Attack via Conditional Generative Adversarial Network
AAAI 2023
DeAR: A Deep-Learning-Based Audio Re-recording Resilient Watermarking
AAAI 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
CVPR 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
ICCV 2023
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending
ICCV 2023
Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping
ICLR 2023
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
ICML 2023
Fluid Dynamics-Inspired Network for Infrared Small Target Detection
IJCAI 2023
UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection
ECCV 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
CVPR 2022
Shape-Invariant 3D Adversarial Point Clouds
CVPR 2022
HairCLIP: Design Your Hair by Text and Reference Image
CVPR 2022
Protecting Celebrities From DeepFake With Identity Consistency Transformer
CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows
CVPR 2022
Tracing Text Provenance via Context-Aware Lexical Substitution
AAAI 2022
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
ECCV 2022
Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification
ECCV 2022
Initiative Defense against Facial Manipulation
AAAI 2021
Improve Unsupervised Pretraining for Few-Label Transfer
ICCV 2021
ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation
ICCV 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
ICLR 2021
Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification
AAAI 2021
Temporal ROI Align for Video Object Recognition
AAAI 2021
Diverse Semantic Image Synthesis via Probability Distribution Modeling
CVPR 2021
Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain
CVPR 2021
Improved Image Matting via Real-Time User Clicks and Uncertainty Estimation
CVPR 2021
Multi-Attentional Deepfake Detection
CVPR 2021
Passport-aware Normalization for Deep Model Protection
NIPS 2020
LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks
CVPR 2020
Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer
CVPR 2020
Robust Superpixel-Guided Attentional Adversarial Attack
CVPR 2020
GSM: Graph Similarity Model for Multi-Object Tracking
IJCAI 2020
Density-Aware Graph for Deep Semi-Supervised Visual Recognition
CVPR 2020
DASOT: A Unified Framework Integrating Data Association and Single Object Tracking for Online Multi-Object Tracking
AAAI 2020
Self-Robust 3D Point Recognition via Gather-Vector Guidance
CVPR 2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack
NIPS 2020
Model Watermarking for Image Processing Networks
AAAI 2020
Memory-Based Neighbourhood Embedding for Visual Recognition
ICCV 2019
Context and Attribute Grounded Dense Captioning
CVPR 2019
Trust Region Evolution Strategies
AAAI 2019
Semantics Disentangling for Text-To-Image Generation
CVPR 2019
Detection Based Defense Against Adversarial Examples From the Steganalysis Point of View
CVPR 2019
Capacity Control of ReLU Neural Networks by Basis-Path Norm
AAAI 2019
G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space
ICLR 2019
DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense
ICCV 2019
Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once
ICCV 2019
Model-Level Dual Learning
ICML 2018
Decouple Learning for Parameterized Image Operators
ECCV 2018
Stereoscopic Neural Style Transfer
CVPR 2018
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition
ECCV 2018
Dual Supervised Learning
ICML 2017
StyleBank: An Explicit Representation for Neural Image Style Transfer
CVPR 2017
Coherent Online Video Style Transfer
ICCV 2017
Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism
ICCV 2017
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification
CVPR 2017
Asynchronous Stochastic Gradient Descent with Delay Compensation
ICML 2017
Deliberation Networks: Sequence Generation Beyond One-Pass Decoding
NIPS 2017
Dual Inference for Machine Learning
IJCAI 2017
Dual Learning for Machine Translation
NIPS 2016
Budgeted Multi-Armed Bandits with Multiple Plays
IJCAI 2016
Budgeted Bandit Problems with Continuous Random Costs
ACML 2015
Thompson Sampling for Budgeted Multi-Armed Bandits
IJCAI 2015
Word Alignment Modeling with Context Dependent Deep Neural Network
ACL 2013
A Ranking-based Approach to Word Reordering for Statistical Machine Translation
ACL 2012
Learning Bregman Distance Functions and Its Application for Semi-Supervised Clustering
NIPS 2009