Jiangning Zhang
62 papers · 2020–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
🌍 Conference Polyglot (7) 🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(5)
🏠
Conference Loyalist
(23)
🤝
Dynamic Duo
(36)
🔬
Deep Specialist
(13)
🧬
Topic Evolution
🏆
Keyword Champion
(2)
⚡
Prolific Year
(18)
❓
The Questioner
🗃️
Keyword Collector
(276)
🔥
Unstoppable
(6)
💎
Century Club
(59)
📈
Trend Setter
Conferences
CVPR (23)
AAAI (13)
ECCV (10)
ICCV (8)
NIPS (4)
IJCAI (2)
ACL (1)
ICLR (1)
Top co-authors
Keywords
diffusion model
(9)
anomaly detection
(8)
image generation
(7)
semantic segmentation
(5)
multimodal large language model
(4)
point cloud
(4)
identity preservation
(4)
object detection
(4)
knowledge distillation
(4)
representation learning
(3)
contrastive learning
(3)
3d vision
(3)
self-supervised learning
(3)
multimodal learning
(3)
multi-modal learning
(3)
unsupervised learning
(3)
image restoration
(3)
feature extraction
(3)
attention mechanism
(3)
state space model
(3)
Papers
Disco-RAG: Discourse-Aware Retrieval-Augmented Generation
ACL 2026
LLM-Oriented Token-Adaptive Knowledge Distillation
AAAI 2026
UltraGen: High-Resolution Video Generation with Hierarchical Attention
AAAI 2026
SVFR: A Unified Framework for Generalized Video Face Restoration
CVPR 2025
CustAny: Customizing Anything from A Single Example
CVPR 2025
OSV: One Step is Enough for High-Quality Image to Video Generation
CVPR 2025
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection
CVPR 2025
TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
CVPR 2025
ID-Sculpt: ID-aware 3D Head Generation from Single In-the-wild Portrait Image
AAAI 2025
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning
AAAI 2025
Explore In-Context Segmentation via Latent Diffusion Models
AAAI 2025
Point Cloud Mamba: Point Cloud Learning via State Space Model
AAAI 2025
SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation
ICLR 2025
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
ICCV 2025
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
ICCV 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers For Motion Transfer
ICCV 2025
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
CVPR 2025
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model
CVPR 2025
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction
CVPR 2025
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
CVPR 2025
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
CVPR 2025
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
CVPR 2024
MotionBooth: Motion-Aware Customized Text-to-Video Generation
NIPS 2024
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
NIPS 2024
Fetch and Forge: Efficient Dataset Condensation for Object Detection
NIPS 2024
Rethinking Reverse Distillation for Multi-Modal Anomaly Detection
AAAI 2024
A Diffusion-Based Framework for Multi-Class Anomaly Detection
AAAI 2024
AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model
AAAI 2024
Self-Supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes
AAAI 2024
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection
CVPR 2024
SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis
CVPR 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
CVPR 2024
Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection
ECCV 2024
FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis
ECCV 2024
AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
ECCV 2024
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
ECCV 2024
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
ECCV 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
ECCV 2024
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
IJCAI 2024
Rethinking Mobile Block for Efficient Attention-based Models
ICCV 2023
Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection
ICCV 2023
Learning Global-aware Kernel for Image Harmonization
ICCV 2023
Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking
CVPR 2023
Multimodal Industrial Anomaly Detection via Hybrid Fusion
CVPR 2023
High-Fidelity Generalized Emotional Talking Face Generation With Multi-Modal Emotion Space Learning
CVPR 2023
Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution
CVPR 2023
Calibrated Teacher for Sparsely Annotated Object Detection
AAAI 2023
Learning To Measure the Point Cloud Reconstruction Loss in a Representation Space
CVPR 2023
MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection
CVPR 2023
Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
ICCV 2023
Learning to Train a Point Cloud Reconstruction Network without Matching
ECCV 2022
Resolution-Free Point Cloud Sampling Network with Data Distillation
ECCV 2022
Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping
ECCV 2022
Iterative Few-shot Semantic Segmentation from Image Label Text
IJCAI 2022
SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-resolution
AAAI 2022
Region-Aware Face Swapping
CVPR 2022
Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model
NIPS 2021
RFNet: Recurrent Forward Network for Dense Point Cloud Completion
ICCV 2021
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
AAAI 2020
FReeNet: Multi-Identity Face Reenactment
CVPR 2020
Learning by Analogy: Reliable Supervision From Transformations for Unsupervised Optical Flow Estimation
CVPR 2020
DTVNet: Dynamic Time-lapse Video Generation via Single Still Image
ECCV 2020