Jian Zhang

168 papers · 2000–2026 · 18 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (18)

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (18) 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (25) 🤝 Dynamic Duo (13) 🧬 Topic Evolution 🏆 Grand Slam 🔬 Deep Specialist (21) 🏆 Keyword Champion 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (8) ⚡ Prolific Year (32) 💎 Century Club (155) 🗃️ Keyword Collector (53) ❓ The Questioner (2)

Conferences

CVPR (38) AAAI (32) ICCV (17) NIPS (13) ICLR (12) ECCV (12) IJCAI (10) ACL (9) EMNLP (6) COLING (5) CORL (3) MICCAI (3) AISTATS (2) ICML (2) INTERSPEECH (1) JMLR (1) NAACL (1) WACV (1)

Top co-authors

Yinghuan Shi (14) Lei Qi (14) Chong Mou (12) Ruiqin Xiong (10) Feifei Ma (8) Tiejun Huang (8) Xuanyu Zhang (8) Xinhua Cheng (7) Jiwen Yu (7) Yanmin Wu (6)

Research topics

Privacy (4) Optimization & Theory (1)

Keywords

diffusion model (13) image reconstruction (10) large language model (9) spike camera (8) convolutional neural network (8) semantic segmentation (7) graph neural network (6) image restoration (6) domain adaptation (6) point cloud (5) knowledge distillation (5) novel view synthesis (4) image super-resolution (4) 3d gaussian splatting (4) 3d reconstruction (4) unsupervised learning (4) transfer learning (4) few-shot learning (4) image generation (4) semi-supervised learning (4)

Papers

Beyond the Panorama: Training-Free Hierarchical Perception-Reasoning for Fine-Grained Vision in MLLMs ACL 2026 FactVerse: A Benchmark for Factual Consistency in Interleaved Image–Text Generation ACL 2026 MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning ACL 2026 MUR: Momentum Uncertainty guided Reasoning for Large Language Models ACL 2026 MAPS: Multi-Agent Personality Shaping for Collaborative Reasoning AAAI 2026 MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization AAAI 2026 LLM-Guided Quantified SMT Solving over Uninterpreted Functions AAAI 2026 VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning AAAI 2026 PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer AAAI 2026 Dissecting Failure Dynamics in Large Language Model Reasoning ACL 2026 Decomposing and Composing: Towards Efficient Vision-Language Continual Learning via Rank-1 Expert Pool in a Single LoRA AAAI 2026 CoreGaze: Core Subgraph-Driven Visual Gaze Diffusion for Training-Free Referring Multimodal Large Language Models ACL 2026 LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models AAAI 2026 Humanoid Policy Human Policy CORL 2025 RadKAM: Attention-Driven Kolmogorov-Arnold Model for Automatic Radiation-Induced Lymphopenia Prediction by Multimodal Learning MICCAI 2025 GA-SAM: Geometry-Aware SAM Adaptation with Sparse Annotation-Driven Point Cloud Completion MICCAI 2025 Fusing Dual Encoders: Single-source Domain Generalization with Extremely Few Annotations MICCAI 2025 SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography ICLR 2025 FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models ICLR 2025 Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD ICLR 2025 AutoG: Towards automatic graph construction from tabular data ICLR 2025 PointGAC: Geometric-Aware Codebook for Masked Point Modeling ICCV 2025 Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints ICCV 2025 Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks AAAI 2025 Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization AAAI 2025 A Complete Algorithm for Optimization Modulo Nonlinear Real Arithmetic AAAI 2025 C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction AAAI 2025 Revisiting Interpolation for Noisy Label Correction AAAI 2025 A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation ICCV 2025 Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild ICCV 2025 Efficient Universal Goal Hijacking with Semantics-guided Prompt Organization ACL 2025 LLM-Driven Completeness and Consistency Evaluation for Cultural Heritage Data Augmentation in Cross-Modal Retrieval EMNLP 2025 DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models EMNLP 2025 ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming EMNLP 2025 InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception CVPR 2025 OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction CVPR 2025 Retrieval Augmented Instruction Tuning for Open NER with Large Language Models COLING 2025 Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering CVPR 2025 Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation CVPR 2025 SkillMimic: Learning Basketball Interaction Skills from Demonstrations CVPR 2025 OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking CVPR 2025 Adversarial Diffusion Compression for Real-World Image Super-Resolution CVPR 2025 Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization CVPR 2025 Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting CVPR 2025 Robot Operating Home Appliances by Reading User Manuals CORL 2025 Score-CDM: Score-Weighted Convolutional Diffusion Model for Multivariate Time Series Imputation IJCAI 2024 ReVideo: Remake a Video with Motion and Content Control NIPS 2024 OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding NIPS 2024 Large Spatial Model: End-to-end Unposed Images to Semantic 3D NIPS 2024 GS-Hider: Hiding Messages into 3D Gaussian Splatting NIPS 2024 HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting NIPS 2024 Joint Demosaicing and Denoising for Spike Camera AAAI 2024 Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training AAAI 2024 T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models AAAI 2024 Optical Flow for Spike Camera with Hierarchical Spatial-Temporal Spike Fusion AAAI 2024 Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck AAAI 2024 Expressive Multi-Agent Communication via Identity-Aware Learning AAAI 2024 A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction COLING 2024 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model CVPR 2024 Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike Fluctuations CVPR 2024 Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation CVPR 2024 EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection CVPR 2024 Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model CVPR 2024 DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing CVPR 2024 Super-Resolution Reconstruction from Bayer-Pattern Spike Streams CVPR 2024 KPConvX: Modernizing Kernel Point Convolution with Kernel Attention CVPR 2024 OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model ECCV 2024 Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization ECCV 2024 The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation ECCV 2024 Towards compact reversible image representations for neural style transfer ECCV 2024 Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training EMNLP 2024 BadEdit: Backdooring Large Language Models by Model Editing ICLR 2024 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts ICLR 2024 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models ICLR 2024 NetInfoF Framework: Measuring and Exploiting Network Usable Information ICLR 2024 SaSDim:Self-Adaptive Noise Scaling Diffusion Model for Spatial Time Series Imputation IJCAI 2024 Implicit Neural Representation for Cooperative Low-light Image Enhancement ICCV 2023 Empirical Study of Zero-Shot NER with ChatGPT EMNLP 2023 Suggesting Variable Order for Cylindrical Algebraic Decomposition via Reinforcement Learning NIPS 2023 Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation ICCV 2023 DomainAdaptor: A Novel Approach to Test-time Adaptation ICCV 2023 A Unified Continual Learning Framework with General Parameter-Efficient Tuning ICCV 2023 Generalizable Decision Boundaries: Dualistic Meta-Learning for Open Set Domain Generalization ICCV 2023 Overlap-Guided Gaussian Mixture Models for Point Cloud Registration WACV 2023 A Study on Visualization of Voiceprint Feature INTERSPEECH 2023 Null-Space Diffusion Sampling for Zero-Shot Point Cloud Completion IJCAI 2023 HVTSurv: Hierarchical Vision Transformer for Patient-Level Survival Prediction from Whole Slide Image AAAI 2023 GAN Prior Based Null-Space Learning for Consistent Super-resolution AAAI 2023 Less Is More Important: An Attention Module Guided by Probability Density Function for Convolutional Neural Networks AAAI 2023 Learning to Super-resolve Dynamic Scenes for Neuromorphic Spike Camera AAAI 2023 Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model ICLR 2023 EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding CVPR 2023 Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning CVPR 2023 Optimization-Inspired Cross-Attention Transformer for Compressive Sensing CVPR 2023 Multi-Agent Automated Machine Learning CVPR 2023 Large-Capacity and Flexible Video Steganography via Invertible Neural Network CVPR 2023 Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration CVPR 2023 Knowledge-Constrained Answer Generation for Open-Ended Video Question Answering AAAI 2023 Can Graph Neural Networks Learn to Solve the MaxSAT Problem? (Student Abstract) AAAI 2023 FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model ICCV 2023 CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography NIPS 2023 Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation CVPR 2022 Frequency Domain Model Augmentation for Adversarial Attack ECCV 2022 Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation ECCV 2022 Metric Learning Based Interactive Modulation for Real-World Super-Resolution ECCV 2022 Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection ECCV 2022 R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning ECCV 2022 MVDG: A Unified Multi-View Framework for Domain Generalization ECCV 2022 Deep Generalized Unfolding Networks for Image Restoration CVPR 2022 HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging CVPR 2022 Image Disentanglement Autoencoder for Steganography Without Embedding CVPR 2022 Robust Invertible Image Steganography CVPR 2022 Word Level Robustness Enhancement: Fight Perturbation with Perturbation AAAI 2022 Panini-Net: GAN Prior Based Degradation-Aware Feature Interpolation for Face Restoration AAAI 2022 Unpaired Multi-Domain Stain Transfer for Kidney Histopathological Images AAAI 2022 Matching on Sets: Conquer Occluded Person Re-identification Without Alignment AAAI 2021 TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification NIPS 2021 Super Resolve Dynamic Scene From Continuous Spike Streams ICCV 2021 Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream CVPR 2021 Webly Supervised Fine-Grained Recognition: Benchmark Datasets and an Approach ICCV 2021 Dense Deep Unfolding Network With 3D-CNN Prior for Snapshot Compressive Imaging ICCV 2021 Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching IJCAI 2021 Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning ICML 2021 PTN: A Poisson Transfer Network for Semi-supervised Few-shot Learning AAAI 2021 Jo-SRC: A Contrastive Approach for Combating Noisy Labels CVPR 2021 Dynamic Attentive Graph Learning for Image Restoration ICCV 2021 Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation CVPR 2021 Adversarial AutoAugment ICLR 2020 Feature-Metric Registration: A Fast Semi-Supervised Approach for Robust Point Cloud Registration Without Correspondences CVPR 2020 Contextual Embeddings: When Are They Worth It? ACL 2020 Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development AAAI 2020 Measuring and Improving the Use of Graph Information in Graph Neural Networks ICLR 2020 Face Anti-Spoofing via Disentangled Representation Learning ECCV 2020 AutoBSS: An Efficient Algorithm for Block Stacking Style Search NIPS 2020 Field-wise Learning for Multi-field Categorical Data NIPS 2020 A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification IJCAI 2020 A Spatial Missing Value Imputation Method for Multi-view Urban Statistical Data IJCAI 2020 Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer IJCAI 2020 Adversarial Domain Adaptation with Domain Mixup AAAI 2020 Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks CVPR 2019 Variational Convolutional Neural Network Pruning CVPR 2019 Scene Text Recognition from Two-Dimensional Perspective AAAI 2019 Worst Cases Policy Gradients CORL 2019 Variational Few-Shot Learning ICCV 2019 Low-Precision Random Fourier Features for Memory-constrained Kernel Approximation AISTATS 2019 Approximating Integer Solution Counting via Space Quantification for Linear Constraints IJCAI 2019 Solving the Satisfiability Problem of Modal Logic S5 Guided by Graph Coloring IJCAI 2019 On the Downstream Performance of Compressed Word Embeddings NIPS 2019 Leveraging Heterogeneous Auxiliary Tasks to Assist Crowd Counting CVPR 2019 Extracting Privileged Information from Untagged Corpora for Classifier Learning IJCAI 2018 ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing CVPR 2018 Goal-Oriented Visual Question Generation via Intermediate Rewards ECCV 2018 Fine-Grained Video Captioning for Sports Narrative CVPR 2018 Natural Language Inference over Interaction Space ICLR 2018 Structured Control Nets for Deep Reinforcement Learning ICML 2018 SQuAD: 100,000+ Questions for Machine Comprehension of Text EMNLP 2016 Topic-Informed Neural Machine Translation COLING 2016 Fast Gated Neural Domain Adaptation: Language Model as a Case Study COLING 2016 Higher-Order Inference for Multi-Class Log-Supermodular Models ICCV 2015 Image Denoising via Adaptive Soft-Thresholding Based on Non-Local Samples CVPR 2015 On Linearly Constrained Minimum Variance Beamforming JMLR 2015 Message Passing Inference for Large Scale Graphical Models with High Order Potentials NIPS 2014 Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors ICCV 2013 Multiple Instance Learning on Structured Data NIPS 2011 The Group Dantzig Selector AISTATS 2010 A Rhetorical Syntax-Driven Model for Speech Summarization COLING 2010 Speech Summarization Without Lexical Features for Mandarin Broadcast News NAACL 2007 Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus ACL 2000