Jian Zhang
168 papers · 2000–2026 · 18 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (20)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (6)
Conference Polyglot (18)
Hot Topic Early Bird
Conference Loyalist (25)
Dynamic Duo (13)
Topic Evolution
Grand Slam
Deep Specialist (21)
Keyword Champion
Trend Setter
Conference Pioneer
Unstoppable (8)
Prolific Year (32)
Century Club (155)
Keyword Collector (53)
The Questioner (2)
Conferences
CVPR (38)
AAAI (32)
ICCV (17)
NIPS (13)
ICLR (12)
ECCV (12)
IJCAI (10)
ACL (9)
EMNLP (6)
COLING (5)
CORL (3)
MICCAI (3)
AISTATS (2)
ICML (2)
INTERSPEECH (1)
JMLR (1)
NAACL (1)
WACV (1)
Research topics
Keywords
diffusion model (13)
image reconstruction (10)
large language model (9)
spike camera (8)
convolutional neural network (8)
semantic segmentation (7)
graph neural network (6)
image restoration (6)
domain adaptation (6)
point cloud (5)
knowledge distillation (5)
novel view synthesis (4)
image super-resolution (4)
3d gaussian splatting (4)
3d reconstruction (4)
unsupervised learning (4)
transfer learning (4)
few-shot learning (4)
image generation (4)
semi-supervised learning (4)
Papers
Beyond the Panorama: Training-Free Hierarchical Perception-Reasoning for Fine-Grained Vision in MLLMs
ACL 2026
FactVerse: A Benchmark for Factual Consistency in Interleaved Image–Text Generation
ACL 2026
MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning
ACL 2026
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
ACL 2026
MAPS: Multi-Agent Personality Shaping for Collaborative Reasoning
AAAI 2026
MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization
AAAI 2026
LLM-Guided Quantified SMT Solving over Uninterpreted Functions
AAAI 2026
VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning
AAAI 2026
PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer
AAAI 2026
Dissecting Failure Dynamics in Large Language Model Reasoning
ACL 2026
Decomposing and Composing: Towards Efficient Vision-Language Continual Learning via Rank-1 Expert Pool in a Single LoRA
AAAI 2026
CoreGaze: Core Subgraph-Driven Visual Gaze Diffusion for Training-Free Referring Multimodal Large Language Models
ACL 2026
LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models
AAAI 2026
Humanoid Policy ~ Human Policy
CORL 2025
RadKAM: Attention-Driven Kolmogorov-Arnold Model for Automatic Radiation-Induced Lymphopenia Prediction by Multimodal Learning
MICCAI 2025
GA-SAM: Geometry-Aware SAM Adaptation with Sparse Annotation-Driven Point Cloud Completion
MICCAI 2025
Fusing Dual Encoders: Single-source Domain Generalization with Extremely Few Annotations
MICCAI 2025
SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography
ICLR 2025
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
ICLR 2025
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
ICLR 2025
AutoG: Towards automatic graph construction from tabular data
ICLR 2025
PointGAC: Geometric-Aware Codebook for Masked Point Modeling
ICCV 2025
Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints
ICCV 2025
Recoverable Facial Identity Protection via Adaptive Makeup Transfer Adversarial Attacks
AAAI 2025
Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization
AAAI 2025
A Complete Algorithm for Optimization Modulo Nonlinear Real Arithmetic
AAAI 2025
C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory Prediction
AAAI 2025
Revisiting Interpolation for Noisy Label Correction
AAAI 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
ICCV 2025
Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild
ICCV 2025
Efficient Universal Goal Hijacking with Semantics-guided Prompt Organization
ACL 2025
LLM-Driven Completeness and Consistency Evaluation for Cultural Heritage Data Augmentation in Cross-Modal Retrieval
EMNLP 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
EMNLP 2025
ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
EMNLP 2025
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
CVPR 2025
OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction
CVPR 2025
Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
COLING 2025
Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering
CVPR 2025
Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation
CVPR 2025
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
CVPR 2025
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
CVPR 2025
Adversarial Diffusion Compression for Real-World Image Super-Resolution
CVPR 2025
Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization
CVPR 2025
Taste More, Taste Better: Diverse Data and Strong Model Boost Semi-Supervised Crowd Counting
CVPR 2025
Robot Operating Home Appliances by Reading User Manuals
CORL 2025
Score-CDM: Score-Weighted Convolutional Diffusion Model for Multivariate Time Series Imputation
IJCAI 2024
ReVideo: Remake a Video with Motion and Content Control
NIPS 2024
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
NIPS 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
NIPS 2024
GS-Hider: Hiding Messages into 3D Gaussian Splatting
NIPS 2024
HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting
NIPS 2024
Joint Demosaicing and Denoising for Spike Camera
AAAI 2024
Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training
AAAI 2024
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models
AAAI 2024
Optical Flow for Spike Camera with Hierarchical Spatial-Temporal Spike Fusion
AAAI 2024
Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck
AAAI 2024
Expressive Multi-Agent Communication via Identity-Aware Learning
AAAI 2024
A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction
COLING 2024
360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
CVPR 2024
Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike Fluctuations
CVPR 2024
Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation
CVPR 2024
EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
CVPR 2024
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
CVPR 2024
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing
CVPR 2024
Super-Resolution Reconstruction from Bayer-Pattern Spike Streams
CVPR 2024
KPConvX: Modernizing Kernel Point Convolution with Kernel Attention
CVPR 2024
OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model
ECCV 2024
Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization
ECCV 2024
The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation
ECCV 2024
Towards compact reversible image representations for neural style transfer
ECCV 2024
Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training
EMNLP 2024
BadEdit: Backdooring Large Language Models by Model Editing
ICLR 2024
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
ICLR 2024
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
ICLR 2024
NetInfoF Framework: Measuring and Exploiting Network Usable Information
ICLR 2024
SaSDim: Self-Adaptive Noise Scaling Diffusion Model for Spatial Time Series Imputation
IJCAI 2024
Implicit Neural Representation for Cooperative Low-light Image Enhancement
ICCV 2023
Empirical Study of Zero-Shot NER with ChatGPT
EMNLP 2023
Suggesting Variable Order for Cylindrical Algebraic Decomposition via Reinforcement Learning
NIPS 2023
Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation
ICCV 2023
DomainAdaptor: A Novel Approach to Test-time Adaptation
ICCV 2023
A Unified Continual Learning Framework with General Parameter-Efficient Tuning
ICCV 2023
Generalizable Decision Boundaries: Dualistic Meta-Learning for Open Set Domain Generalization
ICCV 2023
Overlap-Guided Gaussian Mixture Models for Point Cloud Registration
WACV 2023
A Study on Visualization of Voiceprint Feature
INTERSPEECH 2023
Null-Space Diffusion Sampling for Zero-Shot Point Cloud Completion
IJCAI 2023
HVTSurv: Hierarchical Vision Transformer for Patient-Level Survival Prediction from Whole Slide Image
AAAI 2023
GAN Prior Based Null-Space Learning for Consistent Super-resolution
AAAI 2023
Less Is More Important: An Attention Module Guided by Probability Density Function for Convolutional Neural Networks
AAAI 2023
Learning to Super-resolve Dynamic Scenes for Neuromorphic Spike Camera
AAAI 2023
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model
ICLR 2023
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
CVPR 2023
Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning
CVPR 2023
Optimization-Inspired Cross-Attention Transformer for Compressive Sensing
CVPR 2023
Multi-Agent Automated Machine Learning
CVPR 2023
Large-Capacity and Flexible Video Steganography via Invertible Neural Network
CVPR 2023
Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration
CVPR 2023
Knowledge-Constrained Answer Generation for Open-Ended Video Question Answering
AAAI 2023
Can Graph Neural Networks Learn to Solve the MaxSAT Problem? (Student Abstract)
AAAI 2023
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
ICCV 2023
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography
NIPS 2023
Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation
CVPR 2022
Frequency Domain Model Augmentation for Adversarial Attack
ECCV 2022
Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation
ECCV 2022
Metric Learning Based Interactive Modulation for Real-World Super-Resolution
ECCV 2022
Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection
ECCV 2022
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
ECCV 2022
MVDG: A Unified Multi-View Framework for Domain Generalization
ECCV 2022
Deep Generalized Unfolding Networks for Image Restoration
CVPR 2022
HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging
CVPR 2022
Image Disentanglement Autoencoder for Steganography Without Embedding
CVPR 2022
Robust Invertible Image Steganography
CVPR 2022
Word Level Robustness Enhancement: Fight Perturbation with Perturbation
AAAI 2022
Panini-Net: GAN Prior Based Degradation-Aware Feature Interpolation for Face Restoration
AAAI 2022
Unpaired Multi-Domain Stain Transfer for Kidney Histopathological Images
AAAI 2022
Matching on Sets: Conquer Occluded Person Re-identification Without Alignment
AAAI 2021
TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification
NIPS 2021
Super Resolve Dynamic Scene From Continuous Spike Streams
ICCV 2021
Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream
CVPR 2021
Webly Supervised Fine-Grained Recognition: Benchmark Datasets and an Approach
ICCV 2021
Dense Deep Unfolding Network With 3D-CNN Prior for Snapshot Compressive Imaging
ICCV 2021
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching
IJCAI 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
ICML 2021
PTN: A Poisson Transfer Network for Semi-supervised Few-shot Learning
AAAI 2021
Jo-SRC: A Contrastive Approach for Combating Noisy Labels
CVPR 2021
Dynamic Attentive Graph Learning for Image Restoration
ICCV 2021
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation
CVPR 2021
Adversarial AutoAugment
ICLR 2020
Feature-Metric Registration: A Fast Semi-Supervised Approach for Robust Point Cloud Registration Without Correspondences
CVPR 2020
Contextual Embeddings: When Are They Worth It?
ACL 2020
Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development
AAAI 2020
Measuring and Improving the Use of Graph Information in Graph Neural Networks
ICLR 2020
Face Anti-Spoofing via Disentangled Representation Learning
ECCV 2020
AutoBSS: An Efficient Algorithm for Block Stacking Style Search
NIPS 2020
Field-wise Learning for Multi-field Categorical Data
NIPS 2020
A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification
IJCAI 2020
A Spatial Missing Value Imputation Method for Multi-view Urban Statistical Data
IJCAI 2020
Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer
IJCAI 2020
Adversarial Domain Adaptation with Domain Mixup
AAAI 2020
Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks
CVPR 2019
Variational Convolutional Neural Network Pruning
CVPR 2019
Scene Text Recognition from Two-Dimensional Perspective
AAAI 2019
Worst Cases Policy Gradients
CORL 2019
Variational Few-Shot Learning
ICCV 2019
Low-Precision Random Fourier Features for Memory-constrained Kernel Approximation
AISTATS 2019
Approximating Integer Solution Counting via Space Quantification for Linear Constraints
IJCAI 2019
Solving the Satisfiability Problem of Modal Logic S5 Guided by Graph Coloring
IJCAI 2019
On the Downstream Performance of Compressed Word Embeddings
NIPS 2019
Leveraging Heterogeneous Auxiliary Tasks to Assist Crowd Counting
CVPR 2019
Extracting Privileged Information from Untagged Corpora for Classifier Learning
IJCAI 2018
ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing
CVPR 2018
Goal-Oriented Visual Question Generation via Intermediate Rewards
ECCV 2018
Fine-Grained Video Captioning for Sports Narrative
CVPR 2018
Natural Language Inference over Interaction Space
ICLR 2018
Structured Control Nets for Deep Reinforcement Learning
ICML 2018
SQuAD: 100,000+ Questions for Machine Comprehension of Text
EMNLP 2016
Topic-Informed Neural Machine Translation
COLING 2016
Fast Gated Neural Domain Adaptation: Language Model as a Case Study
COLING 2016
Higher-Order Inference for Multi-Class Log-Supermodular Models
ICCV 2015
Image Denoising via Adaptive Soft-Thresholding Based on Non-Local Samples
CVPR 2015
On Linearly Constrained Minimum Variance Beamforming
JMLR 2015
Message Passing Inference for Large Scale Graphical Models with High Order Potentials
NIPS 2014
Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors
ICCV 2013
Multiple Instance Learning on Structured Data
NIPS 2011
The Group Dantzig Selector
AISTATS 2010
A Rhetorical Syntax-Driven Model for Speech Summarization
COLING 2010
Speech Summarization Without Lexical Features for Mandarin Broadcast News
NAACL 2007
Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus
ACL 2000