Zheng Zhang
211 papers · 2014–2026 · 21 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (27) · Keyword Pioneer · Interdisciplinary Bridge · Renaissance Researcher (6) · Conference Polyglot (20)
Renaissance Researcher (6)
Interdisciplinary Bridge
Taxonomy Completionist (27)
Keyword Trendsetter Combo (4)
Conference Loyalist (20)
Keyword Champion
Dynamic Duo (31)
Grand Slam
Mega-Team (22)
Triple Crown
Deep Specialist (21)
Topic Evolution
Trend Setter
Century Club (199)
Keyword Collector (73)
The Questioner (3)
Conference Pioneer
Prolific Year (42)
Unstoppable (13)
Conferences
AAAI (29)
NIPS (27)
ACL (26)
EMNLP (25)
CVPR (22)
ICCV (19)
ECCV (14)
ICLR (12)
IJCAI (9)
COLING (5)
ICML (5)
NAACL (5)
AISTATS (2)
IJCNLP (2)
JMLR (2)
WACV (2)
EACL (1)
INTERSPEECH (1)
MICCAI (1)
AACL (1)
UAI (1)
Research topics
Keywords
large language model (31)
representation learning (19)
object detection (14)
graph neural network (13)
contrastive learning (13)
transfer learning (11)
self-supervised learning (11)
model compression (10)
attention mechanism (9)
data augmentation (9)
semantic segmentation (9)
vision transformer (8)
reinforcement learning (7)
masked image modeling (7)
image classification (7)
vision-language model (7)
information retrieval (6)
zero-shot learning (6)
domain adaptation (6)
unsupervised learning (5)
Papers
FourierPET: Deep Fourier-based Unrolled Network for Low-count PET Reconstruction
AAAI 2026
TDSS: Task Dynamic-Synergistic Skill Adaptation for Boosting Efficient and Scalable Multi-Task Learning in Dense Visual Prediction
AAAI 2026
Themis: Automated Constraint-Aware Test Synthesis Framework for Code Reinforcement Learning
AAAI 2026
Controllable Contamination Detection for Reliable LLM Evaluation with Statistical Guarantees
ACL 2026
Bolster Hallucination Detection via Prompt-Guided Data Augmentation
AAAI 2026
Coverage-Constrained Human-AI Cooperation with Multiple Experts
AAAI 2026
LongTutor: Benchmarking Large Language Models for Long-term Personalized Tutoring
ACL 2026
Cross-modal Prompting for Balanced Incomplete Multi-modal Emotion Recognition
AAAI 2026
SinBasis Networks: Matrix-Equivalent Feature Extraction for Wave-Like Optical Spectrograms
AAAI 2026
MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation
AAAI 2026
AutoPP: Towards Automated Product Poster Generation and Optimization
AAAI 2026
FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression
EACL 2026
AuthGuard: Generalizable Deepfake Detection via Language Guidance
WACV 2026
UniRAG: Unified Query Understanding Method for Retrieval Augmented Generation
ACL 2025
PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization
ACL 2025
QA Analysis in Medical and Legal Domains: A Survey of Data Augmentation in Low-Resource Settings
ACL 2025
Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning
ACL 2025
Wanda++: Pruning Large Language Models via Regional Gradients
ACL 2025
QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
EMNLP 2025
StableDepth: Scene-Consistent and Scale-Invariant Monocular Depth
ICCV 2025
PolaFormer: Polarity-aware Linear Attention for Vision Transformers
ICLR 2025
CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
EMNLP 2025
Projection Pursuit Density Ratio Estimation
ICML 2025
CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models
IJCAI 2025
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach
ICLR 2025
Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models
EMNLP 2025
Connector-S: A Survey of Connectors in Multi-modal Large Language Models
IJCAI 2025
Multi-Document Event Extraction Using Large and Small Language Models
EMNLP 2025
A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation
CVPR 2025
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens
ICLR 2025
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
ICLR 2025
A Statistical Approach for Controlled Training Data Detection
ICLR 2025
Portcullis: A Scalable and Verifiable Privacy Gateway for Third-Party LLM Inference
AAAI 2025
OT-StainNet: Optimal Transport Driven Semantic Matching for Weakly Paired H&E-to-IHC Stain Transfer
AAAI 2025
Transferable Adversarial Face Attack with Text Controlled Attribute
AAAI 2025
Distribution-Driven Dense Retrieval: Modeling Many-to-One Query-Document Relationship
AAAI 2025
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems
AAAI 2025
LEGEND: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
AAAI 2025
Intent Oriented Contrastive Learning for Sequential Recommendation
AAAI 2025
DeepMIM: Deep Supervision for Masked Image Modeling
WACV 2025
Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing
CVPR 2025
PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models
IJCAI 2025
Causal Effect of Functional Treatment
JMLR 2025
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
COLING 2025
GRAG: Graph Retrieval-Augmented Generation
NAACL 2025
Residual Reweighted Conformal Prediction for Graph Neural Networks
UAI 2025
Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models
ACL 2025
Learning to Select In-Context Demonstration Preferred by Large Language Model
ACL 2025
MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
EMNLP 2025
GraphNarrator: Generating Textual Explanations for Graph Neural Networks
ACL 2025
PQR: Improving Dense Retrieval via Potential Query Modeling
ACL 2025
Zero-1-to-3: Domain-Level Zero-Shot Cognitive Diagnosis via One Batch of Early-Bird Students towards Three Diagnostic Objectives
AAAI 2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
NIPS 2024
Masked Structural Growth for 2x Faster Language Model Pre-training
ICLR 2024
DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training
ICLR 2024
Self-Supervised Heterogeneous Graph Learning: a Homophily and Heterogeneity View
ICLR 2024
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
NIPS 2024
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs
NIPS 2024
Can Language Models Learn to Skip Steps?
NIPS 2024
Towards Accurate and Fair Cognitive Diagnosis via Monotonic Data Augmentation
NIPS 2024
Exploiting Descriptive Completeness Prior for Cross Modal Hashing with Incomplete Labels
NIPS 2024
TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs
NIPS 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
NIPS 2024
CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
NIPS 2024
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
NIPS 2024
FlightBERT++: A Non-autoregressive Multi-Horizon Flight Trajectory Prediction Framework
AAAI 2024
CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework
AAAI 2024
BiPFT: Binary Pre-trained Foundation Transformer with Low-Rank Estimation of Binarization Residual Polynomials
AAAI 2024
CariesXrays: Enhancing Caries Detection in Hospital-Scale Panoramic Dental X-rays via Feature Pyramid Contrastive Learning
AAAI 2024
Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models
ACL 2024
LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
NAACL 2024
EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs
COLING 2024
Medical Cross-Modal Prompt Hashing with Robust Noisy Correspondence Learning
MICCAI 2024
Adaptive Slot Attention: Object Discovery with Dynamic Slot Number
CVPR 2024
Segment and Caption Anything
CVPR 2024
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
CVPR 2024
Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios
CVPR 2024
Enhancing Cross-Modal Retrieval via Visual-Textual Prompt Hashing
IJCAI 2024
Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting
ECCV 2024
Towards Reliable Advertising Image Generation Using Human Feedback
ECCV 2024
PSALM: Pixelwise Segmentation with Large Multi-modal Model
ECCV 2024
Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification
ECCV 2024
Learning to Complement and to Defer to Multiple Users
ECCV 2024
GroupCover: A Secure, Efficient and Scalable Inference Framework for On-device Model Protection based on TEEs
ICML 2024
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
EMNLP 2024
Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models
EMNLP 2024
Knowledge-Centric Hallucination Detection
EMNLP 2024
ECON: On the Detection and Resolution of Evidence Conflicts
EMNLP 2024
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
EMNLP 2024
SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
ICML 2024
Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
ICML 2024
Collaborative Cognitive Diagnosis with Disentangled Representation Learning for Learner Modeling
NIPS 2024
Unified Lexical Representation for Interpretable Visual-Language Alignment
NIPS 2024
CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation
ACL 2023
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition
CVPR 2023
Unsupervised Open-Vocabulary Object Localization in Videos
ICCV 2023
FairLISA: Fair User Modeling with Limited Sensitive Attributes Information
NIPS 2023
Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First
NIPS 2023
Evaluating Open-QA Evaluation
NIPS 2023
Coarse-to-Fine Amodal Segmentation with Shape Prior
ICCV 2023
Improving CLIP Fine-tuning Performance
ICCV 2023
Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval
ICCV 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
ICCV 2023
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation
ICCV 2023
KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection
ICCV 2023
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus
EMNLP 2023
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
EMNLP 2023
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding
EMNLP 2023
Building Multi-domain Dialog State Trackers from Single-domain Dialogs
EMNLP 2023
Exploiting Abstract Meaning Representation for Open-Domain Question Answering
ACL 2023
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation
ACL 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
ACL 2023
Dual Cache for Long Document Neural Coreference Resolution
ACL 2023
An AMR-based Link Prediction Approach for Document-level Event Argument Extraction
ACL 2023
Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations
EMNLP 2023
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
EMNLP 2023
Revealing the Dark Secrets of Masked Image Modeling
CVPR 2023
DETR Does Not Need Multi-Scale or Locality Design
ICCV 2023
Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations
ACL 2023
Quantization-aware and Tensor-compressed Training of Transformers for Natural Language Understanding
INTERSPEECH 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-Trained Models
CVPR 2023
On Data Scaling in Masked Image Modeling
CVPR 2023
Side Adapter Network for Open-Vocabulary Semantic Segmentation
CVPR 2023
Object-Centric Multiple Object Tracking
ICCV 2023
KPT: Keyword-Guided Pre-training for Grounded Dialog Generation
AAAI 2023
Bridging the Gap to Real-World Object-Centric Learning
ICLR 2023
Unsupervised Deep Subgraph Anomaly Detection (Extended Abstract)
IJCAI 2023
Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension
ACL 2022
Vega-MT: The JD Explore Academy Machine Translation System for WMT22
EMNLP 2022
Video Swin Transformer
CVPR 2022
It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books
ACL 2022
Why Propagate Alone? Parallel Use of Labels and Features on Graphs
ICLR 2022
Could Giant Pre-trained Image Models Extract Universal Representations?
NIPS 2022
Interactive Information Extraction by Semantic Information Graph
IJCAI 2022
Inductive Relation Prediction Using Analogy Subgraph Embeddings
ICLR 2022
A Simple Approach and Benchmark for 21,000-Category Object Detection
ECCV 2022
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model
ECCV 2022
PSS: Progressive Sample Selection for Open-World Visual Representation Learning
ECCV 2022
Learning Enhanced Representation for Tabular Data via Neighborhood Propagation
NIPS 2022
SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis
NAACL 2022
Swin Transformer V2: Scaling Up Capacity and Resolution
CVPR 2022
SimMIM: A Simple Framework for Masked Image Modeling
CVPR 2022
Self-supervised Amodal Video Object Segmentation
NIPS 2022
Self-Healing Robust Neural Networks via Closed-Loop Control
JMLR 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
NIPS 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
EMNLP 2022
Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours
EMNLP 2022
Dialogue Meaning Representation for Task-Oriented Dialogue Systems
EMNLP 2022
Conversation Disentanglement with Bi-Level Contrastive Learning
EMNLP 2022
DORE: Document Ordered Relation Extraction based on Generative Framework
EMNLP 2022
A Unified Dialogue User Simulator for Few-shot Data Augmentation
EMNLP 2022
A Unified Generative Framework for Various NER Subtasks
ACL 2021
Representation Learning on Spatial Networks
NIPS 2021
Bootstrap Your Object Detector via Mixed Training
NIPS 2021
GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction
NIPS 2021
Unified Tensor Framework for Incomplete Multi-view Clustering and Missing-view Inferring
AAAI 2021
Partial-Label and Structure-constrained Deep Coupled Factorization Network
AAAI 2021
A Unified Generative Framework for Aspect-based Sentiment Analysis
ACL 2021
Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings
AISTATS 2021
Bayesian Inference with Certifiable Adversarial Robustness
AISTATS 2021
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
CVPR 2021
Prototype-Supervised Adversarial Network for Targeted Attack of Deep Hashing
CVPR 2021
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections
EMNLP 2021
Learning Hierarchical Graph Neural Networks for Image Clustering
ICCV 2021
Group-Free 3D Object Detection via Transformers
ICCV 2021
Semantics Disentangling for Generalized Zero-Shot Learning
ICCV 2021
Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows
ICCV 2021
End-to-End Semi-Supervised Object Detection With Soft Teacher
ICCV 2021
Towards Robust Neural Networks via Close-loop Control
ICLR 2021
Graph Neural Networks Inspired by Classical Iterative Algorithms
ICML 2021
A Unified Generative Framework for Aspect-based Sentiment Analysis
IJCNLP 2021
A Unified Generative Framework for Various NER Subtasks
IJCNLP 2021
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation
COLING 2020
CoLAKE: Contextualized Language and Knowledge Embedding
COLING 2020
A Closer Look at Local Aggregation Operators in Point Cloud Analysis
ECCV 2020
TL-Explorer: A Digital Humanities Tool for Mapping and Analyzing Translated Literature
COLING 2020
RepPoints v2: Verification Meets Regression for Object Detection
NIPS 2020
Deep Latent Low-Rank Fusion Network for Progressive Subspace Discovery
IJCAI 2020
CDIMC-net: Cognitive Deep Incomplete Multi-view Clustering Network
IJCAI 2020
Disentangled Non-local Neural Networks
ECCV 2020
Region Graph Embedding Network for Zero-Shot Learning
ECCV 2020
Negative Margin Matters: Understanding Margin in Few-shot Classification
ECCV 2020
Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation
ECCV 2020
Parametric Instance Classification for Unsupervised Visual Feature learning
NIPS 2020
MovieChats: Chat like Humans in a Closed Domain
EMNLP 2020
Learning from the Past: Continual Meta-Learning with Bayesian Graph Neural Networks
AAAI 2020
Multi-Scale Self-Attention for Text Classification
AAAI 2020
Web-Supervised Network with Softly Update-Drop Training for Fine-Grained Visual Classification
AAAI 2020
Learning Goal-oriented Dialogue Policy with opposite Agent Awareness
AACL 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
ACL 2020
Local Relation Networks for Image Recognition
ICCV 2019
Scalable Block-Diagonal Locality-Constrained Projective Dictionary Learning
IJCAI 2019
Attentive Region Embedding Network for Zero-Shot Learning
CVPR 2019
Star-Transformer
NAACL 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ACL 2019
SADIH: Semantic-Aware DIscrete Hashing
AAAI 2019
Unified Embedding Alignment with Missing Views Inferring for Incomplete Multi-View Clustering
AAAI 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
ICCV 2019
Spatial-Temporal Relation Networks for Multi-Object Tracking
ICCV 2019
Highly-Economized Multi-View Binary Compression for Scalable Image Clustering
ECCV 2018
Efficient Generation and Processing of Word Co-occurrence Networks Using corpus2graph
NAACL 2018
GNEG: Graph-Based Negative Sampling for word2vec
ACL 2018
Relation Networks for Object Detection
CVPR 2018
Loss Functions for Multiset Prediction
NIPS 2018
Saliency-based Sequential Image Attention with Multiset Prediction
NIPS 2017
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis
CVPR 2016
Multi-Oriented Text Detection With Fully Convolutional Networks
CVPR 2016
The Application of Two-Level Attention Models in Deep Convolutional Neural Network for Fine-Grained Image Classification
CVPR 2015
Symmetry-Based Text Line Detection in Natural Scenes
CVPR 2015
Multiple Granularity Descriptors for Fine-Grained Categorization
ICCV 2015
Attentional Neural Network: Feature Selection Using Cognitive Feedback
NIPS 2014