Xu Zhang
67 papers · 2015–2026 · 18 top CS/AI conferences
Achievements
Academic Marathon (11)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (17)
Hot Topic Early Bird
Taxonomy Completionist (129)
Topic Pioneer
Grand Slam
Keyword Collector (288)
Conference Pioneer
Century Club (55)
Unstoppable (8)
Trend Setter
Prolific Year (12)
Conferences
AAAI (12)
ACL (10)
ICCV (6)
CVPR (5)
INTERSPEECH (4)
ICML (4)
ICLR (4)
EMNLP (4)
COLING (3)
MICCAI (3)
NIPS (3)
ECCV (2)
NSDI (2)
EACL (1)
IJCAI (1)
L4DC (1)
NAACL (1)
WACV (1)
Keywords
diffusion model (4)
multimodal learning (4)
large language model (3)
vision-language model (3)
federated learning (3)
multimodal large language model (3)
jailbreak attack (2)
visual-language model (2)
pre-trained language model (2)
representation learning (2)
video understanding (2)
knowledge graph (2)
contrastive learning (2)
multi-task learning (2)
image generation (2)
text classification (2)
image restoration (2)
zero-shot learning (2)
semantic segmentation (2)
anomaly detection (2)
Papers
EvoNarrator: Modeling Scientific Evolution for Feasible Hypothesis Generation
ACL 2026
Any2RSI: Controllable Remote Sensing Text-to-Image Generation via Any Control and Enriched Description
AAAI 2026
Sortblock: Similarity-Aware Feature Reuse for Diffusion Model
AAAI 2026
DeepInsert: Early Layer Bypass for Efficient and Performant Multimodal Understanding
EACL 2026
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models
WACV 2026
MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments
ACL 2026
ShadeEdit: A Utility-Preserving and Defense-Evasive Knowledge Manipulation Attack in Federated LLMs
AAAI 2026
HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy
ACL 2026
CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks
ACL 2026
MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX
AAAI 2026
Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching
AAAI 2026
ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration
AAAI 2026
RST-Guarder: Enhancing Long-Context Robustness for Safeguards via RST Parsing and Probabilistic Inference
ACL 2026
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach
ICML 2025
SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning
AAAI 2025
A Lightweight Sparse Interaction Network for Time Series Forecasting
AAAI 2025
A General Knowledge Injection Framework for ICD Coding
ACL 2025
MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency
ACL 2025
AA-CLIP: Enhancing Zero-Shot Anomaly Detection via Anomaly-Aware CLIP
CVPR 2025
DAMON: A Dialogue-Aware MCTS Framework for Jailbreaking Large Language Models
EMNLP 2025
MMAG: Multimodal Learning for Mucus Anomaly Grading in Nasal Endoscopy via Semantic Attribute Prompting
EMNLP 2025
Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization
ICCV 2025
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
ICCV 2025
SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies
ICCV 2025
GeoILP: A Synthetic Dataset to Guide Large-Scale Rule Induction
ICLR 2025
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
ICML 2025
Generalization Performance of Ensemble Clustering: From Theory to Algorithm
ICML 2025
Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation Prediction
IJCAI 2025
SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training
MICCAI 2025
UFO: A UI-Focused Agent for Windows OS Interaction
NAACL 2025
Reduce Redundancy Then Rerank: Enhancing Code Summarization with a Novel Pipeline Framework
COLING 2024
GRACE: Loss-Resilient Real-Time Video through Neural Codecs
NSDI 2024
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation
NIPS 2024
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models
NIPS 2024
Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference
ICLR 2024
Plug-In Diffusion Model for Sequential Recommendation
AAAI 2024
MAdapter: A Better Interaction between Image and Language for Medical Image Segmentation
MICCAI 2024
Noise Removed Inconsistency Activation Map for Unsupervised Registration of Brain Tumor MRI between Pre-operative and Follow-up Phases
MICCAI 2024
DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly
ECCV 2024
Negative Pre-aware for Noisy Cross-Modal Matching
AAAI 2024
Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation
ACL 2024
HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation
ECCV 2024
FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory
ICCV 2023
G3R: A Graph-Guided Generate-and-Rerank Framework for Complex and Cross-domain Text-to-SQL Generation
ACL 2023
MIL-Decoding: Detoxifying Language Models at Token-Level via Multiple Instance Learning
ACL 2023
Avoiding spurious correlations via logit correction
ICLR 2023
Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection
ICLR 2023
User-Controllable Arbitrary Style Transfer via Entropy Regularization
AAAI 2023
Monaural Speech Separation Method Based on Recurrent Attention with Parallel Branches
INTERSPEECH 2023
Top-k data selection via distributed sample quantile inference
L4DC 2023
CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose
CVPR 2023
An Empirical Study of Instruction-tuning Large Language Models in Chinese
EMNLP 2023
Impairment Representation Learning for Speech Quality Assessment
INTERSPEECH 2022
Byzantine-tolerant federated Gaussian process regression for streaming data
NIPS 2022
Personalized Federated Learning via Variational Bayesian Inference
ICML 2022
Complicate Then Simplify: A Novel Way to Explore Pre-trained Models for Text Classification
COLING 2022
Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark
CVPR 2022
Code Generation From Flowcharts with Texts: A Benchmark Dataset and An Approach
EMNLP 2022
Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss
INTERSPEECH 2021
A Causal U-Net Based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement
INTERSPEECH 2021
Generalized Relation Learning with Semantic Correlation Awareness for Link Prediction
AAAI 2021
SENSEI: Aligning Video Streaming Quality with Dynamic User Sensitivity
NSDI 2021
Intra-Correlation Encoding for Chinese Sentence Intention Matching
COLING 2020
Unsupervised Embedding Learning via Invariant and Spreading Instance Feature
CVPR 2019
Learning Spread-Out Local Feature Descriptors
ICCV 2017
Learning Discriminative and Transformation Covariant Local Feature Detectors
CVPR 2017
Fast Orthogonal Projection Based on Kronecker Product
ICCV 2015