Xu Zhang

67 papers · 2015–2026 · 18 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (17) 🐣 Hot Topic Early Bird

🗺️ Taxonomy Completionist (129) 🌍 Conference Polyglot (17) 🏃 Academic Marathon (11) 🌱 Topic Pioneer 🏆 Grand Slam 🗃️ Keyword Collector (288) 🚀 Conference Pioneer 💎 Century Club (55) 🔥 Unstoppable (8) 📈 Trend Setter ⚡ Prolific Year (12)

Conferences

AAAI (12) ACL (10) ICCV (6) CVPR (5) INTERSPEECH (4) ICML (4) ICLR (4) EMNLP (4) COLING (3) MICCAI (3) NIPS (3) ECCV (2) NSDI (2) EACL (1) IJCAI (1) L4DC (1) NAACL (1) WACV (1)

Top co-authors

Xiaojun Wan (6) Yue Wu (5) Deyu Zhou (4) Liang Guo (3) Zejie Liu (3) Xiguang Zheng (3) Xunjian Yin (3) Lianwu Chen (3) Bing Yu (3) Pradeep Natarajan (3)

Keywords

diffusion model (4) multimodal learning (4) large language model (3) vision-language model (3) federated learning (3) multimodal large language model (3) jailbreak attack (2) visual-language model (2) pre-trained language model (2) representation learning (2) video understanding (2) knowledge graph (2) contrastive learning (2) multi-task learning (2) image generation (2) text classification (2) image restoration (2) zero-shot learning (2) semantic segmentation (2) anomaly detection (2)

Papers

EvoNarrator: Modeling Scientific Evolution for Feasible Hypothesis Generation ACL 2026 Any2RSI: Controllable Remote Sensing Text-to-Image Generation via Any Control and Enriched Description AAAI 2026 Sortblock: Similarity-Aware Feature Reuse for Diffusion Model AAAI 2026 DeepInsert: Early Layer Bypass for Efficient and Performant Multimodal Understanding EACL 2026 Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models WACV 2026 MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive and MCP-Augmented Environments ACL 2026 ShadeEdit: A Utility-Preserving and Defense-Evasive Knowledge Manipulation Attack in Federated LLMs AAAI 2026 HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy ACL 2026 CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks ACL 2026 MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX AAAI 2026 Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching AAAI 2026 ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration AAAI 2026 RST-Guarder: Enhancing Long-Context Robustness for Safeguards via RST Parsing and Probabilistic Inference ACL 2026 Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach ICML 2025 SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning AAAI 2025 A Lightweight Sparse Interaction Network for Time Series Forecasting AAAI 2025 A General Knowledge Injection Framework for ICD Coding ACL 2025 MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency ACL 2025 AA-CLIP: Enhancing Zero-Shot Anomaly Detection via Anomaly-Aware CLIP CVPR 2025 DAMON: A Dialogue-Aware MCTS Framework for Jailbreaking Large Language Models EMNLP 2025 MMAG: Multimodal Learning for Mucus Anomaly Grading in Nasal Endoscopy via Semantic Attribute Prompting EMNLP 2025 Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization ICCV 2025 Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning ICCV 2025 SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies ICCV 2025 GeoILP: A Synthetic Dataset to Guide Large-Scale Rule Induction ICLR 2025 Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion ICML 2025 Generalization Performance of Ensemble Clustering: From Theory to Algorithm ICML 2025 Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation Prediction IJCAI 2025 SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training MICCAI 2025 UFO: A UI-Focused Agent for Windows OS Interaction NAACL 2025 Reduce Redundancy Then Rerank: Enhancing Code Summarization with a Novel Pipeline Framework COLING 2024 GRACE: Loss-Resilient Real-Time Video through Neural Codecs NSDI 2024 All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation NIPS 2024 DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models NIPS 2024 Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference ICLR 2024 Plug-In Diffusion Model for Sequential Recommendation AAAI 2024 MAdapter: A Better Interaction between Image and Language for Medical Image Segmentation MICCAI 2024 Noise Removed Inconsistency Activation Map for Unsupervised Registration of Brain Tumor MRI between Pre-operative and Follow-up Phases MICCAI 2024 DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly ECCV 2024 Negative Pre-aware for Noisy Cross-Modal Matching AAAI 2024 Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation ACL 2024 HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation ECCV 2024 FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory ICCV 2023 G3R: A Graph-Guided Generate-and-Rerank Framework for Complex and Cross-domain Text-to-SQL Generation ACL 2023 MIL-Decoding: Detoxifying Language Models at Token-Level via Multiple Instance Learning ACL 2023 Avoiding spurious correlations via logit correction ICLR 2023 Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection ICLR 2023 User-Controllable Arbitrary Style Transfer via Entropy Regularization AAAI 2023 Monaural Speech Separation Method Based on Recurrent Attention with Parallel Branches INTERSPEECH 2023 Top-k data selection via distributed sample quantile inference L4DC 2023 CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose CVPR 2023 An Empirical Study of Instruction-tuning Large Language Models in Chinese EMNLP 2023 Impairment Representation Learning for Speech Quality Assessment INTERSPEECH 2022 Byzantine-tolerant federated Gaussian process regression for streaming data NIPS 2022 Personalized Federated Learning via Variational Bayesian Inference ICML 2022 Complicate Then Simplify: A Novel Way to Explore Pre-trained Models for Text Classification COLING 2022 Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark CVPR 2022 Code Generation From Flowcharts with Texts: A Benchmark Dataset and An Approach EMNLP 2022 Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss INTERSPEECH 2021 A Causal U-Net Based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement INTERSPEECH 2021 Generalized Relation Learning with Semantic Correlation Awareness for Link Prediction AAAI 2021 SENSEI: Aligning Video Streaming Quality with Dynamic User Sensitivity NSDI 2021 Intra-Correlation Encoding for Chinese Sentence Intention Matching COLING 2020 Unsupervised Embedding Learning via Invariant and Spreading Instance Feature CVPR 2019 Learning Spread-Out Local Feature Descriptors ICCV 2017 Learning Discriminative and Transformation Covariant Local Feature Detectors CVPR 2017 Fast Orthogonal Projection Based on Kronecker Product ICCV 2015