Zhaozhuo Xu

47 papers · 2019–2026 · 11 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11) 🐣 Hot Topic Early Bird

🧭 Keyword Pioneer 🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (8) 🤝 Dynamic Duo (15) 🏆 Keyword Champion 🏆 Grand Slam 🔬 Deep Specialist (16) 🧬 Topic Evolution ⚡ Prolific Year (16) ❓ The Questioner (2) 🗃️ Keyword Collector (165) 🔥 Unstoppable (5) 💎 Century Club (44)

Conferences

EMNLP (12) NIPS (11) ICML (9) ACL (5) ICLR (2) NAACL (2) UAI (2) AAAI (1) AISTATS (1) IJCNLP (1) OSDI (1)

Top co-authors

Anshumali Shrivastava (15) Zirui Liu (11) Denghui Zhang (9) Shaochen Zhong (7) Kaixiong Zhou (7) Beidi Chen (6) Xia Hu (6) Xiao Huang (5) Tianyi Zhang (5) Zhao Song (4)

Research topics

Education (1)

Keywords

large language model (12) model compression (6) language model (5) maximum inner product search (5) inference optimization (3) representation learning (3) memory efficiency (3) retrieval-augmented generation (2) influence function (2) kv cache compression (2) contrastive learning (2) risk management (2) efficient inference (2) inference efficiency (2) approximate nearest neighbor (2) computational efficiency (2) responsible ai (2) model serving (2) efficient computing (2) ai safety (2)

Papers

Query-Aware Knowledge Retrieval via Hyperbolic Structuring ACL 2026 Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks ACL 2026 Collision to Cognition: Hash-Driven Graph Construction for Efficient RAG ACL 2026 ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization OSDI 2025 Compression-Aware Computing for Scalable and Sustainable AI AAAI 2025 Taming Language Models for Text-attributed Graph Learning with Decoupled Aggregation ACL 2025 DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic EMNLP 2025 Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context EMNLP 2025 Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly EMNLP 2025 Profiling LLM’s Copyright Infringement Risks under Adversarial Persuasive Prompting EMNLP 2025 Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity ICLR 2025 Retrieval Augmented Zero-Shot Enzyme Generation for Specified Substrate ICML 2025 Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation ICML 2025 Position: Iterative Online-Offline Joint Optimization is Needed to Manage Complex LLM Copyright Risks ICML 2025 ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Party LLM Data Valuation NAACL 2025 LLMs and Copyright Risks: Benchmarks and Mitigation Approaches NAACL 2025 Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory UAI 2025 Do LLMs Know to Respect Copyright Notice? EMNLP 2024 ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency EMNLP 2024 TensorOpera Router: A Multi-Model Router for Efficient LLM Inference EMNLP 2024 QUEST: Efficient Extreme Multi-Label Text Classification with Large Language Models on Commodity Hardware EMNLP 2024 KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches EMNLP 2024 In Defense of Structural Sparse Adapters for Concurrent LLM Serving EMNLP 2024 Knowledge Graphs Can be Learned with Just Intersection Features ICML 2024 KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache ICML 2024 TVE: Learning Meta-attribution for Transferable Vision Explainer ICML 2024 Soft Prompt Recovers Compressed LLMs, Transferably ICML 2024 KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization NIPS 2024 SIRIUS : Contexual Sparisty with Correction for Efficient LLMs NIPS 2024 NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention NIPS 2024 FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making NIPS 2024 GNNs Also Deserve Editing, and They Need It More Than Once ICML 2024 Token-wise Influential Training Data Retrieval for Large Language Models ACL 2024 Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model NIPS 2023 One-Pass Distribution Sketch for Measuring Data Heterogeneity in Federated Learning NIPS 2023 Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time NIPS 2023 A Tale of Two Efficient Value Iteration Algorithms for Solving Linear MDPs with Large Action Space AISTATS 2023 Graph Self-supervised Learning via Proximity Distribution Minimization UAI 2023 DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks ICML 2022 Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification EMNLP 2022 Locality Sensitive Teaching NIPS 2021 Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures NIPS 2021 MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training ICLR 2021 Raw Nav-merge Seismic Data to Subsurface Properties with MLP based Multi-Modal Information Unscrambler NIPS 2021 On Efficient Retrieval of Top Similarity Vectors EMNLP 2019 On Efficient Retrieval of Top Similarity Vectors IJCNLP 2019 Möbius Transformation for Fast Inner Product Search on Graph NIPS 2019