Zhaozhuo Xu
47 papers · 2019–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (11) π£ Hot Topic Early Bird
π§
Keyword Pioneer
π
Cross-Pollinator
(7)
π
Renaissance Researcher
(8)
π€
Dynamic Duo
(15)
π
Keyword Champion
π
Grand Slam
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
β‘
Prolific Year
(16)
β
The Questioner
(2)
ποΈ
Keyword Collector
(165)
π₯
Unstoppable
(5)
π
Century Club
(44)
Conferences
EMNLP (12)
NIPS (11)
ICML (9)
ACL (5)
ICLR (2)
NAACL (2)
UAI (2)
AAAI (1)
AISTATS (1)
IJCNLP (1)
OSDI (1)
Top co-authors
Research topics
Keywords
large language model
(12)
model compression
(6)
language model
(5)
maximum inner product search
(5)
inference optimization
(3)
representation learning
(3)
memory efficiency
(3)
retrieval-augmented generation
(2)
influence function
(2)
kv cache compression
(2)
contrastive learning
(2)
risk management
(2)
efficient inference
(2)
inference efficiency
(2)
approximate nearest neighbor
(2)
computational efficiency
(2)
responsible ai
(2)
model serving
(2)
efficient computing
(2)
ai safety
(2)
Papers
Query-Aware Knowledge Retrieval via Hyperbolic Structuring
ACL 2026
Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks
ACL 2026
Collision to Cognition: Hash-Driven Graph Construction for Efficient RAG
ACL 2026
ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization
OSDI 2025
Compression-Aware Computing for Scalable and Sustainable AI
AAAI 2025
Taming Language Models for Text-attributed Graph Learning with Decoupled Aggregation
ACL 2025
DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic
EMNLP 2025
Rescorla-Wagner Steering of LLMs for Undesired Behaviors over Disproportionate Inappropriate Context
EMNLP 2025
Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly
EMNLP 2025
Profiling LLMβs Copyright Infringement Risks under Adversarial Persuasive Prompting
EMNLP 2025
Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity
ICLR 2025
Retrieval Augmented Zero-Shot Enzyme Generation for Specified Substrate
ICML 2025
Sketch to Adapt: Fine-Tunable Sketches for Efficient LLM Adaptation
ICML 2025
Position: Iterative Online-Offline Joint Optimization is Needed to Manage Complex LLM Copyright Risks
ICML 2025
ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Party LLM Data Valuation
NAACL 2025
LLMs and Copyright Risks: Benchmarks and Mitigation Approaches
NAACL 2025
Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory
UAI 2025
Do LLMs Know to Respect Copyright Notice?
EMNLP 2024
ScaleLLM: A Resource-Frugal LLM Serving Framework by Optimizing End-to-End Efficiency
EMNLP 2024
TensorOpera Router: A Multi-Model Router for Efficient LLM Inference
EMNLP 2024
QUEST: Efficient Extreme Multi-Label Text Classification with Large Language Models on Commodity Hardware
EMNLP 2024
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
EMNLP 2024
In Defense of Structural Sparse Adapters for Concurrent LLM Serving
EMNLP 2024
Knowledge Graphs Can be Learned with Just Intersection Features
ICML 2024
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
ICML 2024
TVE: Learning Meta-attribution for Transferable Vision Explainer
ICML 2024
Soft Prompt Recovers Compressed LLMs, Transferably
ICML 2024
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization
NIPS 2024
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs
NIPS 2024
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention
NIPS 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
NIPS 2024
GNNs Also Deserve Editing, and They Need It More Than Once
ICML 2024
Token-wise Influential Training Data Retrieval for Large Language Models
ACL 2024
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
NIPS 2023
One-Pass Distribution Sketch for Measuring Data Heterogeneity in Federated Learning
NIPS 2023
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
NIPS 2023
A Tale of Two Efficient Value Iteration Algorithms for Solving Linear MDPs with Large Action Space
AISTATS 2023
Graph Self-supervised Learning via Proximity Distribution Minimization
UAI 2023
DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks
ICML 2022
Structural Contrastive Representation Learning for Zero-shot Multi-label Text Classification
EMNLP 2022
Locality Sensitive Teaching
NIPS 2021
Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures
NIPS 2021
MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training
ICLR 2021
Raw Nav-merge Seismic Data to Subsurface Properties with MLP based Multi-Modal Information Unscrambler
NIPS 2021
On Efficient Retrieval of Top Similarity Vectors
EMNLP 2019
On Efficient Retrieval of Top Similarity Vectors
IJCNLP 2019
MΓΆbius Transformation for Fast Inner Product Search on Graph
NIPS 2019