Yuhui Xu
9 papers · 2020–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (5) π Academic Marathon (5) π Cross-Pollinator (11)
πΊοΈ
Taxonomy Completionist
(15)
Conferences
ICLR (3)
ACL (2)
ICML (2)
AAAI (1)
IJCAI (1)
Top co-authors
Keywords
model compression
(3)
deployment efficiency
(2)
neural network pruning
(1)
neural architecture search
(1)
low-rank approximation
(1)
weight sharing
(1)
rank correlation
(1)
parameter-efficient fine-tuning
(1)
low-rank adaptation
(1)
parameter efficient fine-tuning
(1)
efficient deployment
(1)
parameter efficiency
(1)
inference speed
(1)
graph convolutional network
(1)
stochastic sub-gradient descent
(1)
expert pruning
(1)
once-for-all training
(1)
low-rank adapter
(1)
large language model quantization
(1)
large language model
(1)
Papers
One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments
ACL 2025
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
ICML 2025
ThinK: Thinner Key Cache by Query-Driven Pruning
ICLR 2025
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
ICML 2024
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
ACL 2024
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
ICLR 2024
Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks
AAAI 2021
PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search
ICLR 2020
TRP: Trained Rank Pruning for Efficient Deep Neural Networks
IJCAI 2020