Co-occurring keywords
Papers
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
ACL 2025
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
COLING 2025
Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
EMNLP 2025
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
EMNLP 2025
GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression
EMNLP 2025
Propulsion: Steering LLM with Tiny Fine-Tuning
COLING 2025