Co-occurring keywords
Papers
KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference
EACL 2026
Transferable Backdoor Attacks for Code Models via Sharpness-Aware Adversarial Perturbation
AAAI 2026
Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision Transformers
AAAI 2026
ReLUPruner: Rethinking ReLU Importance with Taylor Expansion for Efficient Private Inference
AAAI 2026
Direction Sensitivity–Based Knowledge Distillation: Optimization-Aware Low-Rank Knowledge Transfer
AAAI 2026
Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation
AAAI 2026