Co-occurring keywords
Papers
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
EMNLP 2025
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
AAAI 2025