Co-occurring keywords
Papers
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
ICCV 2025
AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations
COLING 2025
Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation
ICCV 2025