Papers

5,479 papers found
2026 AAAI
OTARo: Once Tuning for All Precisions Toward Robust On-Device LLMs
Shaoyuan Chen, Zhixuan Chen, Dawei Yang et al.
2026 AAAI
2026 AAAI
2026 AAAI
2026 AAAI
FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching
Hongyaoxing Gu, Lijuan Hu, Shuzi Niu et al.
2026 AAAI
2026 AAAI