2025 05 Icml

Two papers on model quantization are accepted to ICML 2025! IntLoRA proposes an integral low-rank adaption method for quantized diffusion models. After low-rank adaptation, all the weights are converted to integers. SliM-LLM proposes a mixed precision quantization method for LLMs.