2026 05 Arxiv
Two papers are released on arxiv. RDKV proposes a rate-distortion bit allocation method for KV cache compression. In this paper, we differentiate the compression-tolerant components from the sensitive components and propose a joint token eviction and quantization method. In Beyond GSD-as-Token, we propose a parameter-efficient fine-tuning framework remote sensing VLMs.