One paper that attacks the memory wall in deploying visual autoregressive models on edge GPUs has been accepted to ICPR 2026. Please check the paper here.