Fine-tuning vision-language models on memory-constrained devices
Source: Amazon Science
Overview
A new hybrid optimization approach allows edge devices to fine‑tune vision‑language models using only forward passes, achieving up to 7% higher accuracy than existing techniques.