[Paper] Modular Foundation Model Inference at the Edge: Network-Aware Microservice Optimization
Foundation models (FMs) unlock unprecedented multimodal and multitask intelligence, yet their cloud-centric deployment precludes real-time responsiveness and co...