[Paper] Optimizing Resource Allocation for Geographically-Distributed Inference by Large Language Models
Large language models have demonstrated extraordinary performance in many AI tasks but are expensive to use, even after training, due to their requirement of hi...