Both public clouds and our hyperscale private cloud have evolved into complex infrastructures with millions of servers spanning numerous global data center regions. Leaving users to manage the complexity of deploying global services across regions incurs significant operational toil and often leads to suboptimal outcomes. Users must select regions, align global traffic routing with service deployments, and ensure disaster preparedness, all while optimizing cost. They must repeat these processes continually to adapt to workload changes. To eliminate these manual burdens, we introduce the Global Service Placer (GSP), which autonomously places services across regions based on user-defined latency SLOs, abstracting away region details such as geo-distribution and resource availability from users while optimizing efficiency. The generality and efficacy of GSP have been demonstrated by onboarding a diverse set of large, complex services. We provide a case study of a highly complex AI inference workload and show significant GPU savings.
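The abstract does not specify GSP's placement algorithm; as a hedged illustration of the core idea (choosing regions that satisfy a latency SLO while minimizing cost), the following is a minimal greedy set-cover sketch. All names (`place_service`, the region and user tables) are hypothetical, not part of GSP.

```python
# Hypothetical sketch, NOT GSP's actual algorithm: pick a cheap set of regions
# so that every user population has at least one replica within the latency SLO.
# regions: {region_name: relative_cost}
# users:   {population: {region_name: latency_ms}}

def place_service(regions, users, slo_ms):
    """Greedy weighted set cover over regions, constrained by the latency SLO."""
    uncovered = set(users)
    placement = []
    while uncovered:
        # Score each region by uncovered populations served per unit cost.
        def score(region):
            covered = sum(
                1 for u in uncovered
                if users[u].get(region, float("inf")) <= slo_ms
            )
            return covered / regions[region] if covered else 0.0

        best = max(regions, key=score)
        if score(best) == 0:
            raise ValueError(f"SLO unsatisfiable for: {sorted(uncovered)}")
        placement.append(best)
        uncovered -= {
            u for u in uncovered
            if users[u].get(best, float("inf")) <= slo_ms
        }
    return placement
```

A usage example: with three regions and a 50 ms SLO, the greedy loop adds regions until every population is covered, preferring cheap regions that cover many populations.

```python
regions = {"us-east": 3.0, "eu-west": 2.0, "ap-south": 2.5}
users = {
    "ny": {"us-east": 20, "eu-west": 80},
    "paris": {"eu-west": 15, "us-east": 85},
    "mumbai": {"ap-south": 25},
}
print(sorted(place_service(regions, users, slo_ms=50)))
```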