A more detailed description of last weekend’s massive GCP outage is on the blog - there’s so much I winced at I can’t even highlight specific interesting lines. https://status.cloud.google.com/incident/cloud-networking/19009 … Configuration change. Automation making things worse. And the postmortem isn’t even fully done
-
-
beyond just our software; our deployments, changes, tools, operational processes, and we as engineers, are restricted from working on multiple regions at once.
-
That sounds crazy from a SMB perspective, but that's what scale enables. But at least some kind of cooperation and knowledge sharing is done across regions?
- 2 more replies
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.
Customers can communicate between regions if they want to, but AWS teams and services can not. It's a hard wall with precious few exceptions, like some controlled asynchronous replication.