Conversation

Replying to
This is inherently a change management issue. and lack of a rollback plan that should have kicked in the second things started to unravel, and rollback prior to most waking up, and nobody would have noticed.
1
3
Replying to and
Was the outage caused by a error in the configuration of BGP that was pushed out throughout their system? If so isn't 15+ hrs a long time to remedy this type of mistake? When this type of mistake happened to Facebook the outage I believe did not last this long.
2
5