Conversation

Something interesting: it seems Allspaw and his team use outage incidents (think: the Facebook outage recently) as opportunities to share tacit knowledge within the org. Because as you scale, the opportunity to share stories about internal quirks or systems vanishes.
Embedded video
1:30
98 views
Replying to
(I don’t mean the Facebook outage specifically, just the Facebook outage as the kind of software systems outage that Allspaw et al are interested in).
1
One question that will tell you that someone actually does resilience engineering: “I would ask them how easy is it for 50% of their company to stop what they were doing and work on something else (…) it’s not adaptation that matters, but capacity for adaptation without drag”
Embedded video
1:24
203 views
1
6