How Learning is Different Than Fixing

Posted Posted in ACL Posts

I was honored to present a talk at the AllTheTalks conference a few weeks back. tl;dr: slides are here, video and transcript is below The topic was incident analysis (big surprise there!) and the notion of learning and fixing, and how these activities are related but not the same. A key idea here is that rather than focusing on simply focusing on […]

The Negotiability of “Severity” Levels

Posted Posted in ACL Posts, Incident Analysis

What does the term severity mean, in the context of incidents involving software systems? Merriam-Webster gives us this: “the quality or state of being severe: the condition of being very bad, serious, unpleasant, or harsh.” Here are a few colloquial definitions: “Severity measures the effort and expense required by the service provider to manage and resolve an […]

Hindsight and Sacrifice Decisions

Posted Posted in ACL Posts, Uncategorized

A few weeks ago I tweeted this thread which references sacrifice decisions and contrasts some facets of the Knight Capital (2012) case and the NYSE trading halt (2015) case: On Aug 1, 2012, a company named Knight Capital experienced a business-destroying incident. Much has been written about it, but that’s not the topic of this thread. […]